Advances in machine vision systems have revolutionized applications such as autonomous driving, robotic navigation, and augmented reality. Despite substantial progress, challenges persist, including dynamic backgrounds, occlusion, and limited labeled data. To address these challenges, we introduce a comprehensive methodology to enhance image classification and object detection accuracy. The proposed approach integrates multiple methods in a complementary way. The process begins with the application of Gaussian filters to mitigate the impact of noise. The images are then segmented using Fuzzy C-Means segmentation, in parallel with saliency mapping techniques that find the most prominent regions. Binary Robust Independent Elementary Features (BRIEF) are then extracted from the saliency maps and segmented images. For precise object separation, Oriented FAST and Rotated BRIEF (ORB) algorithms are employed. Genetic Algorithms (GAs) are used to optimize the Random Forest classifier parameters, which leads to improved performance. Our method stands out due to its comprehensive approach, addressing changing backdrops, occlusion, and limited labeled data concurrently. A significant enhancement has been achieved by integrating GAs to precisely optimize parameters. This adjustment not only boosts the uniqueness of our system but also amplifies its overall efficacy. The proposed methodology has demonstrated classification accuracies of 90.9% and 89.0% on the challenging Corel-1k and MSRC datasets, respectively. Furthermore, detection accuracies of 87.2% and 86.6% have been attained. Although our method performed well on both datasets, it may face difficulties on real-world data, especially where datasets have highly complex backgrounds. Despite these limitations, GA integration for parameter optimization shows a notable strength in enhancing the overall adaptability and performance of our system.
Remote sensing imagery, because it is captured at high altitude, presents inherent challenges: multiple scales, limited target areas, and intricate backgrounds. These traits often lead to increased missed-detection and false-detection rates when applying object recognition algorithms to remote sensing imagery. These complexities also contribute to inaccuracies in target localization and hinder precise target categorization. This paper addresses these challenges by proposing the YOLO-MFD model (YOLO-MFD: Remote Sensing Image Object Detection with Multi-scale Fusion Dynamic Head). Before presenting our method, we examine the prevalent issues in remote sensing imagery analysis. Specifically, we emphasize the struggles of existing object recognition algorithms in comprehensively capturing critical image features amidst varying scales and complex backgrounds. To resolve these issues, we introduce a novel approach. First, we propose a lightweight multi-scale module called CEF. This module significantly improves the model's ability to capture important image features by merging multi-scale feature information, and it addresses the missed detections and false alarms that are common in remote sensing imagery. Second, an additional layer of small-target detection heads is added, and a residual link is established with the higher-level feature extraction module in the backbone. This allows the model to incorporate shallower information, significantly improving the accuracy of target localization in remotely sensed images. Finally, a dynamic head attention mechanism is introduced, which allows the model to exhibit greater flexibility and accuracy in recognizing shapes and targets of different sizes. Consequently, the precision of object detection is significantly improved. The experimental results show that the YOLO-MFD model improves on the original YOLOv8 model by 6.3%, 3.5%, and 2.5% in Precision, mAP@0.5, and mAP@0.5:0.95, respectively. These results illustrate the clear advantages of the method.
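The mAP@0.5 and mAP@0.5:0.95 figures quoted above rest on the intersection-over-union (IoU) between a predicted box and a ground-truth box. The following is a minimal sketch of that building block, not code from the YOLO-MFD paper; the function names and the corner-coordinate box format `(x1, y1, x2, y2)` are assumptions made for illustration.

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def matches_at(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: a prediction counts as a hit when IoU >= threshold."""
    return iou(pred_box, gt_box) >= threshold


if __name__ == "__main__":
    pred, gt = (0, 0, 2, 2), (1, 1, 3, 3)
    print(iou(pred, gt), matches_at(pred, gt))
```

mAP@0.5 uses the single threshold 0.5, while mAP@0.5:0.95 averages over thresholds from 0.5 to 0.95, so the second metric rewards tighter localization.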
Accurately identifying small objects in high-resolution aerial images is a complex and crucial task in the field of small object detection on unmanned aerial vehicles (UAVs). The task is challenging due to variations in UAV flight altitude, differences in object scales, and factors such as flight speed and motion blur. To enhance the detection of small targets in drone aerial imagery, we propose an enhanced You Only Look Once version 7 (YOLOv7) algorithm based on multi-scale spatial context. We build the MSC-YOLO model, which incorporates an additional prediction head, denoted P2, to improve adaptability for small objects. We replace conventional downsampling with a Spatial-to-Depth Convolutional Combination (CSPDC) module to mitigate the loss of intricate feature details related to small objects. Furthermore, we propose a Spatial Context Pyramid with Multi-Scale Attention (SCPMA) module, which captures spatial and channel-dependent features of small targets across multiple scales. This module enhances the perception of spatial contextual features and the utilization of multi-scale feature information. On the Visdrone2023 and UAVDT datasets, MSC-YOLO outperforms the baseline YOLOv7 by 3.0% in terms of mean average precision (mAP). The MSC-YOLO algorithm proposed in this paper has demonstrated satisfactory performance in detecting small targets in UAV aerial photography, providing strong support for practical applications.
Video salient object detection (VSOD) aims at locating the most attractive objects in a video by exploring spatial and temporal features. VSOD is a challenging task in computer vision, as it involves processing complex spatial data that is also influenced by temporal dynamics. Despite the progress made by existing VSOD models, they still struggle in scenes with great background diversity within and between frames. They also encounter difficulties related to accumulated noise and high time consumption when extracting temporal features over a long duration. We propose a multi-stream temporal enhanced network (MSTENet) to address these problems. It investigates the collaboration of saliency cues in the spatial domain with a multi-stream structure to deal with the background diversity challenge. A straightforward yet efficient approach to temporal feature extraction is developed to avoid accumulated noise and reduce time consumption. MSTENet differs from other VSOD methods in that it incorporates both foreground supervision and background supervision, facilitating enhanced extraction of collaborative saliency cues. Another notable difference is the integration of spatial and temporal features: the temporal module is integrated into the multi-stream structure, enabling comprehensive spatial-temporal interactions within an end-to-end framework. Extensive experimental results demonstrate that the proposed method achieves state-of-the-art performance on five benchmark datasets while maintaining a real-time speed of 27 fps (Titan XP). Our code and models are available at https://github.com/RuJiaLe/MSTENet.
We investigate the distributed optimization problem, where a network of nodes works together to minimize a global objective that is a finite sum of their locally stored functions. Since nodes exchange optimization parameters through the wireless network, large-scale training models can create communication bottlenecks, resulting in slower training. To address this issue, CHOCO-SGD was proposed, which allows compressing information with arbitrary precision without reducing the convergence rate for strongly convex objective functions. Nevertheless, most convex functions are not strongly convex (such as logistic regression or Lasso), which raises the question of whether this algorithm can be applied to non-strongly convex functions. In this paper, we provide the first theoretical analysis of the convergence rate of CHOCO-SGD on non-strongly convex objectives. We derive a sufficient condition, which limits the fidelity of compression, to guarantee convergence. Moreover, our analysis demonstrates that within the fidelity threshold, this algorithm can significantly reduce the transmission burden while maintaining the same convergence rate order as its no-compression equivalent. Numerical experiments further validate the theoretical findings by demonstrating that CHOCO-SGD improves communication efficiency while keeping the same convergence rate order. The experiments also show that the algorithm fails to converge with low compression fidelity and in time-varying topologies. Overall, our study offers valuable insights into the potential applicability of CHOCO-SGD to non-strongly convex objectives, and we provide practical guidelines for researchers seeking to use this algorithm in real-world scenarios.
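To make "compression fidelity" concrete, here is a minimal sketch of one commonly used compressor, top-k sparsification, which keeps the k largest-magnitude coordinates of a vector and zeroes the rest. The abstract does not say which compressor the authors study, so the choice of top-k, the function names, and the reading of fidelity as the fraction k/d are all assumptions for illustration.

```python
def top_k(vec, k):
    """Keep the k largest-magnitude entries of vec; set the others to zero."""
    order = sorted(range(len(vec)), key=lambda i: abs(vec[i]), reverse=True)
    keep = set(order[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(vec)]


def relative_error(vec, k):
    """Squared compression error ||Q(x) - x||^2 divided by ||x||^2."""
    compressed = top_k(vec, k)
    err = sum((c - v) ** 2 for c, v in zip(compressed, vec))
    norm = sum(v ** 2 for v in vec)
    return err / norm if norm > 0 else 0.0


if __name__ == "__main__":
    x = [0.1, -3.0, 2.0, 0.5]
    print(top_k(x, 2))           # only two of four coordinates are transmitted
    print(relative_error(x, 2))  # bounded by 1 - k/d for top-k
```

For top-k the relative error never exceeds 1 - k/d, so a smaller k means less traffic per round but a looser bound, which is the kind of trade-off a fidelity threshold constrains.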
Agriculture, which occupies nearly one-third of the earth's total surface, plays a vital role in food production. Rice is propagated from paddy seeds and is a staple food for almost fifty percent of the world's population. The rapid growth of the human population makes food security a pressing concern, and countries should take proper steps to improve the yield of food grains. This paper concentrates on improving paddy yield by predicting the factors that influence paddy growth with the help of Evolutionary Computation Techniques. Most researchers rely on historical records of meteorological parameters to predict paddy yield. There is a lack of analysis of the day-to-day impact on paddy cultivation of meteorological parameters such as wind direction, relative humidity, and instant wind speed. Real-time meteorological data were collected, and the impact of weather parameters was analyzed as a regular time series from the day of sowing to the last day of harvesting. A Robust Optimized Artificial Neural Network (ROANN) algorithm with Genetic Algorithm (GA) and Multi-Objective Particle Swarm Optimization (MOPSO) is proposed to predict the factors that farmers should concentrate on to improve paddy yield. Real-time paddy data were collected from farmers in Tamil Nadu, and the meteorological parameters were matched with the farmers' cropping patterns to construct the database. The input parameters were optimized using either GA or MOPSO to reconstruct the database. The reconstructed database was then optimized using the Artificial Neural Network back-propagation algorithm. The factors that improve paddy growth were identified from the output of the neural network. Performance metrics such as accuracy and error rate were used to measure the performance of the proposed algorithm. A comparative analysis was made between ANN with GA and ANN with MOPSO to identify recommendations for improving paddy yield.
To improve customer satisfaction in the cold chain logistics of fresh agricultural goods enterprises and to reduce the comprehensive distribution cost (composed of fixed cost, transportation cost, cargo damage cost, refrigeration cost, and time penalty cost), a multi-objective path optimization model for fresh agricultural products distribution that considers customer satisfaction is constructed. The model is solved using an enhanced Elitist Non-dominated Sorting Genetic Algorithm (NSGA-II), in which differential evolution is incorporated into the evolution operator. The revised algorithm produces a better Pareto optimal solution set, efficiently balances the relationship between customer satisfaction and cost, and serves as a reference for the long-term growth of organizations.
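The "Pareto optimal solution set" mentioned above is the set of candidate routes that no other candidate beats on both objectives at once. The sketch below shows only that non-dominated filtering step, not NSGA-II itself and not the authors' enhanced variant; the function names and the sample (cost, satisfaction) values are made up for illustration. Satisfaction is negated so that both objectives are minimized.

```python
def dominates(a, b):
    """True if objective vector a is no worse than b everywhere and better somewhere.

    All objectives are assumed to be minimized.
    """
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(points):
    """Return the points that are not dominated by any other point."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


if __name__ == "__main__":
    # Hypothetical candidates as (distribution cost, -customer satisfaction).
    candidates = [(100, -0.90), (120, -0.95), (130, -0.90), (90, -0.70)]
    for cost, neg_sat in pareto_front(candidates):
        print(f"cost={cost}, satisfaction={-neg_sat}")
```

Here the candidate with cost 130 is dropped because another route reaches the same satisfaction at cost 100; the remaining three each represent a different cost-satisfaction trade-off.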
A multi-objective linear programming problem is formulated from a fuzzy linear programming problem, because the fuzzy programming method is used during the solution. A multi-objective linear programming problem can be converted into a single-objective problem by various methods, such as Chandra Sen's method, the weighted sum method, the ranking function method, and the statistical averaging method. In this paper, both Chandra Sen's method and the statistical averaging method are used to form a single objective function from the multiple objectives. Two multi-objective programming problems are solved to verify the result: one is a numerical example and the other is a real-life example. The problems are then solved by the ordinary simplex method and by the fuzzy programming method. The fuzzy programming method gives better optimal values than the ordinary simplex method.
To study the effect of the selection method in a multi-objective genetic algorithm on the optimization result, the method is analyzed theoretically and discussed using an autonomous underwater vehicle (AUV) as the object. A changing weight value method is put forward and a selection formula is modified. Experiments were carried out on an AUV, TwinBurger. The results show that this method is effective and feasible.
The marine environment is becoming increasingly complex due to the variety of marine vehicles, and the diversity of maritime objects poses a challenge to marine environmental governance. Maritime object detection technology plays an important role here. In the field of computer vision, there is no sufficiently comprehensive public dataset for maritime objects, in contrast to the automotive application domain. The existing maritime datasets either have no bounding boxes (they are made for object classification) or cover limited varieties of maritime objects. To fill this gap, this paper proposes the Multi-Category Large-Scale Dataset for Maritime Object Detection (MCMOD), which is collected by 3 onshore video cameras that capture data under various environmental conditions such as fog, rain, and evening light. The whole dataset consists of 16,166 labelled images with 98,590 maritime objects, which are classified into 10 classes. Compared with the existing maritime datasets, MCMOD contains a relatively balanced quantity of objects of different sizes (in the view). To evaluate MCMOD, this paper applies several state-of-the-art object detection approaches from computer vision research to it and compares their performance. A comparison between MCMOD and an existing maritime dataset was also conducted. Experimental results indicate that the proposed dataset classifies more types of maritime objects and covers more small-scale objects, which helps trained detectors recognize more types of maritime objects and detect them over a relatively long distance. The results also show that the adopted approaches need further improvement to enhance their capabilities in the maritime domain.
The northeastern China cold vortex (NCCV) plays an important role in regional rainstorms over East Asia. Using the National Centers for Environmental Prediction Final reanalysis dataset and the Global Precipitation Measurement product, an objective algorithm for identifying heavy-precipitation NCCV (HPCV) events was designed, and the climatological features of 164 HPCV events from 2001 to 2019 were investigated. The number of HPCV events showed an upward linear trend, with the highest frequency of occurrence in summer. The most active region of HPCV samples was the Northeast China Plain between 40°–55°N. Most HPCV events lasted 3–5 days and had radii ranging from 250 to 1000 km. The duration of HPCV events with larger sizes was longer. About half of the HPCV events moved into (moved out of) the definition region (35°–60°N, 115°–145°E), and half initiated (dissipated) within the region. The initial position was close to the western boundary of the definition region, and the final position was mainly near the eastern boundary. The locations associated with the precipitation were mostly concentrated within 2000 km southeast of the HPCV systems, and they were farther from the center in the cold season than in the warm season.
Design change is an inevitable part of the product development process. This study proposes an improved binary multi-objective PSO algorithm guided by problem characteristics (P-BMOPSO) to solve the optimisation problem of complex product change plans considering service performance. Firstly, a complex product multi-layer network with service performance is established for the first time to reveal the impact of change effect propagation on the product service performance. Secondly, the concept of service performance impact (SPI) is defined by decoupling the impact of strongly associated nodes on the service performance in the process of change effect propagation. Then, a triple-objective selection model of change nodes is established, which includes three indicators: SPI degree, change cost, and change time. Furthermore, an integer multi-objective particle swarm optimisation algorithm guided by problem characteristics is developed to solve the model. Experimental results on the design change problem of a certain type of Skyworth TV verify the effectiveness of the established optimisation model and the proposed P-BMOPSO algorithm.
To address the sensitivity of a single correlation filter model to complex scenes such as background interference and occlusion, a tracking algorithm based on multi-time-space perception and instance-specific proposals is proposed to optimize the mathematical model of the correlation filter (CF). Firstly, according to the consistency of the changes between the object frames and the filter frames, a mask matrix is introduced into the objective function of the filter, so as to extract the spatio-temporal information of the object with background awareness. Secondly, an objective function with multi-feature fusion is constructed for object localization, which is optimized by the Lagrange method and solved by closed-form iteration. During filter optimization, a time-space perception constraint term is designed to enhance the learning ability of the CF and improve the final tracking results. Finally, when the tracking results fluctuate, a boundary suppression factor is introduced into the instance-specific proposals to reduce the risk of model drift. The accuracy and success rate of the proposed algorithm are verified by simulation analysis on two popular benchmarks, the object tracking benchmark 2015 (OTB2015) and the temple color 128 (TC-128). Extensive experimental results illustrate that the optimized appearance model of the proposed algorithm is effective. The distance precision rate and overlap success rate of the proposed algorithm are 0.756 and 0.656 on the OTB2015 benchmark, which are better than the results of other competing algorithms. The results of this study can help solve the problem of real-time object tracking in real traffic environments and provide a specific reference for the detection of traffic abnormalities.
Characterizing the integrity and fineness of non-connected regions and contours is a major challenge for existing salient object detection methods. The key is how to make full use of the subjective and objective structural information obtained in different steps. Therefore, by simulating the human visual mechanism, this paper proposes a novel multi-decoder matching correction network and a subjective structural loss. Specifically, the loss pays different attention to the foreground, boundary, and background of the ground truth map in a top-down structure. The perceived saliency is mapped to the corresponding objective structure of the prediction map, which is extracted in a bottom-up manner. Thus, multi-level salient features can be effectively detected with the loss as a constraint. Then, through the mapping of an improved binary cross entropy loss, the differences between salient regions and objects are checked so that attention is paid to error-prone regions, achieving excellent error sensitivity. Finally, by tracking the identifying feature horizontally and vertically, the subjective and objective interaction is maximized. Extensive experiments on five benchmark datasets demonstrate that, compared with 12 state-of-the-art methods, the algorithm has higher recall and precision, less error, and strong robustness and generalization ability, and can predict complete and refined saliency maps.
With the advent of real-time applications such as autonomous driving, visual surveillance, and sports analysis, Multiple-Object Tracking (MOT) is receiving increasing attention. The tracking-by-detection paradigm, a commonly used approach, links current detection hypotheses to previously estimated object trajectories by comparing the similarities of their appearance or motion. For efficient detection and tracking of numerous objects in a complex environment, a Pearson Similarity-centred Kuhn-Munkres (PS-KM) algorithm is proposed in the present study. The input videos were first gathered from the MOT dataset and converted into frames. After frame conversion, background subtraction was applied to filter out irrelevant data from the frames. Features were then extracted from the frames. Afterwards, the higher-dimensional features were transformed into lower-dimensional features through feature reduction with Information Gain-centred Singular Value Decomposition (IG-SVD). Next, classification was performed using the Modified Recurrent Neural Network (MRNN) method, which also identified the categories of the objects. The PS-KM algorithm then tracked the recognized objects. Finally, the experimental outcomes showed that numerous targets were precisely tracked by the proposed system with 97% accuracy and a low false positive rate (FPR) of 2.3%. The proposed technique was also shown to be effective in comparison with existing models, viz. RNN, CNN, and KNN.
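The association step described above can be illustrated with a small sketch: score each track-detection pair by the Pearson correlation of their feature vectors, then choose the one-to-one assignment with the highest total score. This is not the authors' PS-KM implementation. For brevity the sketch replaces the Kuhn-Munkres algorithm with an exhaustive search over permutations, which finds the same optimum but is only practical for a handful of objects; the function names and feature vectors are hypothetical.

```python
from itertools import permutations
from math import sqrt


def pearson(u, v):
    """Pearson correlation coefficient of two equal-length feature vectors."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    cov = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    std_u = sqrt(sum((a - mean_u) ** 2 for a in u))
    std_v = sqrt(sum((b - mean_v) ** 2 for b in v))
    return cov / (std_u * std_v) if std_u > 0 and std_v > 0 else 0.0


def best_assignment(tracks, detections):
    """Return a list where entry i is the detection index assigned to track i.

    Exhaustive search stands in for Kuhn-Munkres here (an assumption made
    to keep the example short); both maximize total similarity.
    """
    n = len(tracks)
    if len(detections) != n:
        raise ValueError("this sketch assumes equally many tracks and detections")

    def total(perm):
        return sum(pearson(tracks[i], detections[perm[i]]) for i in range(n))

    return list(max(permutations(range(n)), key=total))


if __name__ == "__main__":
    tracks = [[1, 2, 3, 4], [4, 3, 2, 1]]
    detections = [[8, 6, 4, 2], [2, 4, 6, 8.5]]
    print(best_assignment(tracks, detections))
```

In the example, the first track's features rise and the second's fall, so each is matched to the detection whose features follow the same trend.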
A multi-objective optimization method based on the Pareto Genetic Algorithm is presented for the shape design of membrane structures from a structural viewpoint. Several non-dimensional variables are defined as optimization variables, which are decision factors for the shapes of membrane structures. Three objectives are proposed: maximum stiffness, maximum uniformity of stress, and minimum reaction under external loads. The Pareto Multi-objective Genetic Algorithm is introduced to find the Pareto solutions. The dependence of the optimality on the optimization variables is then derived to provide guidelines on how to determine design parameters. Several examples illustrate the proposed methods and applications. The study shows that the multi-objective optimization method in this paper is feasible and efficient for membrane structures; the research on Pareto solutions can provide explicit and useful guidelines for the shape design of membrane structures.
The data analysis of blasting sites has long been a research goal in the field. The rise of mobile blasting robots has aroused many researchers' interest in machine learning methods for target detection in the field of blasting. Serverless computing can provide a variety of computing services for people without hardware resources or extensive software development experience, which has aroused interest in how to use it for machine learning. In this paper, we design a distributed machine learning training application based on the AWS Lambda platform. Based on data parallelism, data aggregation and training synchronization in Function as a Service (FaaS) are effectively realized. The application also encrypts the dataset, reducing the risk of data leakage. We rent a cloud server and a Lambda instance and conduct experiments to evaluate our application. Our results indicate the effectiveness, speed, and economy of distributed training on FaaS.
By avoiding or reducing the production of waste, waste minimization is an effective approach to solving the pollution problem in the chemical industry. Process integration supported by multi-objective optimization provides a framework for process design or process retrofit by simultaneously optimizing environmental and economic aspects. A multi-objective genetic algorithm is applied in this area as the solution approach for the multi-objective optimization problem.
Funding (GA-optimized Random Forest classification and detection): a grant from the Basic Science Research Program through the National Research Foundation (NRF) (2021R1F1A1063634), funded by the Ministry of Science and ICT (MSIT), Republic of Korea. This research is supported and funded by the Princess Nourah bint Abdulrahman University Researchers Supporting Project, number PNURSP2024R410, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. The authors are thankful to the Deanship of Scientific Research at Najran University for funding this work under the Research Group Funding program, Grant Code NU/RG/SERC/12/6.
Funding (YOLO-MFD): the Scientific Research Fund of Hunan Provincial Education Department (23A0423).
Abstract: Remote sensing imagery, because it is captured from high altitude, presents inherent challenges: multiple scales, limited target areas, and intricate backgrounds. These traits often lead to increased miss and false detection rates when object recognition algorithms are applied to remote sensing imagery. They also contribute to inaccurate target localization and hinder precise target categorization. This paper addresses these challenges by proposing the YOLO-MFD model (YOLO-MFD: Remote Sensing Image Object Detection with Multi-scale Fusion Dynamic Head). Before presenting our method, we examine the prevalent issues in remote sensing imagery analysis. Specifically, we emphasize the struggles of existing object recognition algorithms in comprehensively capturing critical image features amid varying scales and complex backgrounds. To resolve these issues, we introduce a novel approach. First, we propose a lightweight multi-scale module called CEF. This module significantly improves the model's ability to capture important image features by merging multi-scale feature information. It effectively addresses the missed detections and false alarms that are common in remote sensing imagery. Second, an additional layer of small-target detection heads is added, and a residual link is established with the higher-level feature extraction module in the backbone. This allows the model to incorporate shallower information, significantly improving the accuracy of target localization in remotely sensed images. Finally, a dynamic head attention mechanism is introduced. This gives the model greater flexibility and accuracy in recognizing shapes and targets of different sizes. Consequently, the precision of object detection is significantly improved. Experimental results show that the YOLO-MFD model improves on the original YOLOv8 model by 6.3%, 3.5%, and 2.5% in Precision, mAP@0.5, and mAP@0.5:0.95, respectively. These results illustrate the clear advantages of the method.
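The mAP@0.5 figure reported above depends on an intersection-over-union (IoU) test between predicted and ground-truth boxes. The following is a minimal sketch of that test, not the authors' evaluation code; the `(x1, y1, x2, y2)` box layout and the function names are assumptions made for illustration.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """A prediction counts as a match at mAP@0.5 when IoU >= 0.5."""
    return iou(pred_box, gt_box) >= threshold
```

mAP@0.5:0.95 repeats the same test at thresholds from 0.5 to 0.95 in steps of 0.05 and averages the resulting average-precision values.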
Funding: The Key Research and Development Program of Hainan Province (Grant Nos. ZDYF2023GXJS163, ZDYF2024GXJS014); National Natural Science Foundation of China (NSFC) (Grant Nos. 62162022, 62162024); the Major Science and Technology Project of Hainan Province (Grant No. ZDKJ2020012); Hainan Provincial Natural Science Foundation of China (Grant No. 620MS021); Youth Foundation Project of Hainan Natural Science Foundation (621QN211).
Abstract: Accurately identifying small objects in high-resolution aerial images is a complex and crucial task in small object detection on unmanned aerial vehicles (UAVs). The task is challenging because of variations in UAV flight altitude, differences in object scale, and factors such as flight speed and motion blur. To enhance the detection of small targets in drone aerial imagery, we propose an enhanced You Only Look Once version 7 (YOLOv7) algorithm based on multi-scale spatial context. We build the MSC-YOLO model, which incorporates an additional prediction head, denoted P2, to improve adaptability to small objects. We replace conventional downsampling with a Spatial-to-Depth Convolutional Combination (CSPDC) module to mitigate the loss of fine feature details related to small objects. Furthermore, we propose a Spatial Context Pyramid with Multi-Scale Attention (SCPMA) module, which captures spatial and channel-dependent features of small targets across multiple scales. This module enhances the perception of spatial contextual features and the use of multi-scale feature information. On the Visdrone2023 and UAVDT datasets, MSC-YOLO outperforms the baseline method YOLOv7 by 3.0% in terms of mean average precision (mAP). The MSC-YOLO algorithm proposed in this paper demonstrates satisfactory performance in detecting small targets in UAV aerial photography, providing strong support for practical applications.
Funding: Funded by the Natural Science Foundation China (NSFC) under Grant No. 62203192.
Abstract: Video salient object detection (VSOD) aims to locate the most attractive objects in a video by exploring spatial and temporal features. VSOD is a challenging task in computer vision, as it involves processing complex spatial data that is also influenced by temporal dynamics. Despite the progress made by existing VSOD models, they still struggle in scenes with great background diversity within and between frames. They also suffer from accumulated noise and high time consumption when extracting temporal features over a long duration. We propose a multi-stream temporal enhanced network (MSTENet) to address these problems. It investigates the collaboration of saliency cues in the spatial domain with a multi-stream structure to deal with great background diversity. A straightforward yet efficient approach to temporal feature extraction is developed to avoid accumulated noise and reduce time consumption. MSTENet differs from other VSOD methods in that it incorporates both foreground supervision and background supervision, which facilitates the extraction of collaborative saliency cues. Another notable difference is its integration of spatial and temporal features: the temporal module is integrated into the multi-stream structure, enabling comprehensive spatial-temporal interactions within an end-to-end framework. Extensive experimental results demonstrate that the proposed method achieves state-of-the-art performance on five benchmark datasets while maintaining a real-time speed of 27 fps (Titan XP). Our code and models are available at https://github.com/RuJiaLe/MSTENet.
Funding: Supported in part by the Shanghai Natural Science Foundation under Grant 22ZR1407000.
Abstract: We investigate the distributed optimization problem, in which a network of nodes works together to minimize a global objective that is a finite sum of their locally stored functions. Since nodes exchange optimization parameters over a wireless network, large-scale training models can create communication bottlenecks, resulting in slower training. To address this issue, CHOCO-SGD was proposed, which allows information to be compressed with arbitrary precision without reducing the convergence rate for strongly convex objective functions. Nevertheless, most convex functions are not strongly convex (such as logistic regression or Lasso), which raises the question of whether this algorithm can be applied to non-strongly convex functions. In this paper, we provide the first theoretical analysis of the convergence rate of CHOCO-SGD on non-strongly convex objectives. We derive a sufficient condition, which limits the fidelity of compression, to guarantee convergence. Moreover, our analysis demonstrates that within the fidelity threshold, the algorithm can significantly reduce the transmission burden while maintaining the same convergence rate order as its no-compression equivalent. Numerical experiments further validate the theoretical findings by demonstrating that CHOCO-SGD improves communication efficiency while keeping the same convergence rate order. The experiments also show that the algorithm fails to converge with low compression fidelity and in time-varying topologies. Overall, our study offers valuable insights into the applicability of CHOCO-SGD to non-strongly convex objectives. Additionally, we provide practical guidelines for researchers seeking to use this algorithm in real-world scenarios.
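To make the notion of compression fidelity concrete, the sketch below shows top-k sparsification, one compression operator commonly paired with compressed-communication SGD. It is an illustrative assumption, not the operator or fidelity measure used in the paper; the function names are hypothetical. Top-k on a d-dimensional vector satisfies ||Q(x) - x||^2 <= (1 - k/d) ||x||^2, so k/d plays the role of a fidelity parameter.

```python
def top_k(vec, k):
    """Keep the k largest-magnitude entries of vec and zero out the rest."""
    order = sorted(range(len(vec)), key=lambda i: abs(vec[i]), reverse=True)
    keep = set(order[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(vec)]


def compression_error_ratio(vec, k):
    """Return ||Q(x) - x||^2 / ||x||^2 for the top-k operator Q."""
    compressed = top_k(vec, k)
    err = sum((a - b) ** 2 for a, b in zip(vec, compressed))
    norm = sum(a * a for a in vec)
    return err / norm if norm > 0 else 0.0
```

A smaller k sends fewer values per round but raises the error ratio, which is the trade-off a fidelity threshold constrains.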
Funding: Support of the RUSA-Phase 2.0 grant sanctioned vide Letter No. F.24-51/2014-U, Policy (TNMulti-Gen), Dept. of Edn., Govt. of India, Dt. 09.10.2018.
Abstract: Agriculture, which occupies nearly one-third of the total surface of the earth, plays a vital role in food production. Rice is propagated from paddy seeds and is a staple food for almost fifty percent of the world's population. The rapid growth of the human population makes food security a pressing concern, and countries should take proper steps to improve the yield of food grains. This paper concentrates on improving the yield of paddy by predicting the factors that influence its growth with the help of Evolutionary Computation Techniques. Most researchers rely on historical records of meteorological parameters to predict paddy yield. The day-to-day impact of meteorological parameters such as wind direction, relative humidity, and instant wind speed on paddy cultivation has rarely been analyzed. Real-time meteorological data were collected, and the impact of weather parameters was analyzed as a regular time series from the day of sowing to the last day of harvesting. A Robust Optimized Artificial Neural Network (ROANN) algorithm with Genetic Algorithm (GA) and Multi Objective Particle Swarm Optimization (MOPSO) is proposed to predict the factors on which farmers should concentrate to improve paddy yield. Real-time paddy data were collected from farmers of Tamilnadu, and the meteorological parameters were matched with the farmers' cropping patterns to construct the database. The input parameters were optimized using either the GA or the MOPSO algorithm to reconstruct the database. The reconstructed database was then optimized using the Artificial Neural Network Back Propagation algorithm. The factors that improve paddy growth were identified from the output of the neural network. Performance metrics such as accuracy and error rate were used to measure the performance of the proposed algorithm. A comparative analysis was made between ANN with GA and ANN with MOPSO to identify the recommendations for improving the paddy yield.
Abstract: To improve customer satisfaction with the cold chain logistics of fresh agricultural goods enterprises and to reduce the comprehensive distribution cost, composed of fixed cost, transportation cost, cargo damage cost, refrigeration cost, and time penalty cost, a multi-objective path optimization model for fresh agricultural product distribution that considers customer satisfaction is constructed. The model is solved using an enhanced Elitist Non-dominated Sorting Genetic Algorithm (NSGA-II) in which differential evolution is incorporated into the evolution operator. The revised algorithm produces a better Pareto-optimal solution set, efficiently balances customer satisfaction against cost, and serves as a reference for the long-term growth of organizations.
Abstract: A multi-objective linear programming problem is formulated from a fuzzy linear programming problem, because the fuzzy programming method is used during the solution. A multi-objective linear programming problem can be converted into a single-objective problem by various methods, such as Chandra Sen's method, the weighted sum method, the ranking function method, and the statistical averaging method. In this paper, Chandra Sen's method and the statistical averaging method are both used to build a single objective function from the multi-objective function. Two multi-objective programming problems are solved to verify the result: one is a numerical example and the other is a real-life example. The problems are then solved by the ordinary simplex method and by the fuzzy programming method. The fuzzy programming method is seen to give better optimal values than the ordinary simplex method.
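The scalarization step can be sketched as follows. Chandra Sen's method, as it is commonly stated, maximizes the sum of the maximization objectives minus the sum of the minimization objectives, each divided by the absolute value of its individually optimal value. The code below only builds the combined coefficient vector; the individual optima are assumed to have been obtained beforehand by solving each objective alone, and the function name and argument layout are hypothetical.

```python
def chandra_sen_combine(max_objs, theta_max, min_objs, theta_min):
    """Combine linear objectives into one coefficient vector to maximize.

    max_objs / min_objs: lists of coefficient vectors (one per objective).
    theta_max / theta_min: the individually optimal value of each objective,
    assumed non-zero and computed separately (for example by the simplex method).
    """
    n = len((max_objs or min_objs)[0])
    combined = [0.0] * n
    for coeffs, theta in zip(max_objs, theta_max):
        for j in range(n):
            combined[j] += coeffs[j] / abs(theta)  # normalized, added
    for coeffs, theta in zip(min_objs, theta_min):
        for j in range(n):
            combined[j] -= coeffs[j] / abs(theta)  # normalized, subtracted
    return combined
```

The statistical averaging variants follow the same pattern but replace each individual divisor with an average (for instance the arithmetic mean) of the absolute optimal values; the exact form used in the paper is not given in the abstract.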
Abstract: To study how the selection method of a multi-objective genetic algorithm affects the optimization result, the method is analyzed theoretically and discussed using an autonomous underwater vehicle (AUV) as the object. A changing weight value method is put forward and a selection formula is modified. Experiments were carried out on an AUV, TwinBurger. The results show that this method is effective and feasible.
Funding: Supported by the Important Science and Technology Project of Hainan Province under Grant (ZDKJ2020010).
Abstract: The marine environment is becoming increasingly complex because of the variety of marine vehicles, and the diversity of maritime objects poses a challenge to marine environmental governance. Maritime object detection technology plays an important role here. In the field of computer vision, there is no sufficiently comprehensive public dataset for maritime objects, in contrast to the automotive application domain. The existing maritime datasets either have no bounding boxes (they are made for object classification) or cover limited varieties of maritime objects. To fill this gap, this paper proposes the Multi-Category Large-Scale Dataset for Maritime Object Detection (MCMOD), collected by three onshore video cameras that capture data under various environmental conditions such as fog, rain, and evening. The whole dataset consists of 16,166 labelled images with 98,590 maritime objects classified into 10 classes. Compared with the existing maritime datasets, MCMOD contains a relatively balanced quantity of objects of different sizes (in the view). To evaluate MCMOD, this paper applies several state-of-the-art object detection approaches from computer vision research to it and compares their performance. Moreover, a comparison between MCMOD and an existing maritime dataset is conducted. Experimental results indicate that the proposed dataset classifies more types of maritime objects and covers more small-scale objects, which helps trained detectors recognize more types of maritime objects and detect them over a relatively long distance. The results also show that the adopted approaches need further improvement to enhance their capabilities in the maritime domain.
Funding: Supported by the National Key R&D Program of China under Grant No. 2018YFC1507302; the National Natural Science Foundation of China under Grant No. 42175006; Jiangsu Youth Talent Promotion Project (2021-084); the Basic Research Fund of CAMS under Grant No. 2020R002.
Abstract: The northeastern China cold vortex (NCCV) plays an important role in regional rainstorms over East Asia. Using the National Centers for Environmental Prediction Final reanalysis dataset and the Global Precipitation Measurement product, an objective algorithm for identifying heavy-precipitation NCCV (HPCV) events was designed, and the climatological features of 164 HPCV events from 2001 to 2019 were investigated. The number of HPCV events showed an upward linear trend, with the highest frequency of occurrence in summer. The most active region of HPCV samples was the Northeast China Plain between 40°–55°N. Most HPCV events lasted 3–5 days and had radii ranging from 250 to 1000 km. The duration of HPCV events with larger sizes was longer. About half of the HPCV events moved into (moved out of) the definition region (35°–60°N, 115°–145°E), and half initiated (dissipated) within the region. The initial position was close to the western boundary of the definition region, and the final position was mainly near the eastern boundary. The locations associated with the precipitation were mostly concentrated within 2000 km southeast of the HPCV systems, and they were farther from the center in the cold season than in the warm season.
Funding: Supported by the National Key Research and Development Program of China (No. 2020YFB1708200).
Abstract: Design change is an inevitable part of the product development process. This study proposes an improved binary multi-objective PSO algorithm guided by problem characteristics (P-BMOPSO) to solve the optimisation problem of a complex product change plan considering service performance. Firstly, a complex product multi-layer network with service performance is established for the first time to reveal the impact of change effect propagation on product service performance. Secondly, the concept of service performance impact (SPI) is defined by decoupling the impact of strongly associated nodes on service performance during change effect propagation. Then, a triple-objective selection model of change nodes is established, which includes three indicators: SPI degree, change cost, and change time. Furthermore, an integer multi-objective particle swarm optimisation algorithm guided by problem characteristics is developed to solve this model. Experimental results on the design change problem of a certain type of Skyworth TV verify the effectiveness of the established optimisation model and the proposed P-BMOPSO algorithm.
Funding: Funded by the Basic Science Major Foundation (Natural Science) of the Jiangsu Higher Education Institutions of China (Grant: 22KJA520012); the Xuzhou Science and Technology Plan Project (Grant: KC21303, KC22305); the sixth "333 project" of Jiangsu Province.
Abstract: To address the sensitivity of a single correlation filter model to complex scenes such as background interference and occlusion, a tracking algorithm based on multi-time-space perception and instance-specific proposals is proposed to optimize the mathematical model of the correlation filter (CF). Firstly, according to the consistency of the changes between the object frames and the filter frames, a mask matrix is introduced into the objective function of the filter to extract the spatio-temporal information of the object with background awareness. Secondly, an objective function of multi-feature fusion is constructed for object location, which is optimized by the Lagrange method and solved by closed iteration. During filter optimization, a time-space perception constraint term is designed to enhance the learning ability of the CF and improve the final tracking results. Finally, when the tracking results fluctuate, a boundary suppression factor is introduced into the instance-specific proposals to effectively reduce the risk of model drift. The accuracy and success rate of the proposed algorithm are verified by simulation analysis on two popular benchmarks, the object tracking benchmark 2015 (OTB2015) and the temple color 128 (TC-128). Extensive experimental results show that the optimized appearance model of the proposed algorithm is effective. The distance precision rate and overlap success rate of the proposed algorithm are 0.756 and 0.656 on the OTB2015 benchmark, which are better than the results of other competing algorithms. The results of this study can help solve the problem of real-time object tracking in real traffic environments and provide a specific reference for the detection of traffic abnormalities.
Funding: Supported by the National Natural Science Foundation of China (No. 52174021); Key Research and Development Project of Hainan Province (No. ZDYF2022GXJS003).
Abstract: Characterizing non-connected regions and contours completely and finely is a major challenge for existing salient object detection. The key to addressing it is how to make full use of the subjective and objective structural information obtained in different steps. Therefore, by simulating the human visual mechanism, this paper proposes a novel multi-decoder matching correction network and a subjective structural loss. Specifically, the loss pays different attention to the foreground, boundary, and background of the ground truth map in a top-down structure. The perceived saliency is mapped to the corresponding objective structure of the prediction map, which is extracted in a bottom-up manner. Thus, multi-level salient features can be effectively detected with the loss as a constraint. Then, through the mapping of an improved binary cross entropy loss, the differences between salient regions and objects are checked so that attention is paid to error-prone regions, achieving excellent error sensitivity. Finally, by tracking the identifying features horizontally and vertically, the subjective and objective interaction is maximized. Extensive experiments on five benchmark datasets demonstrate that, compared with 12 state-of-the-art methods, the algorithm has higher recall and precision, less error, and strong robustness and generalization ability, and can predict complete and refined saliency maps.
Abstract: With the advent of real-time applications such as autonomous driving, visual surveillance, and sports analysis, there is growing attention to Multiple-Object Tracking (MOT). The tracking-by-detection paradigm, a commonly used approach, connects the existing recognition hypotheses to the previously assessed object trajectories by comparing the similarities of their appearance or motion. For efficient detection and tracking of numerous objects in a complex environment, a Pearson Similarity-centred Kuhn-Munkres (PS-KM) algorithm is proposed in the present study. The input videos were first gathered from the MOT dataset and converted into frames. After frame conversion, background subtraction filtered out the irrelevant data in the frames. Then, features were extracted from the frames. Afterwards, the higher-dimensional features were transformed into lower-dimensional features, with feature reduction performed by Information Gain-centred Singular Value Decomposition (IG-SVD). Next, classification was carried out using the Modified Recurrent Neural Network (MRNN) method, which also identified the categories of the objects. The PS-KM algorithm then tracked the recognized objects. Finally, the experimental outcomes showed that numerous targets were precisely tracked by the proposed system with 97% accuracy and a low false positive rate (FPR) of 2.3%. It was also shown that the proposed technique was effective in comparison with existing techniques, viz. RNN, CNN, and KNN.
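The association step described above can be illustrated with a small sketch: Pearson correlation scores each track-detection pair, and the assignment with the highest total similarity is selected. This is not the authors' PS-KM implementation. An exhaustive search over permutations stands in for the Kuhn-Munkres algorithm (it returns the same optimum but only scales to a handful of objects), and the feature vectors and function names are hypothetical.

```python
from itertools import permutations
from math import sqrt


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length feature vectors."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b) if var_a > 0 and var_b > 0 else 0.0


def best_assignment(tracks, detections):
    """Return a list where entry i is the detection index assigned to track i.

    Assumes len(tracks) == len(detections). Brute force is used here in place
    of Kuhn-Munkres purely to keep the sketch short.
    """
    sim = [[pearson(t, d) for d in detections] for t in tracks]
    best = max(
        permutations(range(len(detections))),
        key=lambda perm: sum(sim[i][perm[i]] for i in range(len(tracks))),
    )
    return list(best)
```

For realistic numbers of objects, a polynomial-time solver such as SciPy's `linear_sum_assignment` applied to the negated similarity matrix would replace the permutation search.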
Funding: Sponsored by the National Natural Science Foundation of China (Grant No. 50608022).
Abstract: A multi-objective optimization method based on the Pareto Genetic Algorithm is presented for the shape design of membrane structures from a structural viewpoint. Several non-dimensional variables that determine the shapes of membrane structures are defined as optimization variables. Three objectives are proposed: maximum stiffness, maximum uniformity of stress, and minimum reaction under external loads. The Pareto Multi-objective Genetic Algorithm is introduced to find the Pareto solutions. The dependence of optimality on the optimization variables is then derived to provide guidelines on how to determine design parameters. Moreover, several examples illustrate the proposed methods and applications. The study shows that the multi-objective optimization method in this paper is feasible and efficient for membrane structures, and that research on Pareto solutions can provide explicit and useful guidelines for the shape design of membrane structures.
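The Pareto solutions mentioned above are the candidates that no other candidate dominates. A minimal sketch of that filter follows, assuming every objective is expressed as a value to minimize (a maximization objective such as stiffness would be negated first); the function names are illustrative and this is not the paper's genetic algorithm.

```python
def dominates(a, b):
    """True if a is no worse than b in every objective and better in one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(points):
    """Return the non-dominated points, preserving input order."""
    return [p for p in points if not any(dominates(q, p) for q in points)]
```

A Pareto genetic algorithm applies this kind of dominance test to rank each generation, so that selection favors non-dominated designs.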
Abstract: Data analysis of blasting sites has long been a goal of researchers in the field. The rise of mobile blasting robots has aroused many researchers' interest in machine learning methods for target detection in blasting. Serverless computing can provide a variety of computing services to people without hardware infrastructure or extensive software development experience, which has prompted interest in how to use it for machine learning. In this paper, we design a distributed machine learning training application based on the AWS Lambda platform. Based on data parallelism, data aggregation and training synchronization in Function as a Service (FaaS) are effectively realized. The application also encrypts the data set, effectively reducing the risk of data leakage. We rent a cloud server and a Lambda, and then conduct experiments to evaluate our application. Our results indicate the effectiveness, rapidity, and economy of distributed training on FaaS.
Abstract: By avoiding or reducing the production of waste, waste minimization is an effective approach to solving the pollution problem in the chemical industry. Process integration supported by multi-objective optimization provides a framework for process design or process retrofit by simultaneously optimizing environmental and economic aspects. A multi-objective genetic algorithm is applied in this area as the solution approach for the multi-objective optimization problem.