To address the problems of missing inside and incomplete edge contours in camouflaged target detection results,we propose a camouflaged moving target detection algorithm based on local minimum difference constraints(L...To address the problems of missing inside and incomplete edge contours in camouflaged target detection results,we propose a camouflaged moving target detection algorithm based on local minimum difference constraints(LMDC).The algorithm first uses the mean to optimize the initial background model,removes the stable background region by global comparison,and extracts the edge point set in the potential target region so that each boundary point(seed)grows along the center of the target.Finally,we define the minor difference constraints term,combine the seed path and the target space consistency,and calculate the attributes of each pixel in the potential target area to realize camouflaged moving target detection.The algorithm of this paper is verified based on a public data sofa video and test videos and compared with the five classic algorithms.The experimental results show that the proposed algorithm yields good results based on integrity,accuracy,and a number of objective evaluation indexes,and its overall performance is better than that of the compared algorithms.展开更多
Under the influence of air humidity,dust,aerosols,etc.,in real scenes,haze presents an uneven state.In this way,the image quality and contrast will decrease.In this case,It is difficult to detect the target in the ima...Under the influence of air humidity,dust,aerosols,etc.,in real scenes,haze presents an uneven state.In this way,the image quality and contrast will decrease.In this case,It is difficult to detect the target in the image by the universal detection network.Thus,a dual subnet based on multi-task collaborative training(DSMCT)is proposed in this paper.Firstly,in the training phase,the Gated Context Aggregation Network(GCANet)is used as the supervisory network of YOLOX to promote the extraction of clean information in foggy scenes.In the test phase,only the YOLOX branch needs to be activated to ensure the detection speed of the model.Secondly,the deformable convolution module is used to improve GCANet to enhance the model’s ability to capture details of non-homogeneous fog.Finally,the Coordinate Attention mechanism is introduced into the Vision Transformer and the backbone network of YOLOX is redesigned.In this way,the feature extraction ability of the network for deep-level information can be enhanced.The experimental results on artificial fog data set FOG_VOC and real fog data set RTTS show that the map value of DSMCT reached 86.56%and 62.39%,respectively,which was 2.27%and 4.41%higher than the current most advanced detection model.The DSMCT network has high practicality and effectiveness for target detection in real foggy scenes.展开更多
Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightwe...Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightweight Ghost-YOLO(You Only Look Once)v8 algorithm.The algorithmintegrates advanced attention mechanisms and a smalltarget detection head to significantly enhance detection performance and efficiency.Firstly,an SE(Squeeze-and-Excitation)mechanism is incorporated into the backbone network to fortify the extraction of resilient features and precise target localization.This mechanism models feature channel dependencies,enabling adaptive adjustment of channel importance,thereby improving recognition of floating litter targets.Secondly,a 160×160 small-target detection layer is designed in the feature fusion neck to mitigate semantic information loss due to varying target scales.This design enhances the fusion of deep and shallow semantic information,improving small target feature representation and enabling better capture and identification of tiny floating litter.Thirdly,to balance performance and efficiency,the GhostConv module replaces part of the conventional convolutions in the feature fusion neck.Additionally,a novel C2fGhost(CSPDarknet53 to 2-Stage Feature Pyramid Networks Ghost)module is introduced to further reduce network parameters.Lastly,to address the challenge of occlusion,a newloss function,WIoU(Wise Intersection over Union)v3 incorporating a flexible and non-monotonic concentration approach,is adopted to improve detection rates for surface floating litter.The outcomes of the experiments demonstrate that the Ghost-YOLO v8 model proposed in this paper performs well in the dataset Marine,significantly enhances precision and recall by 3.3 and 7.6 percentage points,respectively,in contrast with the base model,mAP@0.5 and mAP 0.5:0.95 improve by 5.3 and 4.4 percentage points and reduces the computational volume by 1.88MB,the FPS value hardly decreases,and the efficient real-time identification of floating debris on the water’s surface can be achieved costeffectively.展开更多
This paper expounds upon a novel target detection methodology distinguished by its elevated discriminatory efficacy,specifically tailored for environments characterized by markedly low luminance levels.Conventional me...This paper expounds upon a novel target detection methodology distinguished by its elevated discriminatory efficacy,specifically tailored for environments characterized by markedly low luminance levels.Conventional methodologies struggle with the challenges posed by luminosity fluctuations,especially in settings characterized by diminished radiance,further exacerbated by the utilization of suboptimal imaging instrumentation.The envisioned approach mandates a departure from the conventional YOLOX model,which exhibits inadequacies in mitigating these challenges.To enhance the efficacy of this approach in low-light conditions,the dehazing algorithm undergoes refinement,effecting a discerning regulation of the transmission rate at the pixel level,reducing it to values below 0.5,thereby resulting in an augmentation of image contrast.Subsequently,the coiflet wavelet transform is employed to discern and isolate high-discriminatory attributes by dismantling low-frequency image attributes and extracting high-frequency attributes across divergent axes.The utilization of CycleGAN serves to elevate the features of low-light imagery across an array of stylistic variances.Advanced computational methodologies are then employed to amalgamate and conflate intricate attributes originating from images characterized by distinct stylistic orientations,thereby augmenting the model’s erudition potential.Empirical validation conducted on the PASCAL VOC and MS COCO 2017 datasets substantiates pronounced advancements.The refined low-light enhancement algorithm yields a discernible 5.9%augmentation in the target detection evaluation index when compared to the original imagery.Mean Average Precision(mAP)undergoes enhancements of 9.45%and 0.052%in low-light visual renditions relative to conventional YOLOX outcomes.The envisaged approach presents a myriad of advantages over prevailing benchmark methodologies in the realm of target detection within environments marked by an acute scarcity of luminosity.展开更多
In order to address the problem of high false alarm rate and low probabilities of infrared small target detection in complex low-altitude background,an infrared small target detection method based on improved weighted...In order to address the problem of high false alarm rate and low probabilities of infrared small target detection in complex low-altitude background,an infrared small target detection method based on improved weighted local contrast is proposed in this paper.First,the ratio information between the target and local background is utilized as an enhancement factor.The local contrast is calculated by incorporating the heterogeneity between the target and local background.Then,a local product weighted method is designed based on the spatial dissimilarity between target and background to further enhance target while suppressing background.Finally,the location of target is obtained by adaptive threshold segmentation.As experimental results demonstrate,the method shows superior performance in several evaluation metrics compared with six existing algorithms on different datasets containing targets such as unmanned aerial vehicles(UAV).展开更多
To address the challenges of missed detections in water surface target detection using solely visual algorithms in unmanned surface vehicle(USV)perception,this paper proposes a method based on the fusion of visual and...To address the challenges of missed detections in water surface target detection using solely visual algorithms in unmanned surface vehicle(USV)perception,this paper proposes a method based on the fusion of visual and LiDAR point-cloud projection for water surface target detection.Firstly,the visual recognition component employs an improved YOLOv7 algorithmbased on a self-built dataset for the detection of water surface targets.This algorithm modifies the original YOLOv7 architecture to a Slim-Neck structure,addressing the problemof excessive redundant information during feature extraction in the original YOLOv7 network model.Simultaneously,this modification simplifies the computational burden of the detector,reduces inference time,and maintains accuracy.Secondly,to tackle the issue of sample imbalance in the self-built dataset,slide loss function is introduced.Finally,this paper replaces the original Complete Intersection over Union(CIoU)loss function with the Minimum Point Distance Intersection over Union(MPDIoU)loss function in the YOLOv7 algorithm,which accelerates model learning and enhances robustness.To mitigate the problem of missed recognitions caused by complex water surface conditions in purely visual algorithms,this paper further adopts the fusion of LiDAR and camera data,projecting the threedimensional point-cloud data from LiDAR onto a two-dimensional pixel plane.This significantly reduces the rate of missed detections for water surface targets.展开更多
In order to solve the problems that the current synthetic aperture radar(SAR)image target detection method cannot adapt to targets of different sizes,and the complex image background leads to low detection accuracy,an...In order to solve the problems that the current synthetic aperture radar(SAR)image target detection method cannot adapt to targets of different sizes,and the complex image background leads to low detection accuracy,an improved SAR image small target detection method based on YOLOv7 was proposed in this study.The proposed method improved the feature extraction network by using Switchable Around Convolution(SAConv)in the backbone network to help the model capture target information at different scales,thus improving the feature extraction ability for small targets.Based on the attention mechanism,the DyHead module was embedded in the target detection head to reduce the impact of complex background,and better focus on the small targets.In addition,the NWD loss function was introduced and combined with CIoU loss.Compared to the CIoU loss function typically used in YOLOv7,the NWD loss function pays more attention to the processing of small targets,so as to further improve the detection ability of small targets.The experimental results on the HRSID dataset indicate that the proposed method achieved mAP@0.5 and mAP@0.95 scores of 93.5%and 71.5%,respectively.Compared to the baseline model,this represents an increase of 7.2%and 7.6%,respectively.The proposed method can effectively complete the task of SAR image small target detection.展开更多
Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including hig...Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including high-resolution imagery and exceptional mobility,making them well suited for monitoring flood extent and identifying rescue targets during floods.However,there are some challenges in interpreting rescue information in real time from flood images captured by UAVs,such as the complexity of the scenarios of UAV images,the lack of flood rescue target detection datasets and the limited real-time processing capabilities of the airborne on-board platform.Thus,we propose a real-time rescue target detection method for UAVs that is capable of efficiently delineating flood extent and identifying rescue targets(i.e.,pedestrians and vehicles trapped by floods).The proposed method achieves real-time rescue information extraction for UAV platforms by lightweight processing and fusion of flood extent extraction model and target detection model.The flood inundation range is extracted by the proposed method in real time and detects targets such as people and vehicles to be rescued based on this layer.Our experimental results demonstrate that the Intersection over Union(IoU)for flood water extraction reaches an impressive 80%,and the IoU for real-time flood water extraction stands at a commendable 76.4%.The information on flood stricken targets extracted by this method in real time can be used for flood emergency rescue.展开更多
Camouflaged people are extremely expert in actively concealing themselves by effectively utilizing cover and the surrounding environment. Despite advancements in optical detection capabilities through imaging systems,...Camouflaged people are extremely expert in actively concealing themselves by effectively utilizing cover and the surrounding environment. Despite advancements in optical detection capabilities through imaging systems, including spectral, polarization, and infrared technologies, there is still a lack of effective real-time method for accurately detecting small-size and high-efficient camouflaged people in complex real-world scenes. Here, this study proposes a snapshot multispectral image-based camouflaged detection model, multispectral YOLO(MS-YOLO), which utilizes the SPD-Conv and Sim AM modules to effectively represent targets and suppress background interference by exploiting the spatial-spectral target information. Besides, the study constructs the first real-shot multispectral camouflaged people dataset(MSCPD), which encompasses diverse scenes, target scales, and attitudes. To minimize information redundancy, MS-YOLO selects an optimal subset of 12 bands with strong feature representation and minimal inter-band correlation as input. Through experiments on the MSCPD, MS-YOLO achieves a mean Average Precision of 94.31% and real-time detection at 65 frames per second, which confirms the effectiveness and efficiency of our method in detecting camouflaged people in various typical desert and forest scenes. Our approach offers valuable support to improve the perception capabilities of unmanned aerial vehicles in detecting enemy forces and rescuing personnel in battlefield.展开更多
In this paper,a detection method combining Cameron decomposition based on polarization scattering characteristics in sea clutter background is proposed.Firstly,the Cameron decomposition is exploited to fuse the radar ...In this paper,a detection method combining Cameron decomposition based on polarization scattering characteristics in sea clutter background is proposed.Firstly,the Cameron decomposition is exploited to fuse the radar echoes of full polarization channels at the data level.Due to the artificial material structure on the surface of the target,it can be shown that the non-reciprocity of the target cell is stronger than that of the clutter cell.Then,based on the analysis of the decomposition results,a new feature with scattering geometry characteristics in polarization domain,denoted as Cameron polarization decomposition scattering weight(CPD-SW),is extracted as the test statistic,which can achieve more detailed descriptions of the clutter scattering characteristics utilizing the difference between their scattering types.Finally,the superiority of the proposed CPD-SW detector over traditional detectors in improving detection performance is verified by the IPIX measured dataset,which has strong stability under short-time observation in threshold detection and can also improve the separability of feature space zin anomaly detection.展开更多
Aiming at the problem of low accuracy of traditional target detection methods for target detection in endoscopes in substation environments, a CNN-based real-time detection method for masked targets is proposed. The m...Aiming at the problem of low accuracy of traditional target detection methods for target detection in endoscopes in substation environments, a CNN-based real-time detection method for masked targets is proposed. The method adopts the overall design of backbone network, detection network and algorithmic parameter optimisation method, completes the model training on the self-constructed occlusion target dataset, and adopts the multi-scale perception method for target detection. The HNM algorithm is used to screen positive and negative samples during the training process, and the NMS algorithm is used to post-process the prediction results during the detection process to improve the detection efficiency. After experimental validation, the obtained model has the multi-class average predicted value (mAP) of the dataset. It has general advantages over traditional target detection methods. The detection time of a single target on FDDB dataset is 39 ms, which can meet the need of real-time target detection. In addition, the project team has successfully deployed the method into substations and put it into use in many places in Beijing, which is important for achieving the anomaly of occlusion target detection.展开更多
The object detectors can precisely detect the camouflaged object beyond human perception.The investigations reveal that the CNNs-based(Convolution Neural Networks)detectors are vulnerable to adversarial attacks.Some w...The object detectors can precisely detect the camouflaged object beyond human perception.The investigations reveal that the CNNs-based(Convolution Neural Networks)detectors are vulnerable to adversarial attacks.Some works can fool detectors by crafting the adversarial camouflage attached to the object,leading to wrong prediction.It is hard for military operations to utilize the existing adversarial camouflage due to its conspicuous appearance.Motivated by this,this paper proposes the Dual Attribute Adversarial Camouflage(DAAC)for evading the detection by both detectors and humans.Generating DAAC includes two steps:(1)Extracting features from a specific type of scene to generate individual soldier digital camouflage;(2)Attaching the adversarial patch with scene features constraint to the individual soldier digital camouflage to generate the adversarial attribute of DAAC.The visual effects of the individual soldier digital camouflage and the adversarial patch will be improved after integrating with the scene features.Experiment results show that objects camouflaged by DAAC are well integrated with background and achieve visual concealment while remaining effective in fooling object detectors,thus evading the detections by both detectors and humans in the digital domain.This work can serve as the reference for crafting the adversarial camouflage in the physical world.展开更多
Small targets and occluded targets will inevitably appear in the image during the shooting process due to the influence of angle,distance,complex scene,illumination intensity,and other factors.These targets have few e...Small targets and occluded targets will inevitably appear in the image during the shooting process due to the influence of angle,distance,complex scene,illumination intensity,and other factors.These targets have few effective pixels,few features,and no apparent features,which makes extracting their efficient features difficult and easily leads to false detection,missed detection,and repeated detection,affecting the performance of target detection models.An improved faster region convolutional neural network(RCNN)algorithm(CF-RCNN)integrating convolutional block attention module(CBAM)and feature pyramid networks(FPN)is proposed to improve the detection and recognition accuracy of small-size objects,occluded or truncated objects in complex scenes.Firstly,the CBAM mechanism is integrated into the feature extraction network to improve the detection ability of occluded or truncated objects.Secondly,the FPN-featured pyramid structure is introduced to obtain high-resolution and vital semantic data to enhance the detection effect of small-size objects.The experimental results show that the mean average precision of target detection of the improved algorithm on PASCAL VOC2012 is improved to 76.1%,which is 13.8 percentage points higher than that of the commonly used Faster RCNN and other algorithms.Furthermore,it is better than the commonly used small sample target detection algorithm.展开更多
Considering the problem that the scattering echo images of airborne Doppler weather radar are often reduced by ground clutters,the accuracy and confidence of meteorology target detection are reduced.In this paper,a de...Considering the problem that the scattering echo images of airborne Doppler weather radar are often reduced by ground clutters,the accuracy and confidence of meteorology target detection are reduced.In this paper,a deep convolutional neural network(DCNN)is proposed for meteorology target detection and ground clutter suppression with a large collection of airborne weather radar images as network input.For each weather radar image,the corresponding digital elevation model(DEM)image is extracted on basis of the radar antenna scan-ning parameters and plane position,and is further fed to the net-work as a supplement for ground clutter suppression.The fea-tures of actual meteorology targets are learned in each bottle-neck module of the proposed network and convolved into deeper iterations in the forward propagation process.Then the network parameters are updated by the back propagation itera-tion of the training error.Experimental results on the real mea-sured images show that our proposed DCNN outperforms the counterparts in terms of six evaluation factors.Meanwhile,the network outputs are in good agreement with the expected mete-orology detection results(labels).It is demonstrated that the pro-posed network would have a promising meteorology observa-tion application with minimal effort on network variables or parameter changes.展开更多
In the scene of wideband radar,due to the spread of target scattering points,the attitude and angle of view of the target constantly change in the process of moving.It is difficult to predict,and the actual echo of mu...In the scene of wideband radar,due to the spread of target scattering points,the attitude and angle of view of the target constantly change in the process of moving.It is difficult to predict,and the actual echo of multiple scattered points is not fully matched with the transmitted signal.Therefore,it is challenging for the traditional matching filter method to achieve a complete matching effect in wideband echo detection.In addition,the energy dispersion of complex target echoes is still a problem in radar target detection under broadband conditions.Therefore,this paper proposes a wideband target detection method based on dualchannel correlation processing of range-extended targets.This method fully uses the spatial distribution characteristics of target scattering points of echo signal and the matching characteristics of the dual-channel point extension function itself.The radial accumulation of wideband target echo signal in the complex domain is realized through the adaptive correlation processing of a dual-channel echo signal.The accu-mulation effect of high matching degree is achieved to improve the detection probability and the performance of wideband detection.Finally,electromagnetic simulation experiments and measured data verify that the proposed method has the advan-tages of high signal to noise ratio(SNR)gain and high detection probability under low SNR conditions.展开更多
In recent years,target detection of aerial images of unmannedaerial vehicle(UAV)has become one of the hottest topics.However,targetdetection of UAV aerial images often presents false detection and misseddetection.We p...In recent years,target detection of aerial images of unmannedaerial vehicle(UAV)has become one of the hottest topics.However,targetdetection of UAV aerial images often presents false detection and misseddetection.We proposed a modified you only look once(YOLO)model toimprove the problems arising in object detection in UAV aerial images:(1)A new residual structure is designed to improve the ability to extract featuresby enhancing the fusion of the inner features of the single layer.At the sametime,triplet attention module is added to strengthen the connection betweenspace and channel and better retain important feature information.(2)Thefeature information is enriched by improving the multi-scale feature pyramidstructure and strengthening the feature fusion at different scales.(3)A newloss function is created and the diagonal penalty term of the anchor frame isintroduced to improve the speed of training and the accuracy of reasoning.The proposed model is called residual feature fusion triple attention YOLO(RT-YOLO).Experiments showed that the mean average precision(mAP)ofRT-YOLO is increased from 57.2%to 60.8%on the vehicle detection in aerialimage(VEDAI)dataset,and the mAP is also increased by 1.7%on the remotesensing object detection(RSOD)dataset.The results show that theRT-YOLOoutperforms other mainstream models in UAV aerial image object detection.展开更多
This paper proposes a real-time detection method to improve the Infrared small target detection CenterNet(ISTD-CenterNet)network for detecting small infrared targets in complex environments.The method eliminates the n...This paper proposes a real-time detection method to improve the Infrared small target detection CenterNet(ISTD-CenterNet)network for detecting small infrared targets in complex environments.The method eliminates the need for an anchor frame,addressing the issues of low accuracy and slow speed.HRNet is used as the framework for feature extraction,and an ECBAM attention module is added to each stage branch for intelligent identification of the positions of small targets and significant objects.A scale enhancement module is also added to obtain a high-level semantic representation and fine-resolution prediction map for the entire infrared image.Besides,an improved sensory field enhancement module is designed to leverage semantic information in low-resolution feature maps,and a convolutional attention mechanism module is used to increase network stability and convergence speed.Comparison experiments conducted on the infrared small target data set ESIRST.The experiments show that compared to the benchmark network CenterNet-HRNet,the proposed ISTD-CenterNet improves the recall by 22.85%and the detection accuracy by 13.36%.Compared to the state-of-the-art YOLOv5small,the ISTD-CenterNet recall is improved by 5.88%,the detection precision is improved by 2.33%,and the detection frame rate is 48.94 frames/sec,which realizes the accurate real-time detection of small infrared targets.展开更多
Target signal acquisition and detection based on sonar images is a challenging task due to the complex underwater environment.In order to solve the problem that some semantic information in sonar images is lost and mo...Target signal acquisition and detection based on sonar images is a challenging task due to the complex underwater environment.In order to solve the problem that some semantic information in sonar images is lost and model detection performance is degraded due to the complex imaging environment,we proposed a more effective and robust target detection framework based on deep learning,which can make full use of the acoustic shadow information in the forward-looking sonar images to assist underwater target detection.Firstly,the weighted box fusion method is adopted to generate a fusion box by weighted fusion of prediction boxes with high confidence,so as to obtain accurate acoustic shadow boxes.Further,the acoustic shadow box is cut down to get the feature map containing the acoustic shadow information,and then the acoustic shadow feature map and the target information feature map are adaptively fused to make full use of the acoustic shadow feature information.In addition,we introduce a threshold processing module to improve the attention of the model to important feature information.Through the underwater sonar dataset provided by Pengcheng Laboratory,the proposed method improved the average accuracy by 3.14%at the IoU threshold of 0.7,which is better than the current traditional target detection model.展开更多
Infrared target detection models are more required than ever before to be deployed on embedded platforms,which requires models with less memory consumption and better real-time performance while considering accuracy.T...Infrared target detection models are more required than ever before to be deployed on embedded platforms,which requires models with less memory consumption and better real-time performance while considering accuracy.To address the above challenges,we propose a modified You Only Look Once(YOLO)algorithm PF-YOLOv4-Tiny.The algorithm incorpo-rates spatial pyramidal pooling(SPP)and squeeze-and-excitation(SE)visual attention modules to enhance the target localization capability.The PANet-based-feature pyramid networks(P-FPN)are proposed to transfer semantic information and location information simultaneously to ameliorate detection accuracy.To lighten the network,the standard convolutions other than the backbone network are replaced with depthwise separable convolutions.In post-processing the images,the soft-non-maximum suppression(soft-NMS)algorithm is employed to subside the missed and false detection problems caused by the occlusion between targets.The accuracy of our model can finally reach 61.75%,while the total Params is only 9.3 M and GFLOPs is 11.At the same time,the inference speed reaches 87 FPS on NVIDIA GeForce GTX 1650 Ti,which can meet the requirements of the infrared target detection algorithm for the embedded deployments.展开更多
In this paper,an algorithm based on a fractional time-frequency spectrum feature is proposed to improve the accuracy of synthetic aperture radar(SAR)target detection.By extending the fractional Gabor transform(FrGT)in...In this paper,an algorithm based on a fractional time-frequency spectrum feature is proposed to improve the accuracy of synthetic aperture radar(SAR)target detection.By extending the fractional Gabor transform(FrGT)into two dimensions,the fractional time-frequency spectrum feature of an image can be obtained.In the achievement process,we search for the optimal order and design the optimal window function to accomplish the two-dimensional optimal FrGT.Finally,the energy attenuation gradient(EAG)feature of the optimal time-frequency spectrum is extracted for high-frequency detection.The simulation results show the proposed algorithm has a good performance in SAR target detection and lays the foundation for recognition.展开更多
文摘To address the problems of missing inside and incomplete edge contours in camouflaged target detection results,we propose a camouflaged moving target detection algorithm based on local minimum difference constraints(LMDC).The algorithm first uses the mean to optimize the initial background model,removes the stable background region by global comparison,and extracts the edge point set in the potential target region so that each boundary point(seed)grows along the center of the target.Finally,we define the minor difference constraints term,combine the seed path and the target space consistency,and calculate the attributes of each pixel in the potential target area to realize camouflaged moving target detection.The algorithm of this paper is verified based on a public data sofa video and test videos and compared with the five classic algorithms.The experimental results show that the proposed algorithm yields good results based on integrity,accuracy,and a number of objective evaluation indexes,and its overall performance is better than that of the compared algorithms.
基金This work was jointly supported by the Special Fund for Transformation and Upgrade of Jiangsu Industry and Information Industry-Key Core Technologies(Equipment)Key Industrialization Projects in 2022(No.CMHI-2022-RDG-004):“Key Technology Research for Development of Intelligent Wind Power Operation and Maintenance Mothership in Deep Sea”.
文摘Under the influence of air humidity,dust,aerosols,etc.,in real scenes,haze presents an uneven state.In this way,the image quality and contrast will decrease.In this case,It is difficult to detect the target in the image by the universal detection network.Thus,a dual subnet based on multi-task collaborative training(DSMCT)is proposed in this paper.Firstly,in the training phase,the Gated Context Aggregation Network(GCANet)is used as the supervisory network of YOLOX to promote the extraction of clean information in foggy scenes.In the test phase,only the YOLOX branch needs to be activated to ensure the detection speed of the model.Secondly,the deformable convolution module is used to improve GCANet to enhance the model’s ability to capture details of non-homogeneous fog.Finally,the Coordinate Attention mechanism is introduced into the Vision Transformer and the backbone network of YOLOX is redesigned.In this way,the feature extraction ability of the network for deep-level information can be enhanced.The experimental results on artificial fog data set FOG_VOC and real fog data set RTTS show that the map value of DSMCT reached 86.56%and 62.39%,respectively,which was 2.27%and 4.41%higher than the current most advanced detection model.The DSMCT network has high practicality and effectiveness for target detection in real foggy scenes.
基金Supported by the fund of the Henan Province Science and Technology Research Project(No.242102210213).
文摘Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightweight Ghost-YOLO(You Only Look Once)v8 algorithm.The algorithmintegrates advanced attention mechanisms and a smalltarget detection head to significantly enhance detection performance and efficiency.Firstly,an SE(Squeeze-and-Excitation)mechanism is incorporated into the backbone network to fortify the extraction of resilient features and precise target localization.This mechanism models feature channel dependencies,enabling adaptive adjustment of channel importance,thereby improving recognition of floating litter targets.Secondly,a 160×160 small-target detection layer is designed in the feature fusion neck to mitigate semantic information loss due to varying target scales.This design enhances the fusion of deep and shallow semantic information,improving small target feature representation and enabling better capture and identification of tiny floating litter.Thirdly,to balance performance and efficiency,the GhostConv module replaces part of the conventional convolutions in the feature fusion neck.Additionally,a novel C2fGhost(CSPDarknet53 to 2-Stage Feature Pyramid Networks Ghost)module is introduced to further reduce network parameters.Lastly,to address the challenge of occlusion,a newloss function,WIoU(Wise Intersection over Union)v3 incorporating a flexible and non-monotonic concentration approach,is adopted to improve detection rates for surface floating litter.The outcomes of the experiments demonstrate that the Ghost-YOLO v8 model proposed in this paper performs well in the dataset Marine,significantly enhances precision and recall by 3.3 and 7.6 percentage points,respectively,in contrast with the base model,mAP@0.5 and mAP 0.5:0.95 improve by 5.3 and 4.4 percentage points and reduces the computational volume by 1.88MB,the FPS value hardly decreases,and the efficient real-time identification of floating debris on the water’s surface can be achieved costeffectively.
基金supported by National Sciences Foundation of China Grants(No.61902158).
文摘This paper expounds upon a novel target detection methodology distinguished by its elevated discriminatory efficacy,specifically tailored for environments characterized by markedly low luminance levels.Conventional methodologies struggle with the challenges posed by luminosity fluctuations,especially in settings characterized by diminished radiance,further exacerbated by the utilization of suboptimal imaging instrumentation.The envisioned approach mandates a departure from the conventional YOLOX model,which exhibits inadequacies in mitigating these challenges.To enhance the efficacy of this approach in low-light conditions,the dehazing algorithm undergoes refinement,effecting a discerning regulation of the transmission rate at the pixel level,reducing it to values below 0.5,thereby resulting in an augmentation of image contrast.Subsequently,the coiflet wavelet transform is employed to discern and isolate high-discriminatory attributes by dismantling low-frequency image attributes and extracting high-frequency attributes across divergent axes.The utilization of CycleGAN serves to elevate the features of low-light imagery across an array of stylistic variances.Advanced computational methodologies are then employed to amalgamate and conflate intricate attributes originating from images characterized by distinct stylistic orientations,thereby augmenting the model’s erudition potential.Empirical validation conducted on the PASCAL VOC and MS COCO 2017 datasets substantiates pronounced advancements.The refined low-light enhancement algorithm yields a discernible 5.9%augmentation in the target detection evaluation index when compared to the original imagery.Mean Average Precision(mAP)undergoes enhancements of 9.45%and 0.052%in low-light visual renditions relative to conventional YOLOX outcomes.The envisaged approach presents a myriad of advantages over prevailing benchmark methodologies in the realm of target detection within environments marked by an acute scarcity of luminosity.
基金supported by the National Natural Science Foundation of China (No.U1833203),the National Natural Science Foundation of China (No.62301036)the Aviation Science Foundation (No.2020Z019055001)China Postdoctoral Science Foundation Funded Project (No.2022M720446)。
文摘In order to address the problem of high false alarm rate and low probabilities of infrared small target detection in complex low-altitude background,an infrared small target detection method based on improved weighted local contrast is proposed in this paper.First,the ratio information between the target and local background is utilized as an enhancement factor.The local contrast is calculated by incorporating the heterogeneity between the target and local background.Then,a local product weighted method is designed based on the spatial dissimilarity between target and background to further enhance target while suppressing background.Finally,the location of target is obtained by adaptive threshold segmentation.As experimental results demonstrate,the method shows superior performance in several evaluation metrics compared with six existing algorithms on different datasets containing targets such as unmanned aerial vehicles(UAV).
基金supported by the National Natural Science Foundation of China(No.51876114)the Shanghai Engineering Research Center of Marine Renewable Energy(Grant No.19DZ2254800).
文摘To address the challenges of missed detections in water surface target detection using solely visual algorithms in unmanned surface vehicle(USV)perception,this paper proposes a method based on the fusion of visual and LiDAR point-cloud projection for water surface target detection.Firstly,the visual recognition component employs an improved YOLOv7 algorithmbased on a self-built dataset for the detection of water surface targets.This algorithm modifies the original YOLOv7 architecture to a Slim-Neck structure,addressing the problemof excessive redundant information during feature extraction in the original YOLOv7 network model.Simultaneously,this modification simplifies the computational burden of the detector,reduces inference time,and maintains accuracy.Secondly,to tackle the issue of sample imbalance in the self-built dataset,slide loss function is introduced.Finally,this paper replaces the original Complete Intersection over Union(CIoU)loss function with the Minimum Point Distance Intersection over Union(MPDIoU)loss function in the YOLOv7 algorithm,which accelerates model learning and enhances robustness.To mitigate the problem of missed recognitions caused by complex water surface conditions in purely visual algorithms,this paper further adopts the fusion of LiDAR and camera data,projecting the threedimensional point-cloud data from LiDAR onto a two-dimensional pixel plane.This significantly reduces the rate of missed detections for water surface targets.
文摘In order to solve the problems that the current synthetic aperture radar(SAR)image target detection method cannot adapt to targets of different sizes,and the complex image background leads to low detection accuracy,an improved SAR image small target detection method based on YOLOv7 was proposed in this study.The proposed method improved the feature extraction network by using Switchable Around Convolution(SAConv)in the backbone network to help the model capture target information at different scales,thus improving the feature extraction ability for small targets.Based on the attention mechanism,the DyHead module was embedded in the target detection head to reduce the impact of complex background,and better focus on the small targets.In addition,the NWD loss function was introduced and combined with CIoU loss.Compared to the CIoU loss function typically used in YOLOv7,the NWD loss function pays more attention to the processing of small targets,so as to further improve the detection ability of small targets.The experimental results on the HRSID dataset indicate that the proposed method achieved mAP@0.5 and mAP@0.95 scores of 93.5%and 71.5%,respectively.Compared to the baseline model,this represents an increase of 7.2%and 7.6%,respectively.The proposed method can effectively complete the task of SAR image small target detection.
基金National Natural Science Foundation of China(No.42271416)Guangxi Science and Technology Major Project(No.AA22068072)Shennongjia National Park Resources Comprehensive Investigation Research Project(No.SNJNP2023015).
文摘Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including high-resolution imagery and exceptional mobility,making them well suited for monitoring flood extent and identifying rescue targets during floods.However,there are some challenges in interpreting rescue information in real time from flood images captured by UAVs,such as the complexity of the scenarios of UAV images,the lack of flood rescue target detection datasets and the limited real-time processing capabilities of the airborne on-board platform.Thus,we propose a real-time rescue target detection method for UAVs that is capable of efficiently delineating flood extent and identifying rescue targets(i.e.,pedestrians and vehicles trapped by floods).The proposed method achieves real-time rescue information extraction for UAV platforms by lightweight processing and fusion of flood extent extraction model and target detection model.The flood inundation range is extracted by the proposed method in real time and detects targets such as people and vehicles to be rescued based on this layer.Our experimental results demonstrate that the Intersection over Union(IoU)for flood water extraction reaches an impressive 80%,and the IoU for real-time flood water extraction stands at a commendable 76.4%.The information on flood stricken targets extracted by this method in real time can be used for flood emergency rescue.
基金support by the National Natural Science Foundation of China (Grant No. 62005049)Natural Science Foundation of Fujian Province (Grant Nos. 2020J01451, 2022J05113)Education and Scientific Research Program for Young and Middleaged Teachers in Fujian Province (Grant No. JAT210035)。
文摘Camouflaged people are extremely expert in actively concealing themselves by effectively utilizing cover and the surrounding environment. Despite advancements in optical detection capabilities through imaging systems, including spectral, polarization, and infrared technologies, there is still a lack of effective real-time method for accurately detecting small-size and high-efficient camouflaged people in complex real-world scenes. Here, this study proposes a snapshot multispectral image-based camouflaged detection model, multispectral YOLO(MS-YOLO), which utilizes the SPD-Conv and Sim AM modules to effectively represent targets and suppress background interference by exploiting the spatial-spectral target information. Besides, the study constructs the first real-shot multispectral camouflaged people dataset(MSCPD), which encompasses diverse scenes, target scales, and attitudes. To minimize information redundancy, MS-YOLO selects an optimal subset of 12 bands with strong feature representation and minimal inter-band correlation as input. Through experiments on the MSCPD, MS-YOLO achieves a mean Average Precision of 94.31% and real-time detection at 65 frames per second, which confirms the effectiveness and efficiency of our method in detecting camouflaged people in various typical desert and forest scenes. Our approach offers valuable support to improve the perception capabilities of unmanned aerial vehicles in detecting enemy forces and rescuing personnel in battlefield.
基金supported by the National Natural Science Foundation of China(62201251)the Natural Science Foundation of the Jiangsu Higher Education Institutions of China(22KJB510024)the Open Fund for the Hangzhou Institute of Technology Academician Workstation at Xidian University(XH-KY-202306-0291)。
文摘In this paper,a detection method combining Cameron decomposition based on polarization scattering characteristics in sea clutter background is proposed.Firstly,the Cameron decomposition is exploited to fuse the radar echoes of full polarization channels at the data level.Due to the artificial material structure on the surface of the target,it can be shown that the non-reciprocity of the target cell is stronger than that of the clutter cell.Then,based on the analysis of the decomposition results,a new feature with scattering geometry characteristics in polarization domain,denoted as Cameron polarization decomposition scattering weight(CPD-SW),is extracted as the test statistic,which can achieve more detailed descriptions of the clutter scattering characteristics utilizing the difference between their scattering types.Finally,the superiority of the proposed CPD-SW detector over traditional detectors in improving detection performance is verified by the IPIX measured dataset,which has strong stability under short-time observation in threshold detection and can also improve the separability of feature space zin anomaly detection.
文摘Aiming at the problem of low accuracy of traditional target detection methods for target detection in endoscopes in substation environments, a CNN-based real-time detection method for masked targets is proposed. The method adopts the overall design of backbone network, detection network and algorithmic parameter optimisation method, completes the model training on the self-constructed occlusion target dataset, and adopts the multi-scale perception method for target detection. The HNM algorithm is used to screen positive and negative samples during the training process, and the NMS algorithm is used to post-process the prediction results during the detection process to improve the detection efficiency. After experimental validation, the obtained model has the multi-class average predicted value (mAP) of the dataset. It has general advantages over traditional target detection methods. The detection time of a single target on FDDB dataset is 39 ms, which can meet the need of real-time target detection. In addition, the project team has successfully deployed the method into substations and put it into use in many places in Beijing, which is important for achieving the anomaly of occlusion target detection.
基金National Natural Science Foundation of China(grant number 61801512,grant number 62071484)Natural Science Foundation of Jiangsu Province(grant number BK20180080)to provide fund for conducting experiments。
文摘The object detectors can precisely detect the camouflaged object beyond human perception.The investigations reveal that the CNNs-based(Convolution Neural Networks)detectors are vulnerable to adversarial attacks.Some works can fool detectors by crafting the adversarial camouflage attached to the object,leading to wrong prediction.It is hard for military operations to utilize the existing adversarial camouflage due to its conspicuous appearance.Motivated by this,this paper proposes the Dual Attribute Adversarial Camouflage(DAAC)for evading the detection by both detectors and humans.Generating DAAC includes two steps:(1)Extracting features from a specific type of scene to generate individual soldier digital camouflage;(2)Attaching the adversarial patch with scene features constraint to the individual soldier digital camouflage to generate the adversarial attribute of DAAC.The visual effects of the individual soldier digital camouflage and the adversarial patch will be improved after integrating with the scene features.Experiment results show that objects camouflaged by DAAC are well integrated with background and achieve visual concealment while remaining effective in fooling object detectors,thus evading the detections by both detectors and humans in the digital domain.This work can serve as the reference for crafting the adversarial camouflage in the physical world.
基金sponsored by the Natural Science Research Program of Higher Education Jiangsu Province (19KJD520005)Qing Lan Project of Jiangsu Province (Su Teacher’s Letter [2021]No.11)the Young Teacher Development Fund of Pujiang Institute Nanjing Tech University ( [2021]No.73).
文摘Small targets and occluded targets will inevitably appear in the image during the shooting process due to the influence of angle,distance,complex scene,illumination intensity,and other factors.These targets have few effective pixels,few features,and no apparent features,which makes extracting their efficient features difficult and easily leads to false detection,missed detection,and repeated detection,affecting the performance of target detection models.An improved faster region convolutional neural network(RCNN)algorithm(CF-RCNN)integrating convolutional block attention module(CBAM)and feature pyramid networks(FPN)is proposed to improve the detection and recognition accuracy of small-size objects,occluded or truncated objects in complex scenes.Firstly,the CBAM mechanism is integrated into the feature extraction network to improve the detection ability of occluded or truncated objects.Secondly,the FPN-featured pyramid structure is introduced to obtain high-resolution and vital semantic data to enhance the detection effect of small-size objects.The experimental results show that the mean average precision of target detection of the improved algorithm on PASCAL VOC2012 is improved to 76.1%,which is 13.8 percentage points higher than that of the commonly used Faster RCNN and other algorithms.Furthermore,it is better than the commonly used small sample target detection algorithm.
基金supported by the China Ministry of Industry and Information Technology Foundation and Aeronautical Science Foundation of China(ASFC-201920007002)the National Key Research and Development Plan(2021YFB1600603)the Open Fund of Key Laboratory of Civil Aircraft Airworthiness Technology,Civil Aviation University of China.
文摘Considering the problem that the scattering echo images of airborne Doppler weather radar are often reduced by ground clutters,the accuracy and confidence of meteorology target detection are reduced.In this paper,a deep convolutional neural network(DCNN)is proposed for meteorology target detection and ground clutter suppression with a large collection of airborne weather radar images as network input.For each weather radar image,the corresponding digital elevation model(DEM)image is extracted on basis of the radar antenna scan-ning parameters and plane position,and is further fed to the net-work as a supplement for ground clutter suppression.The fea-tures of actual meteorology targets are learned in each bottle-neck module of the proposed network and convolved into deeper iterations in the forward propagation process.Then the network parameters are updated by the back propagation itera-tion of the training error.Experimental results on the real mea-sured images show that our proposed DCNN outperforms the counterparts in terms of six evaluation factors.Meanwhile,the network outputs are in good agreement with the expected mete-orology detection results(labels).It is demonstrated that the pro-posed network would have a promising meteorology observa-tion application with minimal effort on network variables or parameter changes.
文摘In the scene of wideband radar,due to the spread of target scattering points,the attitude and angle of view of the target constantly change in the process of moving.It is difficult to predict,and the actual echo of multiple scattered points is not fully matched with the transmitted signal.Therefore,it is challenging for the traditional matching filter method to achieve a complete matching effect in wideband echo detection.In addition,the energy dispersion of complex target echoes is still a problem in radar target detection under broadband conditions.Therefore,this paper proposes a wideband target detection method based on dualchannel correlation processing of range-extended targets.This method fully uses the spatial distribution characteristics of target scattering points of echo signal and the matching characteristics of the dual-channel point extension function itself.The radial accumulation of wideband target echo signal in the complex domain is realized through the adaptive correlation processing of a dual-channel echo signal.The accu-mulation effect of high matching degree is achieved to improve the detection probability and the performance of wideband detection.Finally,electromagnetic simulation experiments and measured data verify that the proposed method has the advan-tages of high signal to noise ratio(SNR)gain and high detection probability under low SNR conditions.
基金supported in part by the Scientific Research Project of Hunan Provincial Department of Education under Grant 18A332 and 19A066,authors HW.D and Z.C,http://kxjsc.gov.hnedu.cn/in part by the Science and Technology Plan Project of Hunan Province under Grant 2016TP1020,author HW.D,http://kjt.hunan.gov.cn/.
文摘In recent years,target detection of aerial images of unmannedaerial vehicle(UAV)has become one of the hottest topics.However,targetdetection of UAV aerial images often presents false detection and misseddetection.We proposed a modified you only look once(YOLO)model toimprove the problems arising in object detection in UAV aerial images:(1)A new residual structure is designed to improve the ability to extract featuresby enhancing the fusion of the inner features of the single layer.At the sametime,triplet attention module is added to strengthen the connection betweenspace and channel and better retain important feature information.(2)Thefeature information is enriched by improving the multi-scale feature pyramidstructure and strengthening the feature fusion at different scales.(3)A newloss function is created and the diagonal penalty term of the anchor frame isintroduced to improve the speed of training and the accuracy of reasoning.The proposed model is called residual feature fusion triple attention YOLO(RT-YOLO).Experiments showed that the mean average precision(mAP)ofRT-YOLO is increased from 57.2%to 60.8%on the vehicle detection in aerialimage(VEDAI)dataset,and the mAP is also increased by 1.7%on the remotesensing object detection(RSOD)dataset.The results show that theRT-YOLOoutperforms other mainstream models in UAV aerial image object detection.
基金funded by National Natural Science Foundation of China,Fund Number 61703424.
文摘This paper proposes a real-time detection method to improve the Infrared small target detection CenterNet(ISTD-CenterNet)network for detecting small infrared targets in complex environments.The method eliminates the need for an anchor frame,addressing the issues of low accuracy and slow speed.HRNet is used as the framework for feature extraction,and an ECBAM attention module is added to each stage branch for intelligent identification of the positions of small targets and significant objects.A scale enhancement module is also added to obtain a high-level semantic representation and fine-resolution prediction map for the entire infrared image.Besides,an improved sensory field enhancement module is designed to leverage semantic information in low-resolution feature maps,and a convolutional attention mechanism module is used to increase network stability and convergence speed.Comparison experiments conducted on the infrared small target data set ESIRST.The experiments show that compared to the benchmark network CenterNet-HRNet,the proposed ISTD-CenterNet improves the recall by 22.85%and the detection accuracy by 13.36%.Compared to the state-of-the-art YOLOv5small,the ISTD-CenterNet recall is improved by 5.88%,the detection precision is improved by 2.33%,and the detection frame rate is 48.94 frames/sec,which realizes the accurate real-time detection of small infrared targets.
基金This work is supported by National Natural Science Foundation of China(Grant:62272109).
文摘Target signal acquisition and detection based on sonar images is a challenging task due to the complex underwater environment.In order to solve the problem that some semantic information in sonar images is lost and model detection performance is degraded due to the complex imaging environment,we proposed a more effective and robust target detection framework based on deep learning,which can make full use of the acoustic shadow information in the forward-looking sonar images to assist underwater target detection.Firstly,the weighted box fusion method is adopted to generate a fusion box by weighted fusion of prediction boxes with high confidence,so as to obtain accurate acoustic shadow boxes.Further,the acoustic shadow box is cut down to get the feature map containing the acoustic shadow information,and then the acoustic shadow feature map and the target information feature map are adaptively fused to make full use of the acoustic shadow feature information.In addition,we introduce a threshold processing module to improve the attention of the model to important feature information.Through the underwater sonar dataset provided by Pengcheng Laboratory,the proposed method improved the average accuracy by 3.14%at the IoU threshold of 0.7,which is better than the current traditional target detection model.
基金supported by The Natural Science Foundation of the Jiangsu Higher Education Institutions of China(Grants No.19JKB520031).
文摘Infrared target detection models are more required than ever before to be deployed on embedded platforms,which requires models with less memory consumption and better real-time performance while considering accuracy.To address the above challenges,we propose a modified You Only Look Once(YOLO)algorithm PF-YOLOv4-Tiny.The algorithm incorpo-rates spatial pyramidal pooling(SPP)and squeeze-and-excitation(SE)visual attention modules to enhance the target localization capability.The PANet-based-feature pyramid networks(P-FPN)are proposed to transfer semantic information and location information simultaneously to ameliorate detection accuracy.To lighten the network,the standard convolutions other than the backbone network are replaced with depthwise separable convolutions.In post-processing the images,the soft-non-maximum suppression(soft-NMS)algorithm is employed to subside the missed and false detection problems caused by the occlusion between targets.The accuracy of our model can finally reach 61.75%,while the total Params is only 9.3 M and GFLOPs is 11.At the same time,the inference speed reaches 87 FPS on NVIDIA GeForce GTX 1650 Ti,which can meet the requirements of the infrared target detection algorithm for the embedded deployments.
基金supported by the Natural Science Foundation of Sichuan Province of China under Grant No.2022NSFSC40574partially supported by the National Natural Science Foundation of China under Grants No.61571096 and No.61775030.
文摘In this paper,an algorithm based on a fractional time-frequency spectrum feature is proposed to improve the accuracy of synthetic aperture radar(SAR)target detection.By extending the fractional Gabor transform(FrGT)into two dimensions,the fractional time-frequency spectrum feature of an image can be obtained.In the achievement process,we search for the optimal order and design the optimal window function to accomplish the two-dimensional optimal FrGT.Finally,the energy attenuation gradient(EAG)feature of the optimal time-frequency spectrum is extracted for high-frequency detection.The simulation results show the proposed algorithm has a good performance in SAR target detection and lays the foundation for recognition.