Underwater target detection is extensively applied in domains such as underwater search and rescue,environmental monitoring,and marine resource surveys.It is crucial in enabling autonomous underwater robot operations ...Underwater target detection is extensively applied in domains such as underwater search and rescue,environmental monitoring,and marine resource surveys.It is crucial in enabling autonomous underwater robot operations and promoting ocean exploration.Nevertheless,low imaging quality,harsh underwater environments,and obscured objects considerably increase the difficulty of detecting underwater targets,making it difficult for current detection methods to achieve optimal performance.In order to enhance underwater object perception and improve target detection precision,we propose a lightweight underwater target detection method using You Only Look Once(YOLO)v8 with multi-scale cross-channel attention(MSCCA),named YOLOv8-UOD.In the proposed multiscale cross-channel attention module,multi-scale attention(MSA)augments the variety of attentional perception by extracting information from innately diverse sensory fields.The cross-channel strategy utilizes RepVGGbased channel shuffling(RCS)and one-shot aggregation(OSA)to rearrange feature map channels according to specific rules.It aggregates all features only once in the final feature mapping,resulting in the extraction of more comprehensive and valuable feature information.The experimental results show that the proposed YOLOv8-UOD achieves a mAP50 of 95.67%and FLOPs of 23.8 G on the Underwater Robot Picking Contest 2017(URPC2017)dataset,outperforming other methods in terms of detection precision and computational cost-efficiency.展开更多
Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including hig...Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including high-resolution imagery and exceptional mobility,making them well suited for monitoring flood extent and identifying rescue targets during floods.However,there are some challenges in interpreting rescue information in real time from flood images captured by UAVs,such as the complexity of the scenarios of UAV images,the lack of flood rescue target detection datasets and the limited real-time processing capabilities of the airborne on-board platform.Thus,we propose a real-time rescue target detection method for UAVs that is capable of efficiently delineating flood extent and identifying rescue targets(i.e.,pedestrians and vehicles trapped by floods).The proposed method achieves real-time rescue information extraction for UAV platforms by lightweight processing and fusion of flood extent extraction model and target detection model.The flood inundation range is extracted by the proposed method in real time and detects targets such as people and vehicles to be rescued based on this layer.Our experimental results demonstrate that the Intersection over Union(IoU)for flood water extraction reaches an impressive 80%,and the IoU for real-time flood water extraction stands at a commendable 76.4%.The information on flood stricken targets extracted by this method in real time can be used for flood emergency rescue.展开更多
Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightwe...Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightweight Ghost-YOLO(You Only Look Once)v8 algorithm.The algorithmintegrates advanced attention mechanisms and a smalltarget detection head to significantly enhance detection performance and efficiency.Firstly,an SE(Squeeze-and-Excitation)mechanism is incorporated into the backbone network to fortify the extraction of resilient features and precise target localization.This mechanism models feature channel dependencies,enabling adaptive adjustment of channel importance,thereby improving recognition of floating litter targets.Secondly,a 160×160 small-target detection layer is designed in the feature fusion neck to mitigate semantic information loss due to varying target scales.This design enhances the fusion of deep and shallow semantic information,improving small target feature representation and enabling better capture and identification of tiny floating litter.Thirdly,to balance performance and efficiency,the GhostConv module replaces part of the conventional convolutions in the feature fusion neck.Additionally,a novel C2fGhost(CSPDarknet53 to 2-Stage Feature Pyramid Networks Ghost)module is introduced to further reduce network parameters.Lastly,to address the challenge of occlusion,a newloss function,WIoU(Wise Intersection over Union)v3 incorporating a flexible and non-monotonic concentration approach,is adopted to improve detection rates for surface floating litter.The outcomes of the experiments demonstrate that the Ghost-YOLO v8 model proposed in this paper performs well in the dataset Marine,significantly enhances precision and recall by 3.3 and 7.6 percentage points,respectively,in contrast with the base model,mAP@0.5 and mAP 0.5:0.95 improve by 5.3 and 4.4 percentage points and reduces the computational volume by 1.88MB,the FPS value hardly decreases,and the efficient real-time identification of floating debris on the water’s surface can be achieved costeffectively.展开更多
Infrared small target detection technology plays a pivotal role in critical military applications,including early warning systems and precision guidance for missiles and other defense mechanisms.Nevertheless,existing ...Infrared small target detection technology plays a pivotal role in critical military applications,including early warning systems and precision guidance for missiles and other defense mechanisms.Nevertheless,existing traditional methods face several significant challenges,including low background suppression ability,low detection rates,and high false alarm rates when identifying infrared small targets in complex environments.This paper proposes a novel infrared small target detection method based on a transformed Gaussian filter kernel and clustering approach.The method provides improved background suppression and detection accuracy compared to traditional techniques while maintaining simplicity and lower computational costs.In the first step,the infrared image is filtered by a new filter kernel and the results of filtering are normalized.In the second step,an adaptive thresholding method is utilized to determine the pixels in small targets.In the final step,a fuzzy C-mean clustering algorithm is employed to group pixels in the same target,thus yielding the detection results.The results obtained from various real infrared image datasets demonstrate the superiority of the proposed method over traditional approaches.Compared with the traditional method of state of the arts detection method,the detection accuracy of the four sequences is increased by 2.06%,0.95%,1.03%,and 1.01%,respectively,and the false alarm rate is reduced,thus providing a more effective and robust solution.展开更多
Under the influence of air humidity,dust,aerosols,etc.,in real scenes,haze presents an uneven state.In this way,the image quality and contrast will decrease.In this case,It is difficult to detect the target in the ima...Under the influence of air humidity,dust,aerosols,etc.,in real scenes,haze presents an uneven state.In this way,the image quality and contrast will decrease.In this case,It is difficult to detect the target in the image by the universal detection network.Thus,a dual subnet based on multi-task collaborative training(DSMCT)is proposed in this paper.Firstly,in the training phase,the Gated Context Aggregation Network(GCANet)is used as the supervisory network of YOLOX to promote the extraction of clean information in foggy scenes.In the test phase,only the YOLOX branch needs to be activated to ensure the detection speed of the model.Secondly,the deformable convolution module is used to improve GCANet to enhance the model’s ability to capture details of non-homogeneous fog.Finally,the Coordinate Attention mechanism is introduced into the Vision Transformer and the backbone network of YOLOX is redesigned.In this way,the feature extraction ability of the network for deep-level information can be enhanced.The experimental results on artificial fog data set FOG_VOC and real fog data set RTTS show that the map value of DSMCT reached 86.56%and 62.39%,respectively,which was 2.27%and 4.41%higher than the current most advanced detection model.The DSMCT network has high practicality and effectiveness for target detection in real foggy scenes.展开更多
This paper expounds upon a novel target detection methodology distinguished by its elevated discriminatory efficacy,specifically tailored for environments characterized by markedly low luminance levels.Conventional me...This paper expounds upon a novel target detection methodology distinguished by its elevated discriminatory efficacy,specifically tailored for environments characterized by markedly low luminance levels.Conventional methodologies struggle with the challenges posed by luminosity fluctuations,especially in settings characterized by diminished radiance,further exacerbated by the utilization of suboptimal imaging instrumentation.The envisioned approach mandates a departure from the conventional YOLOX model,which exhibits inadequacies in mitigating these challenges.To enhance the efficacy of this approach in low-light conditions,the dehazing algorithm undergoes refinement,effecting a discerning regulation of the transmission rate at the pixel level,reducing it to values below 0.5,thereby resulting in an augmentation of image contrast.Subsequently,the coiflet wavelet transform is employed to discern and isolate high-discriminatory attributes by dismantling low-frequency image attributes and extracting high-frequency attributes across divergent axes.The utilization of CycleGAN serves to elevate the features of low-light imagery across an array of stylistic variances.Advanced computational methodologies are then employed to amalgamate and conflate intricate attributes originating from images characterized by distinct stylistic orientations,thereby augmenting the model’s erudition potential.Empirical validation conducted on the PASCAL VOC and MS COCO 2017 datasets substantiates pronounced advancements.The refined low-light enhancement algorithm yields a discernible 5.9%augmentation in the target detection evaluation index when compared to the original imagery.Mean Average Precision(mAP)undergoes enhancements of 9.45%and 0.052%in low-light visual renditions relative to conventional YOLOX outcomes.The envisaged approach presents a myriad of advantages over prevailing benchmark methodologies in the realm of target detection within environments marked by an acute scarcity of luminosity.展开更多
To address the challenges of missed detections in water surface target detection using solely visual algorithms in unmanned surface vehicle(USV)perception,this paper proposes a method based on the fusion of visual and...To address the challenges of missed detections in water surface target detection using solely visual algorithms in unmanned surface vehicle(USV)perception,this paper proposes a method based on the fusion of visual and LiDAR point-cloud projection for water surface target detection.Firstly,the visual recognition component employs an improved YOLOv7 algorithmbased on a self-built dataset for the detection of water surface targets.This algorithm modifies the original YOLOv7 architecture to a Slim-Neck structure,addressing the problemof excessive redundant information during feature extraction in the original YOLOv7 network model.Simultaneously,this modification simplifies the computational burden of the detector,reduces inference time,and maintains accuracy.Secondly,to tackle the issue of sample imbalance in the self-built dataset,slide loss function is introduced.Finally,this paper replaces the original Complete Intersection over Union(CIoU)loss function with the Minimum Point Distance Intersection over Union(MPDIoU)loss function in the YOLOv7 algorithm,which accelerates model learning and enhances robustness.To mitigate the problem of missed recognitions caused by complex water surface conditions in purely visual algorithms,this paper further adopts the fusion of LiDAR and camera data,projecting the threedimensional point-cloud data from LiDAR onto a two-dimensional pixel plane.This significantly reduces the rate of missed detections for water surface targets.展开更多
This paper presents an investigation on the effect of JPEG compression on the similarity between the target image and the background,where the similarity is further used to determine the degree of clutter in the image...This paper presents an investigation on the effect of JPEG compression on the similarity between the target image and the background,where the similarity is further used to determine the degree of clutter in the image.Four new clutter metrics based on image quality assessment are introduced,among which the Haar wavelet-based perceptual similarity index,known as HaarPSI,provides the best target acquisition prediction results.It is shown that the similarity between the target and the background at the boundary between visually lossless and visually lossy compression does not change significantly compared to the case when an uncompressed image is used.In future work,through subjective tests,it is necessary to check whether this presence of compression at the threshold of just noticeable differences will affect the human target acquisition performance.Similarity values are compared with the results of subjective tests of the well-known target Search_2 database,where the degree of agreement between objective and subjective scores,measured through linear correlation,reached a value of 90%.展开更多
In the field of remote sensing,the rapid and accurate acquisition of the category and location of airplanes has emerged as a prominent research.However,remote sensing fuzzy imaging and complex environmental interferen...In the field of remote sensing,the rapid and accurate acquisition of the category and location of airplanes has emerged as a prominent research.However,remote sensing fuzzy imaging and complex environmental interference affect airplane detection.Besides,the inconsistency in the size of remote sensing images and the low accuracy of small target detection are crucial challenges that need to be addressed.To tackle these issues,we propose a novel network SDaDCS(SAHI-data augmentation-dilation-channel and spatial attention)based on YOLOX model and the slicing aided hyper inference(SAHI)framework,a new data augmentation technique and dilation-channel and spatial(DCS)attention mechanism.Initially,we create a remote sensing dataset for airplane targets and introduce a new data augmentation technique based on the Rotate-Mixup and mixed data augmentation to enhance data diversity.The DCS attention mechanism,which comprises the dilated convolution block,channel attention and spatial attention,is designed to bolster the feature extraction and discrimination of the network.To address the challenges arised by the difficulties of detecting small targets,we integrate the YOLOX model with the SAHI framework.Experiment results show that,when compared to the original YOLOX model,the proposed SDaDCS remote sensing target detection algorithm enhances overall accuracy by 13.6%.The experimental results validate the effectiveness of the proposed algorithm.展开更多
In order to address the problem of high false alarm rate and low probabilities of infrared small target detection in complex low-altitude background,an infrared small target detection method based on improved weighted...In order to address the problem of high false alarm rate and low probabilities of infrared small target detection in complex low-altitude background,an infrared small target detection method based on improved weighted local contrast is proposed in this paper.First,the ratio information between the target and local background is utilized as an enhancement factor.The local contrast is calculated by incorporating the heterogeneity between the target and local background.Then,a local product weighted method is designed based on the spatial dissimilarity between target and background to further enhance target while suppressing background.Finally,the location of target is obtained by adaptive threshold segmentation.As experimental results demonstrate,the method shows superior performance in several evaluation metrics compared with six existing algorithms on different datasets containing targets such as unmanned aerial vehicles(UAV).展开更多
The detection of hypersonic targets usually confronts range migration(RM)issue before coherent integration(CI).The traditional methods aiming at correcting RM to obtain CI mainly considers the narrow-band radar condit...The detection of hypersonic targets usually confronts range migration(RM)issue before coherent integration(CI).The traditional methods aiming at correcting RM to obtain CI mainly considers the narrow-band radar condition.However,with the increasing requirement of far-range detection,the time bandwidth product,which is corresponding to radar’s mean power,should be promoted in actual application.Thus,the echo signal generates the scale effect(SE)at large time bandwidth product situation,influencing the intra and inter pulse integration performance.To eliminate SE and correct RM,this paper proposes an effective algorithm,i.e.,scaled location rotation transform(ScLRT).The ScLRT can remove SE to obtain the matching pulse compression(PC)as well as correct RM to complete CI via the location rotation transform,being implemented by seeking the actual rotation angle.Compared to the traditional coherent detection algorithms,Sc LRT can address the SE problem to achieve better detection/estimation capabilities.At last,this paper gives several simulations to assess the viability of ScLRT.展开更多
In order to solve the problems that the current synthetic aperture radar(SAR)image target detection method cannot adapt to targets of different sizes,and the complex image background leads to low detection accuracy,an...In order to solve the problems that the current synthetic aperture radar(SAR)image target detection method cannot adapt to targets of different sizes,and the complex image background leads to low detection accuracy,an improved SAR image small target detection method based on YOLOv7 was proposed in this study.The proposed method improved the feature extraction network by using Switchable Around Convolution(SAConv)in the backbone network to help the model capture target information at different scales,thus improving the feature extraction ability for small targets.Based on the attention mechanism,the DyHead module was embedded in the target detection head to reduce the impact of complex background,and better focus on the small targets.In addition,the NWD loss function was introduced and combined with CIoU loss.Compared to the CIoU loss function typically used in YOLOv7,the NWD loss function pays more attention to the processing of small targets,so as to further improve the detection ability of small targets.The experimental results on the HRSID dataset indicate that the proposed method achieved mAP@0.5 and mAP@0.95 scores of 93.5%and 71.5%,respectively.Compared to the baseline model,this represents an increase of 7.2%and 7.6%,respectively.The proposed method can effectively complete the task of SAR image small target detection.展开更多
In order to enhance the reliability of the moving target detection, an adaptive moving target detection algorithm based on the Gaussian mixture model is proposed. This algorithm employs Gaussian mixture distributions ...In order to enhance the reliability of the moving target detection, an adaptive moving target detection algorithm based on the Gaussian mixture model is proposed. This algorithm employs Gaussian mixture distributions in modeling the background of each pixel. As a result, the number of Gaussian distributions is not fixed but adaptively changes with the change of the pixel value frequency. The pixels of the difference image are divided into two parts according to their values. Then the two parts are separately segmented by the adaptive threshold, and finally the foreground image is obtained. The shadow elimination method based on morphological reconstruction is introduced to improve the performance of foreground image's segmentation. Experimental results show that the proposed algorithm can quickly and accurately build the background model and it is more robust in different real scenes.展开更多
The detection and ima ging of moving targets based on airborne synthetic aperture radar (SAR) is a cru cial technique for the modern radar. Firstly, the mathematical model of SAR ech o signal which comes from moving t...The detection and ima ging of moving targets based on airborne synthetic aperture radar (SAR) is a cru cial technique for the modern radar. Firstly, the mathematical model of SAR ech o signal which comes from moving targets is constructed. Based on this model, th e features of moving target imaging are introduced and the effects of target mov ement to SAR imaging are analyzed. Then the development and the status of this t echnique are reviewed in detail. Finally, some frontiers of this field are point ed out.展开更多
To solve the problem of insufficient ability when detecting the high-speed moving target with passive millimeter wave technology, a direct-detection passive millimeter wave detecting system using the monolithic microw...To solve the problem of insufficient ability when detecting the high-speed moving target with passive millimeter wave technology, a direct-detection passive millimeter wave detecting system using the monolithic microwave integrated cir- cuit (MMIC) millimeter wave radiometer is built, and the measured data are obtained by experiment under different condi- tions. Based on feature analysis of testing signals, it points out that the peak of the first pulse and interval of two peak pulses are valid features which can reflect the motion characteristic of target. A method to calculate the moving speed of target is put forward. The calculating results indicate that the proposed method has enough accuracy and is feasible to determine the parameters of the moving target using for passive millimeter wave system.展开更多
This paper presents a method for detecting the small infrared target under complex background. An algorithm, named local mutation weighted information entropy (LMWIE), is proposed to suppress background. Then, the g...This paper presents a method for detecting the small infrared target under complex background. An algorithm, named local mutation weighted information entropy (LMWIE), is proposed to suppress background. Then, the grey value of targets is enhanced by calculating the local energy. Image segmentation based on the adaptive threshold is used to solve the problems that the grey value of noise is enhanced with the grey value improvement of targets. Experimental results show that compared with the adaptive Butterworth high-pass filter method, the proposed algorithm is more effective and faster for the infrared small target detection.展开更多
GPR has become an important geophysical method in UXO and landmine detection, for it can detect both metal and non-metallic targets. However, it is difficult to remove the strong clutters from surface-layer reflection...GPR has become an important geophysical method in UXO and landmine detection, for it can detect both metal and non-metallic targets. However, it is difficult to remove the strong clutters from surface-layer reflection and soil due to the low signal to noise ratio of GPR data. In this paper, we use the adaptive chirplet transform to reject these clutters based on their character and then pick up the signal from the UXO by the transform based on the Radon-Wigner distribution. The results from the processing show that the clutter can be rejected effectively and the target response can be measured with high SNR.展开更多
A space-borne synthetic aperture radar (SAR), a high frequency surface wave radar (HFSWR), and a ship automatic identification system (AIS) are the main remote sensors for vessel monitoring in a wide range. Thes...A space-borne synthetic aperture radar (SAR), a high frequency surface wave radar (HFSWR), and a ship automatic identification system (AIS) are the main remote sensors for vessel monitoring in a wide range. These three sensors have their own advantages and weaknesses, and they can complement each other in some situations. So it would improve the capability of vessel target detection to use multiple sensors including SAR, HFSWR, and A/S to identify non-cooperative vessel targets from the fusion results. During the fusion process of multiple sensors' detection results, point association is one of the key steps, and it can affect the accuracy of the data fusion and the efficiency of a non-cooperative target's recognition. This study investigated the point association analyses of vessel target detection under different conditions: space- borne SAR paired with AIS, as well as HFSWR, paired with AIS, and the characteristics of the SAR and the HFSWR and their capability of vessel target detection. Then a point association method of multiple sensors was proposed. Finally, the thresholds selection of key parameters in the points association (including range threshold, radial velocity threshold, and azimuth threshold) were investigated, and their influences on final association results were analyzed.展开更多
According to the oversampling imaging characteristics, an infrared small target detection method based on deep learning is proposed. A 7-layer deep convolutional neural network(CNN) is designed to automatically extrac...According to the oversampling imaging characteristics, an infrared small target detection method based on deep learning is proposed. A 7-layer deep convolutional neural network(CNN) is designed to automatically extract small target features and suppress clutters in an end-to-end manner. The input of CNN is an original oversampling image while the output is a cluttersuppressed feature map. The CNN contains only convolution and non-linear operations, and the resolution of the output feature map is the same as that of the input image. The L1-norm loss function is used, and a mass of training data is generated to train the network effectively. Results show that compared with several baseline methods, the proposed method improves the signal clutter ratio gain and background suppression factor by 3–4 orders of magnitude, and has more powerful target detection performance.展开更多
Sparse representation has recently been proved to be a powerful tool in image processing and object recognition.This paper proposes a novel small target detection algorithm based on this technique.By modelling a small...Sparse representation has recently been proved to be a powerful tool in image processing and object recognition.This paper proposes a novel small target detection algorithm based on this technique.By modelling a small target as a linear combination of certain target samples and then solving a sparse 0-minimization problem,the proposed apporach successfully improves and optimizes the small target representation with innovation.Furthermore,the sparsity concentration index(SCI) is creatively employed to evaluate the coefficients of each block representation and simpfy target identification.In the detection frame,target samples are firstly generated to constitute an over-complete dictionary matrix using Gaussian intensity model(GIM),and then sparse model solvers are applied to finding sparse representation for each sub-image block.Finally,SCI lexicographical evalution of the entire image incorparates with a simple threshold locate target position.The effectiveness and robustness of the proposed algorithm are demonstrated by the exprimental results.展开更多
基金supported in part by the National Natural Science Foundation of China Grants 62402085,61972062,62306060the Liaoning Doctoral Research Start-Up Fund 2023-BS-078+1 种基金the Dalian Youth Science and Technology Star Project 2023RQ023the Liaoning Basic Research Project 2023JH2/101300191.
文摘Underwater target detection is extensively applied in domains such as underwater search and rescue,environmental monitoring,and marine resource surveys.It is crucial in enabling autonomous underwater robot operations and promoting ocean exploration.Nevertheless,low imaging quality,harsh underwater environments,and obscured objects considerably increase the difficulty of detecting underwater targets,making it difficult for current detection methods to achieve optimal performance.In order to enhance underwater object perception and improve target detection precision,we propose a lightweight underwater target detection method using You Only Look Once(YOLO)v8 with multi-scale cross-channel attention(MSCCA),named YOLOv8-UOD.In the proposed multiscale cross-channel attention module,multi-scale attention(MSA)augments the variety of attentional perception by extracting information from innately diverse sensory fields.The cross-channel strategy utilizes RepVGGbased channel shuffling(RCS)and one-shot aggregation(OSA)to rearrange feature map channels according to specific rules.It aggregates all features only once in the final feature mapping,resulting in the extraction of more comprehensive and valuable feature information.The experimental results show that the proposed YOLOv8-UOD achieves a mAP50 of 95.67%and FLOPs of 23.8 G on the Underwater Robot Picking Contest 2017(URPC2017)dataset,outperforming other methods in terms of detection precision and computational cost-efficiency.
基金National Natural Science Foundation of China(No.42271416)Guangxi Science and Technology Major Project(No.AA22068072)Shennongjia National Park Resources Comprehensive Investigation Research Project(No.SNJNP2023015).
文摘Timely acquisition of rescue target information is critical for emergency response after a flood disaster.Unmanned Aerial Vehicles(UAVs)equipped with remote sensing capabilities offer distinct advantages,including high-resolution imagery and exceptional mobility,making them well suited for monitoring flood extent and identifying rescue targets during floods.However,there are some challenges in interpreting rescue information in real time from flood images captured by UAVs,such as the complexity of the scenarios of UAV images,the lack of flood rescue target detection datasets and the limited real-time processing capabilities of the airborne on-board platform.Thus,we propose a real-time rescue target detection method for UAVs that is capable of efficiently delineating flood extent and identifying rescue targets(i.e.,pedestrians and vehicles trapped by floods).The proposed method achieves real-time rescue information extraction for UAV platforms by lightweight processing and fusion of flood extent extraction model and target detection model.The flood inundation range is extracted by the proposed method in real time and detects targets such as people and vehicles to be rescued based on this layer.Our experimental results demonstrate that the Intersection over Union(IoU)for flood water extraction reaches an impressive 80%,and the IoU for real-time flood water extraction stands at a commendable 76.4%.The information on flood stricken targets extracted by this method in real time can be used for flood emergency rescue.
基金Supported by the fund of the Henan Province Science and Technology Research Project(No.242102210213).
文摘Addressing the challenges in detecting surface floating litter in artificial lakes,including complex environments,uneven illumination,and susceptibility to noise andweather,this paper proposes an efficient and lightweight Ghost-YOLO(You Only Look Once)v8 algorithm.The algorithmintegrates advanced attention mechanisms and a smalltarget detection head to significantly enhance detection performance and efficiency.Firstly,an SE(Squeeze-and-Excitation)mechanism is incorporated into the backbone network to fortify the extraction of resilient features and precise target localization.This mechanism models feature channel dependencies,enabling adaptive adjustment of channel importance,thereby improving recognition of floating litter targets.Secondly,a 160×160 small-target detection layer is designed in the feature fusion neck to mitigate semantic information loss due to varying target scales.This design enhances the fusion of deep and shallow semantic information,improving small target feature representation and enabling better capture and identification of tiny floating litter.Thirdly,to balance performance and efficiency,the GhostConv module replaces part of the conventional convolutions in the feature fusion neck.Additionally,a novel C2fGhost(CSPDarknet53 to 2-Stage Feature Pyramid Networks Ghost)module is introduced to further reduce network parameters.Lastly,to address the challenge of occlusion,a newloss function,WIoU(Wise Intersection over Union)v3 incorporating a flexible and non-monotonic concentration approach,is adopted to improve detection rates for surface floating litter.The outcomes of the experiments demonstrate that the Ghost-YOLO v8 model proposed in this paper performs well in the dataset Marine,significantly enhances precision and recall by 3.3 and 7.6 percentage points,respectively,in contrast with the base model,mAP@0.5 and mAP 0.5:0.95 improve by 5.3 and 4.4 percentage points and reduces the computational volume by 1.88MB,the FPS value hardly decreases,and the efficient real-time identification of floating debris on the water’s surface can be achieved costeffectively.
基金supported by the Funding of Jiangsu University of Science and Technology,under the grant number:1132921208.
文摘Infrared small target detection technology plays a pivotal role in critical military applications,including early warning systems and precision guidance for missiles and other defense mechanisms.Nevertheless,existing traditional methods face several significant challenges,including low background suppression ability,low detection rates,and high false alarm rates when identifying infrared small targets in complex environments.This paper proposes a novel infrared small target detection method based on a transformed Gaussian filter kernel and clustering approach.The method provides improved background suppression and detection accuracy compared to traditional techniques while maintaining simplicity and lower computational costs.In the first step,the infrared image is filtered by a new filter kernel and the results of filtering are normalized.In the second step,an adaptive thresholding method is utilized to determine the pixels in small targets.In the final step,a fuzzy C-mean clustering algorithm is employed to group pixels in the same target,thus yielding the detection results.The results obtained from various real infrared image datasets demonstrate the superiority of the proposed method over traditional approaches.Compared with the traditional method of state of the arts detection method,the detection accuracy of the four sequences is increased by 2.06%,0.95%,1.03%,and 1.01%,respectively,and the false alarm rate is reduced,thus providing a more effective and robust solution.
基金This work was jointly supported by the Special Fund for Transformation and Upgrade of Jiangsu Industry and Information Industry-Key Core Technologies(Equipment)Key Industrialization Projects in 2022(No.CMHI-2022-RDG-004):“Key Technology Research for Development of Intelligent Wind Power Operation and Maintenance Mothership in Deep Sea”.
文摘Under the influence of air humidity,dust,aerosols,etc.,in real scenes,haze presents an uneven state.In this way,the image quality and contrast will decrease.In this case,It is difficult to detect the target in the image by the universal detection network.Thus,a dual subnet based on multi-task collaborative training(DSMCT)is proposed in this paper.Firstly,in the training phase,the Gated Context Aggregation Network(GCANet)is used as the supervisory network of YOLOX to promote the extraction of clean information in foggy scenes.In the test phase,only the YOLOX branch needs to be activated to ensure the detection speed of the model.Secondly,the deformable convolution module is used to improve GCANet to enhance the model’s ability to capture details of non-homogeneous fog.Finally,the Coordinate Attention mechanism is introduced into the Vision Transformer and the backbone network of YOLOX is redesigned.In this way,the feature extraction ability of the network for deep-level information can be enhanced.The experimental results on artificial fog data set FOG_VOC and real fog data set RTTS show that the map value of DSMCT reached 86.56%and 62.39%,respectively,which was 2.27%and 4.41%higher than the current most advanced detection model.The DSMCT network has high practicality and effectiveness for target detection in real foggy scenes.
基金supported by National Sciences Foundation of China Grants(No.61902158).
文摘This paper expounds upon a novel target detection methodology distinguished by its elevated discriminatory efficacy,specifically tailored for environments characterized by markedly low luminance levels.Conventional methodologies struggle with the challenges posed by luminosity fluctuations,especially in settings characterized by diminished radiance,further exacerbated by the utilization of suboptimal imaging instrumentation.The envisioned approach mandates a departure from the conventional YOLOX model,which exhibits inadequacies in mitigating these challenges.To enhance the efficacy of this approach in low-light conditions,the dehazing algorithm undergoes refinement,effecting a discerning regulation of the transmission rate at the pixel level,reducing it to values below 0.5,thereby resulting in an augmentation of image contrast.Subsequently,the coiflet wavelet transform is employed to discern and isolate high-discriminatory attributes by dismantling low-frequency image attributes and extracting high-frequency attributes across divergent axes.The utilization of CycleGAN serves to elevate the features of low-light imagery across an array of stylistic variances.Advanced computational methodologies are then employed to amalgamate and conflate intricate attributes originating from images characterized by distinct stylistic orientations,thereby augmenting the model’s erudition potential.Empirical validation conducted on the PASCAL VOC and MS COCO 2017 datasets substantiates pronounced advancements.The refined low-light enhancement algorithm yields a discernible 5.9%augmentation in the target detection evaluation index when compared to the original imagery.Mean Average Precision(mAP)undergoes enhancements of 9.45%and 0.052%in low-light visual renditions relative to conventional YOLOX outcomes.The envisaged approach presents a myriad of advantages over prevailing benchmark methodologies in the realm of target detection within environments marked by an acute scarcity of luminosity.
基金supported by the National Natural Science Foundation of China(No.51876114)the Shanghai Engineering Research Center of Marine Renewable Energy(Grant No.19DZ2254800).
文摘To address the challenges of missed detections in water surface target detection using solely visual algorithms in unmanned surface vehicle(USV)perception,this paper proposes a method based on the fusion of visual and LiDAR point-cloud projection for water surface target detection.Firstly,the visual recognition component employs an improved YOLOv7 algorithmbased on a self-built dataset for the detection of water surface targets.This algorithm modifies the original YOLOv7 architecture to a Slim-Neck structure,addressing the problemof excessive redundant information during feature extraction in the original YOLOv7 network model.Simultaneously,this modification simplifies the computational burden of the detector,reduces inference time,and maintains accuracy.Secondly,to tackle the issue of sample imbalance in the self-built dataset,slide loss function is introduced.Finally,this paper replaces the original Complete Intersection over Union(CIoU)loss function with the Minimum Point Distance Intersection over Union(MPDIoU)loss function in the YOLOv7 algorithm,which accelerates model learning and enhances robustness.To mitigate the problem of missed recognitions caused by complex water surface conditions in purely visual algorithms,this paper further adopts the fusion of LiDAR and camera data,projecting the threedimensional point-cloud data from LiDAR onto a two-dimensional pixel plane.This significantly reduces the rate of missed detections for water surface targets.
文摘This paper presents an investigation on the effect of JPEG compression on the similarity between the target image and the background,where the similarity is further used to determine the degree of clutter in the image.Four new clutter metrics based on image quality assessment are introduced,among which the Haar wavelet-based perceptual similarity index,known as HaarPSI,provides the best target acquisition prediction results.It is shown that the similarity between the target and the background at the boundary between visually lossless and visually lossy compression does not change significantly compared to the case when an uncompressed image is used.In future work,through subjective tests,it is necessary to check whether this presence of compression at the threshold of just noticeable differences will affect the human target acquisition performance.Similarity values are compared with the results of subjective tests of the well-known target Search_2 database,where the degree of agreement between objective and subjective scores,measured through linear correlation,reached a value of 90%.
基金supported in part by National Natural Science Foundation of China(No.62471034)Hebei Natural Science Foundation(No.F2023105001)。
文摘In the field of remote sensing,the rapid and accurate acquisition of the category and location of airplanes has emerged as a prominent research.However,remote sensing fuzzy imaging and complex environmental interference affect airplane detection.Besides,the inconsistency in the size of remote sensing images and the low accuracy of small target detection are crucial challenges that need to be addressed.To tackle these issues,we propose a novel network SDaDCS(SAHI-data augmentation-dilation-channel and spatial attention)based on YOLOX model and the slicing aided hyper inference(SAHI)framework,a new data augmentation technique and dilation-channel and spatial(DCS)attention mechanism.Initially,we create a remote sensing dataset for airplane targets and introduce a new data augmentation technique based on the Rotate-Mixup and mixed data augmentation to enhance data diversity.The DCS attention mechanism,which comprises the dilated convolution block,channel attention and spatial attention,is designed to bolster the feature extraction and discrimination of the network.To address the challenges arised by the difficulties of detecting small targets,we integrate the YOLOX model with the SAHI framework.Experiment results show that,when compared to the original YOLOX model,the proposed SDaDCS remote sensing target detection algorithm enhances overall accuracy by 13.6%.The experimental results validate the effectiveness of the proposed algorithm.
基金supported by the National Natural Science Foundation of China (No.U1833203),the National Natural Science Foundation of China (No.62301036)the Aviation Science Foundation (No.2020Z019055001)China Postdoctoral Science Foundation Funded Project (No.2022M720446)。
文摘In order to address the problem of high false alarm rate and low probabilities of infrared small target detection in complex low-altitude background,an infrared small target detection method based on improved weighted local contrast is proposed in this paper.First,the ratio information between the target and local background is utilized as an enhancement factor.The local contrast is calculated by incorporating the heterogeneity between the target and local background.Then,a local product weighted method is designed based on the spatial dissimilarity between target and background to further enhance target while suppressing background.Finally,the location of target is obtained by adaptive threshold segmentation.As experimental results demonstrate,the method shows superior performance in several evaluation metrics compared with six existing algorithms on different datasets containing targets such as unmanned aerial vehicles(UAV).
基金supported by the National Natural Science Foundation of China(62101099)the Chinese Postdoctoral Science Foundation(2021M690558,2022T150100,2018M633352,2019T120825)+3 种基金the Young Elite Scientist Sponsorship Program(YESS20200082)the Aeronautical Science Foundation of China(2022Z017080001)the Open Foundation of Science and Technology on Electronic Information Control Laboratorythe Natural Science Foundation of Sichuan Province(2023NSFSC1386)。
文摘The detection of hypersonic targets usually confronts range migration(RM)issue before coherent integration(CI).The traditional methods aiming at correcting RM to obtain CI mainly considers the narrow-band radar condition.However,with the increasing requirement of far-range detection,the time bandwidth product,which is corresponding to radar’s mean power,should be promoted in actual application.Thus,the echo signal generates the scale effect(SE)at large time bandwidth product situation,influencing the intra and inter pulse integration performance.To eliminate SE and correct RM,this paper proposes an effective algorithm,i.e.,scaled location rotation transform(ScLRT).The ScLRT can remove SE to obtain the matching pulse compression(PC)as well as correct RM to complete CI via the location rotation transform,being implemented by seeking the actual rotation angle.Compared to the traditional coherent detection algorithms,Sc LRT can address the SE problem to achieve better detection/estimation capabilities.At last,this paper gives several simulations to assess the viability of ScLRT.
文摘In order to solve the problems that the current synthetic aperture radar(SAR)image target detection method cannot adapt to targets of different sizes,and the complex image background leads to low detection accuracy,an improved SAR image small target detection method based on YOLOv7 was proposed in this study.The proposed method improved the feature extraction network by using Switchable Around Convolution(SAConv)in the backbone network to help the model capture target information at different scales,thus improving the feature extraction ability for small targets.Based on the attention mechanism,the DyHead module was embedded in the target detection head to reduce the impact of complex background,and better focus on the small targets.In addition,the NWD loss function was introduced and combined with CIoU loss.Compared to the CIoU loss function typically used in YOLOv7,the NWD loss function pays more attention to the processing of small targets,so as to further improve the detection ability of small targets.The experimental results on the HRSID dataset indicate that the proposed method achieved mAP@0.5 and mAP@0.95 scores of 93.5%and 71.5%,respectively.Compared to the baseline model,this represents an increase of 7.2%and 7.6%,respectively.The proposed method can effectively complete the task of SAR image small target detection.
基金The National Natural Science Foundation of China (No.61172135,61101198)the Aeronautical Foundation of China (No.20115152026)
文摘In order to enhance the reliability of the moving target detection, an adaptive moving target detection algorithm based on the Gaussian mixture model is proposed. This algorithm employs Gaussian mixture distributions in modeling the background of each pixel. As a result, the number of Gaussian distributions is not fixed but adaptively changes with the change of the pixel value frequency. The pixels of the difference image are divided into two parts according to their values. Then the two parts are separately segmented by the adaptive threshold, and finally the foreground image is obtained. The shadow elimination method based on morphological reconstruction is introduced to improve the performance of foreground image's segmentation. Experimental results show that the proposed algorithm can quickly and accurately build the background model and it is more robust in different real scenes.
文摘The detection and ima ging of moving targets based on airborne synthetic aperture radar (SAR) is a cru cial technique for the modern radar. Firstly, the mathematical model of SAR ech o signal which comes from moving targets is constructed. Based on this model, th e features of moving target imaging are introduced and the effects of target mov ement to SAR imaging are analyzed. Then the development and the status of this t echnique are reviewed in detail. Finally, some frontiers of this field are point ed out.
文摘To solve the problem of insufficient ability when detecting the high-speed moving target with passive millimeter wave technology, a direct-detection passive millimeter wave detecting system using the monolithic microwave integrated cir- cuit (MMIC) millimeter wave radiometer is built, and the measured data are obtained by experiment under different condi- tions. Based on feature analysis of testing signals, it points out that the peak of the first pulse and interval of two peak pulses are valid features which can reflect the motion characteristic of target. A method to calculate the moving speed of target is put forward. The calculating results indicate that the proposed method has enough accuracy and is feasible to determine the parameters of the moving target using for passive millimeter wave system.
基金supported by the National Natural Science Foundation of China (61171194)
文摘This paper presents a method for detecting the small infrared target under complex background. An algorithm, named local mutation weighted information entropy (LMWIE), is proposed to suppress background. Then, the grey value of targets is enhanced by calculating the local energy. Image segmentation based on the adaptive threshold is used to solve the problems that the grey value of noise is enhanced with the grey value improvement of targets. Experimental results show that compared with the adaptive Butterworth high-pass filter method, the proposed algorithm is more effective and faster for the infrared small target detection.
基金This work was supported by U.S. Department of Defense Science Research Fund (Grant No. DAAD 19-03-1-0375) and the National Natural Science Foundation of China (Grant No. 40774055).
文摘GPR has become an important geophysical method in UXO and landmine detection, for it can detect both metal and non-metallic targets. However, it is difficult to remove the strong clutters from surface-layer reflection and soil due to the low signal to noise ratio of GPR data. In this paper, we use the adaptive chirplet transform to reject these clutters based on their character and then pick up the signal from the UXO by the transform based on the Radon-Wigner distribution. The results from the processing show that the clutter can be rejected effectively and the target response can be measured with high SNR.
基金The Special Funds for Fundamental Research Project of China under contract No.2008T04the Marine Scientific Research Special Funds for Public Welfare of China under contract No.200905029
文摘A space-borne synthetic aperture radar (SAR), a high frequency surface wave radar (HFSWR), and a ship automatic identification system (AIS) are the main remote sensors for vessel monitoring in a wide range. These three sensors have their own advantages and weaknesses, and they can complement each other in some situations. So it would improve the capability of vessel target detection to use multiple sensors including SAR, HFSWR, and A/S to identify non-cooperative vessel targets from the fusion results. During the fusion process of multiple sensors' detection results, point association is one of the key steps, and it can affect the accuracy of the data fusion and the efficiency of a non-cooperative target's recognition. This study investigated the point association analyses of vessel target detection under different conditions: space- borne SAR paired with AIS, as well as HFSWR, paired with AIS, and the characteristics of the SAR and the HFSWR and their capability of vessel target detection. Then a point association method of multiple sensors was proposed. Finally, the thresholds selection of key parameters in the points association (including range threshold, radial velocity threshold, and azimuth threshold) were investigated, and their influences on final association results were analyzed.
基金supported by the National Key Research and Development Program of China(2016YFB0500901)the Natural Science Foundation of Shanghai(18ZR1437200)the Satellite Mapping Technology and Application National Key Laboratory of Geographical Information Bureau(KLSMTA-201709)
文摘According to the oversampling imaging characteristics, an infrared small target detection method based on deep learning is proposed. A 7-layer deep convolutional neural network(CNN) is designed to automatically extract small target features and suppress clutters in an end-to-end manner. The input of CNN is an original oversampling image while the output is a cluttersuppressed feature map. The CNN contains only convolution and non-linear operations, and the resolution of the output feature map is the same as that of the input image. The L1-norm loss function is used, and a mass of training data is generated to train the network effectively. Results show that compared with several baseline methods, the proposed method improves the signal clutter ratio gain and background suppression factor by 3–4 orders of magnitude, and has more powerful target detection performance.
基金supported by the Inter-governmental Science and Technology Cooperation Project (2009DFA12870)
文摘Sparse representation has recently been proved to be a powerful tool in image processing and object recognition.This paper proposes a novel small target detection algorithm based on this technique.By modelling a small target as a linear combination of certain target samples and then solving a sparse 0-minimization problem,the proposed apporach successfully improves and optimizes the small target representation with innovation.Furthermore,the sparsity concentration index(SCI) is creatively employed to evaluate the coefficients of each block representation and simpfy target identification.In the detection frame,target samples are firstly generated to constitute an over-complete dictionary matrix using Gaussian intensity model(GIM),and then sparse model solvers are applied to finding sparse representation for each sub-image block.Finally,SCI lexicographical evalution of the entire image incorparates with a simple threshold locate target position.The effectiveness and robustness of the proposed algorithm are demonstrated by the exprimental results.