Funding: Funded by the National Natural Science Foundation of China (Grant No. 52072408), author Y.C.
Abstract: Onboard visual object tracking in unmanned aerial vehicles (UAVs) has attracted much interest due to its versatility. Meanwhile, owing to their high precision, Siamese networks have become a research hotspot in visual object tracking. However, most Siamese trackers fail to balance tracking accuracy and running time within the limited onboard computational resources of UAVs. To meet both the tracking precision and real-time requirements, this paper proposes a Siamese dense pixel-level network for UAV object tracking named SiamDPL. Specifically, the Siamese network extracts features of the search region and the template region through a parameter-shared backbone network, then performs correlation matching to obtain the candidate region with high similarity. To improve the matching of template and search features, this paper designs a dense pixel-level feature fusion module that enhances matching ability through pixel-wise correlation and enriches feature diversity through dense connections. An attention module composed of self-attention and channel attention is introduced to learn global context information and selectively emphasize the target feature region in the spatial and channel dimensions. In addition, a target localization module is designed to improve target location accuracy. Experiments on two public benchmarks, UAV123@10fps and UAV20L from the unmanned air vehicle 123 (UAV123) dataset, show that, compared with other advanced trackers, SiamDPL achieves superior performance and low complexity with a running speed of 100.1 fps on an NVIDIA TITAN RTX.
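The abstract does not spell out how the pixel-wise correlation is computed. The sketch below shows one common formulation, assumed here for illustration only: every template pixel's feature vector acts as a 1×1 kernel that is dotted with every search pixel's feature vector. The function name `pixelwise_correlation` and the flattened list layout are hypothetical and are not taken from SiamDPL.

```python
from typing import List

Vector = List[float]


def pixelwise_correlation(template: List[Vector], search: List[Vector]) -> List[List[float]]:
    """Correlate every template pixel with every search pixel.

    template: Nz feature vectors of length C (template map flattened to Hz*Wz pixels).
    search:   Nx feature vectors of length C (search map flattened to Hx*Wx pixels).
    Returns an Nz x Nx matrix; row i is the response map produced by using
    template pixel i as a 1x1 kernel over the search features.
    """
    return [
        [sum(z_c * x_c for z_c, x_c in zip(z, x)) for x in search]
        for z in template
    ]


# Toy example: 2 template pixels, 3 search pixels, C = 2 channels.
template_feats = [[1.0, 0.0], [0.0, 1.0]]
search_feats = [[2.0, 3.0], [4.0, 5.0], [6.0, 7.0]]
response = pixelwise_correlation(template_feats, search_feats)
```

Under this assumption the output has one response channel per template pixel, so spatial detail of the template is kept rather than collapsed into a single similarity map.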
Funding: Supported by the National Natural Science Foundation of China under Grant 62003247, Grant 62075169, and Grant 62061160370.
Abstract: Infrared (IR) small target detection is one of the key technologies of infrared search and track (IRST) systems. Existing methods are limited in detection performance, especially when the target size is irregular or the background is complex. In this paper, we propose a pixel-level local contrast measure (PLLCM), which can subdivide small targets and backgrounds at the pixel level simultaneously. With pixel-level segmentation, the difference between the target and the background becomes more pronounced, which helps to improve detection performance. First, we design a multiscale sliding window to quickly extract candidate target pixels. Then, a local window based on the random walker (RW) algorithm is designed for pixel-level target segmentation. After that, PLLCM incorporating probability weights and scale constraints is proposed to accurately measure local contrast and suppress various types of background interference. Finally, an adaptive threshold operation is applied to separate the target from the PLLCM-enhanced map. Experimental results show that the proposed method has a higher detection rate and a lower false alarm rate than the baseline algorithms, while achieving a high speed.
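The final step, separating the target from the enhanced map with an adaptive threshold, can be sketched as follows. The abstract does not give the threshold formula; the rule used here, mean plus `k` standard deviations of the map, is a widely used choice and is an assumption, as are the function name and the value of `k`.

```python
from statistics import mean, pstdev
from typing import List, Tuple


def adaptive_threshold(contrast_map: List[List[float]], k: float = 3.0) -> Tuple[List[List[int]], float]:
    """Binarize a contrast-enhanced map with T = mean + k * std (assumed rule)."""
    flat = [v for row in contrast_map for v in row]
    threshold = mean(flat) + k * pstdev(flat)
    mask = [[1 if v > threshold else 0 for v in row] for row in contrast_map]
    return mask, threshold


# Toy enhanced map: flat background with one bright candidate pixel.
enhanced = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 10.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
mask, threshold = adaptive_threshold(enhanced)
```

Because the threshold is derived from the statistics of each map, it adapts to the overall background level instead of relying on a fixed cutoff.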
Funding: Project supported by the Shandong Provincial Natural Science Foundation of China (No. ZR2020MF015) and the Aerospace Science and Technology Innovation Institute Stabilization Support Project (No. ZY0110020009).
Abstract: Camouflaged targets are a type of nonsalient target with high foreground and background fusion and minimal target feature information, making target recognition extremely difficult. Most detection algorithms for camouflaged targets use only the target’s single-band information, resulting in low detection accuracy and a high missed detection rate. We present a multimodal image fusion camouflaged target detection technique (MIF-YOLOv5) in this paper. First, we provide a multimodal image input to achieve pixel-level fusion of the camouflaged target’s optical and infrared images, improving the effective feature information of the camouflaged target. Second, a loss function is designed, and the K-Means++ clustering technique is used to optimize the target anchor boxes in the dataset to increase the accuracy and robustness of camouflaged personnel detection. Finally, a comprehensive detection index for camouflaged targets is proposed to compare the overall effectiveness of various approaches. More crucially, we create a multispectral camouflaged target dataset to test the proposed technique. Experimental results show that the proposed method has the best comprehensive detection performance, with a detection accuracy of 96.5%, a recognition probability of 92.5%, an increase of 1×10^4 in the number of parameters, an increase of 0.03 GFLOPs in theoretical computation, and a comprehensive detection index of 0.85. The advantage of this method in terms of detection accuracy is also apparent in performance comparisons with other target detection algorithms.
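The anchor optimization step can be illustrated with a small K-Means++ sketch over (width, height) pairs. The abstract does not state the distance metric or update rule; using 1 − IoU as the distance and the per-cluster mean as the update follows common YOLO practice and is an assumption here. All function names are hypothetical.

```python
import random
from typing import List, Tuple

Box = Tuple[float, float]  # (width, height) of a ground-truth box


def iou_wh(a: Box, b: Box) -> float:
    """IoU of two boxes aligned at a common corner, so only sizes matter."""
    inter = min(a[0], b[0]) * min(a[1], b[1])
    return inter / (a[0] * a[1] + b[0] * b[1] - inter)


def kmeanspp_seeds(boxes: List[Box], k: int, rng: random.Random) -> List[Box]:
    """K-Means++ seeding: sample each new seed with probability ~ D(x)^2.

    Assumes the boxes are not all identical (otherwise every weight is zero).
    """
    seeds = [rng.choice(boxes)]
    while len(seeds) < k:
        d2 = [min(1.0 - iou_wh(b, s) for s in seeds) ** 2 for b in boxes]
        seeds.append(rng.choices(boxes, weights=d2, k=1)[0])
    return seeds


def cluster_anchors(boxes: List[Box], k: int, iters: int = 20, seed: int = 0) -> List[Box]:
    """Lloyd iterations from K-Means++ seeds; returns anchors sorted by area."""
    rng = random.Random(seed)
    centers = kmeanspp_seeds(boxes, k, rng)
    for _ in range(iters):
        groups: List[List[Box]] = [[] for _ in range(k)]
        for b in boxes:
            best = max(range(k), key=lambda i: iou_wh(b, centers[i]))
            groups[best].append(b)
        centers = [
            (sum(w for w, _ in g) / len(g), sum(h for _, h in g) / len(g)) if g else centers[i]
            for i, g in enumerate(groups)
        ]
    return sorted(centers, key=lambda c: c[0] * c[1])


# Toy data: three small boxes and three large boxes.
boxes = [(10, 10), (11, 10), (10, 11), (50, 60), (52, 58), (49, 61)]
anchors = cluster_anchors(boxes, k=2)
```

The D(x)² weighting makes it likely that the initial seeds land in different size groups, which is the practical advantage of K-Means++ over uniformly random initialization.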
Funding: Supported by the National Natural Science Foundation of China (Nos. 60775022 and 60705006).
Abstract: Image fusion can be performed at different levels: the signal, pixel, feature, and symbol levels. Almost all image fusion algorithms developed to date operate at the pixel level. This paper provides an overview of the most widely used pixel-level image fusion algorithms, with comments on their relative strengths and weaknesses. Particular emphasis is placed on multiscale-based methods. Some performance measures applicable to pixel-level image fusion are also discussed. Finally, the prospects of pixel-level image fusion are outlined.
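To make "pixel-level" concrete, the sketch below shows two of the simplest fusion rules applied independently at each pixel: averaging and maximum selection. These are generic baseline rules chosen for illustration, not the specific algorithms surveyed in the paper, and the function names are hypothetical.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel intensities


def fuse_average(img_a: Image, img_b: Image) -> Image:
    """Fuse two registered images by averaging corresponding pixels."""
    return [[(a + b) / 2 for a, b in zip(row_a, row_b)] for row_a, row_b in zip(img_a, img_b)]


def fuse_max(img_a: Image, img_b: Image) -> Image:
    """Fuse two registered images by keeping the brighter pixel at each location."""
    return [[max(a, b) for a, b in zip(row_a, row_b)] for row_a, row_b in zip(img_a, img_b)]


# Two registered 2x2 source images (toy values).
visible = [[0.0, 100.0], [40.0, 80.0]]
infrared = [[100.0, 100.0], [20.0, 120.0]]
averaged = fuse_average(visible, infrared)
brightest = fuse_max(visible, infrared)
```

Both rules assume the inputs are already registered and of equal size. Multiscale methods apply rules of this kind to the coefficients of a decomposition rather than to raw intensities.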
Funding: Supported by the National Key Research and Development Program of China (2016YFA0600303 and 2018YFC1506506).
Abstract: Accurate global land cover (GLC), as a key input for scientific communities, is important for a wide variety of applications. To understand the current suitability and limitations of GLC products, this study assesses the discrepancy and pixel-level uncertainty in major GLC products in three epochs by using an integrated uncertainty index (IUI) that combines the thematic uncertainty and the local classification accuracy uncertainty. The results show that the overall spatial agreements (Ao values) between GLC products are lower than 58%. The total areas of forests are very consistent among major GLC products, but significant differences are found among forest classes. The misclassification among different forest classes and mosaic types accounts for about 20% of the total disagreement. The mean IUI almost reaches 0.5, and high uncertainty mostly occurs in transition zones and heterogeneous areas across the world. Further efforts are needed to improve land cover classification in areas with high uncertainty. A classification scheme for climate models, with explicit definitions of land cover classes based on thresholds of common attributes, is urgently needed. Information on the pixel-level uncertainty in major GLC products not only has important implications for specific applications but also provides an important basis for land cover fusion.
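As a rough illustration of an overall spatial agreement value, the sketch below computes the fraction of co-located pixels that two classified maps label identically. The paper's exact definition of Ao (for example, how legends are harmonized or how more than two products are combined) is not given in the abstract, so this pairwise form and the function name are assumptions.

```python
from typing import List

ClassMap = List[List[int]]  # land cover class code per pixel


def overall_agreement(map_a: ClassMap, map_b: ClassMap) -> float:
    """Fraction of co-located pixels that carry the same class in both maps."""
    pairs = [(a, b) for row_a, row_b in zip(map_a, map_b) for a, b in zip(row_a, row_b)]
    return sum(a == b for a, b in pairs) / len(pairs)


# Toy 2x2 maps that share a legend (1 = forest, 2 = cropland, 3 = water).
product_a = [[1, 1], [2, 3]]
product_b = [[1, 2], [2, 3]]
agreement = overall_agreement(product_a, product_b)
```

In this toy case three of the four pixels match. Real comparisons first require resampling the products to a common grid and translating their legends to a common set of classes.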
Funding: Supported by the Major National Science & Technology Program of China (No. 2012ZX03004004-002).
Abstract: This paper presents a 50 Hz, 15-bit analog-to-digital converter (ADC) for pixel-level implementation in CMOS image sensors. The ADC is based on charge-packet counting and adopts a voltage reset technique to inject charge packets. The core circuit for charge/pulse conversion is specially optimized for low power, low noise, and small area. An experimental chip with ten pixel-level ADCs has been fabricated and tested for verification. The measurement result shows a standard deviation of 1.8 LSB at full-scale output. The ADC occupies an area of 45 × 45 μm^2 and consumes less than 2 μW in a standard 1P-6M 0.18 μm CMOS process.
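The reported figures can be put in perspective with a few back-of-envelope quantities. The calculation below assumes one conversion per 20 ms period, one charge packet per LSB, and 2 μW as an upper bound on power; these assumptions and the variable names are illustrative and are not taken from the paper.

```python
import math

BITS = 15
CONVERSION_RATE_HZ = 50.0
NOISE_LSB = 1.8        # measured standard deviation at full scale
POWER_W = 2e-6         # upper bound: "less than 2 uW"

levels = 2 ** BITS                                   # number of output codes
conversion_time_s = 1.0 / CONVERSION_RATE_HZ         # time per conversion
# Assumption: one charge packet per LSB, so full scale needs (levels - 1) packets.
max_packet_rate_hz = (levels - 1) / conversion_time_s
# Ratio of full scale to the measured noise, expressed in decibels.
full_scale_to_noise_db = 20 * math.log10(levels / NOISE_LSB)
# Upper bound on the energy spent per conversion.
energy_per_conversion_j = POWER_W * conversion_time_s
```

Under these assumptions the counter must handle roughly 1.6 million packets per second at full scale, the full-scale-to-noise ratio is about 85 dB, and each conversion costs at most 40 nJ.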