Abstract: In oriented bounding box (OBB) object detection for high-resolution remote sensing images, small targets are often missed or falsely detected because they are very small and appear in varying orientations. Existing OBB detection methods for remote sensing images have made good progress, but they focus mainly on orientation modeling and give less consideration to object size and missed detections. This study proposes a method based on an improved YOLOv8 to raise the detection precision of oriented objects in remote sensing images. First, a ResCBAMG module was designed to better extract channel and spatial correlation information. Second, a top-down feature fusion layer structure was proposed in conjunction with the Efficient Channel Attention (ECA) module, which helps capture local cross-channel interaction information. Finally, a ResCBAMG module was introduced between the different C2f modules and the detection heads of the bottom-up feature fusion layer. This structure helps the model focus on the target area and improves the precision and robustness of oriented target detection. Experimental results on the DOTA-v1.5 dataset showed that the improved model outperformed the original model in Precision, mAP@0.5, and mAP@0.5:0.95. The improvement is effective for detecting small targets and for complex scenes.
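The cross-channel interaction attributed to the ECA module above can be illustrated with a minimal pure-Python sketch. It follows the commonly described ECA recipe (global average pooling per channel, a small 1D convolution across neighboring channels, a sigmoid gate, then channel-wise rescaling). It is not the paper's feature fusion layer or its ResCBAMG module, neither of which is specified in the abstract. The function names and the hand-picked `kernel` values are assumptions for illustration only; in a trained network the kernel weights are learned.

```python
import math


def eca_channel_weights(feature_map, kernel):
    """Return one attention weight per channel.

    feature_map: list of C channels, each an H x W list of lists of floats.
    kernel: odd-length list of 1D convolution weights shared by all channels
            (hypothetical fixed values here; learned in practice).
    """
    # Global average pooling: squeeze each channel to a single descriptor.
    pooled = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Zero-pad so every channel sees the same number of neighbors.
    pad = len(kernel) // 2
    padded = [0.0] * pad + pooled + [0.0] * pad
    weights = []
    for c in range(len(pooled)):
        # 1D convolution across the channel axis: local cross-channel interaction.
        z = sum(k * padded[c + j] for j, k in enumerate(kernel))
        # Sigmoid gate maps the response to the interval (0, 1).
        weights.append(1.0 / (1.0 + math.exp(-z)))
    return weights


def eca_apply(feature_map, kernel):
    """Rescale every channel of feature_map by its attention weight."""
    weights = eca_channel_weights(feature_map, kernel)
    return [
        [[w * value for value in row] for row in channel]
        for channel, w in zip(feature_map, weights)
    ]


# Three constant 2 x 2 channels with values 1, 2, and 3.
demo = [[[float(v)] * 2 for _ in range(2)] for v in (1, 2, 3)]
demo_weights = eca_channel_weights(demo, [1.0, 1.0, 1.0])
demo_output = eca_apply(demo, [1.0, 1.0, 1.0])
```

With the kernel `[1.0, 1.0, 1.0]`, each channel's gate depends on its own pooled value and those of its immediate neighbors, which is the local interaction the abstract refers to.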
Funding: Supported by the National Key R&D Program of China (2021YFB2501200), the Key Program of the National Natural Science Foundation of China (52131204), and the Shaanxi Province Key Research and Development Program (2022GY-300).
Abstract: Real-time and accurate traffic light status recognition can provide reliable data support for the decision-making and control systems of autonomous vehicles. To address problems such as the small share of the visual sensor's field of view occupied by traffic lights and the complexity of recognition scenarios, we propose an end-to-end traffic light status recognition method, ResNeSt50-CBAM-DINO (RC-DINO). First, we cleaned the Tsinghua-Tencent traffic lights (TTTL) dataset and fused it with Shanghai Jiao Tong University's traffic light dataset (S2TLD) to form a Chinese urban traffic light dataset (CUTLD). Second, we combined the residual network with split-attention module-50 (ResNeSt50) and the convolutional block attention module (CBAM) to extract more significant traffic light features. Finally, the proposed RC-DINO and mainstream recognition algorithms were trained and analyzed on CUTLD. The experimental results show that, compared with the original DINO, RC-DINO improved the average precision (AP), AP at an intersection over union (IoU) threshold of 0.5 (AP50), AP for small objects (APs), average recall (AR), and balanced F score (F1-Score) by 3.1%, 1.6%, 3.4%, 0.9%, and 0.9%, respectively, and showed some ability to recognize the status of partially occluded traffic lights. These results indicate that RC-DINO improves recognition performance and robustness, making it more suitable for traffic light status recognition tasks.
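Two of the quantities behind the metrics reported above can be sketched briefly: the IoU between a predicted and a ground-truth box, which decides whether a detection counts as a match at the 0.5 threshold used for AP50, and the F1-Score as the harmonic mean of precision and recall. This is a sketch under stated assumptions, not the paper's evaluation code. The `(x1, y1, x2, y2)` axis-aligned box layout and all function names are illustrative choices that do not come from the source.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    # Corners of the overlap rectangle (it may be empty).
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(pred_box, gt_box, threshold=0.5):
    """A prediction counts as a match when its IoU reaches the threshold."""
    return iou(pred_box, gt_box) >= threshold


def f1_score(precision, recall):
    """Balanced F score: harmonic mean of precision and recall."""
    total = precision + recall
    return 2.0 * precision * recall / total if total > 0 else 0.0


# A 4 x 4 prediction shifted one unit to the right of its ground truth.
overlap = iou((0, 0, 4, 4), (1, 0, 5, 4))        # 12 / 20 = 0.6
matched = is_match((0, 0, 4, 4), (1, 0, 5, 4))   # 0.6 >= 0.5
balanced = f1_score(0.5, 1.0)                    # 2 * 0.5 / 1.5
```

Small objects such as distant traffic lights are sensitive to this threshold, because a shift of a few pixels removes a large fraction of the overlap.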