摘要
针对经典的有锚框检测算法RetinaNet、无锚框检测算法FCOS等目标检测算法中存在漏检以及重复检测的问题,提出一种自适应特征融合与cosIoU-NMS的目标检测算法.首先采用自适应特征融合模块对多尺度特征中相邻3层特征加权融合,获取丰富的上下文信息和空间信息;然后采用cosIoU计算检测框之间的余弦相似度与重叠面积,使目标定位更准确;最后使用cosIoU-NMS代替Greedy-NMS抑制置信度分数较高的冗余框,保留更准确的检测结果.以RetinaNet和FCOS为基准,在PASCAL VOC数据集上的实验结果表明,所提算法的检测精度达到81.3%和82.3%,分别提升2.8个百分点和1.2个百分点;在MSCOCO数据集上检测精度达到36.8%和38.0%,分别提升1.0个百分点和0.7个百分点;该算法能够增强特征表征能力,筛除多余的检测框,有效地提高检测性能.
To address the problem of missing or repeating detection in the classical anchor-based RetinaNet,anchor-free FCOS,and other object detection algorithms,this paper proposes a novel object detection algo-rithm based on adaptive feature fusion and cosIoU-NMS.Firstly,the algorithm leverages an adaptive feature fusion module to obtain rich context and spatial information by weighted fusion of adjacent three-layer fea-tures in multi-scale features.Then,the cosIoU,which measures the cosine similarity and overlap area be-tween detection boxes,is calculated to locate the target more precisely.Finally,by replacing Greedy-NMS with our cosIoU-NMS,redundant boxes with high confidence scores can be effectively suppressed,and thus retaining more accurate detection results.Based on RetinaNet and FCOS,the experimental results on the PASCAL VOC dataset demonstrate the detection accuracy of our proposed algorithm achieves 81.3%and 82.3%,with relative gains of 2.8 and 1.2 percentage points,respectively.On the MS COCO dataset,the ac-curacy reaches 36.8%and 38.0%,which is increased by 1.0 and 0.7 percentage points,respectively.The al-gorithm can improve the capability of feature representation,remove redundant detection boxes,and sig-nificantly boost the detection performance.
作者
马素刚
李宁博
彭冠升
杨小宝
侯志强
Ma Sugang;Li Ningbo;Peng Guansheng;Yang Xiaobao;Hou Zhiqiang(School of Computer Science and Technology,Xi’an University of Posts and Telecommunications,Xi’an 710121;Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing,Xi’an University of Posts and Telecommunications,Xi’an 710121;School of Computer Science and Technology,XIDIAN University,Xi’an 710126;Xi’an Key Laboratory of Big Data and Intelligent Computing,Xi’an University of Posts and Telecommunications,Xi’an 710121)
出处
《计算机辅助设计与图形学学报》
EI
CSCD
北大核心
2024年第1期112-121,共10页
Journal of Computer-Aided Design & Computer Graphics
基金
国家自然科学基金(62072370)
西安市科技计划(22GXFW0125)。
关键词
深度学习
目标检测
多尺度特征融合
交并比
非极大值抑制
余弦相似度
deep learning
object detection
multi-level feature fusion
intersection over union
non-maximum suppression
cosine similarity