期刊文献+

使用GIoU改进非极大值抑制的目标检测算法 被引量:25

Object Detection Algorithm for Improving Non-Maximum Suppression Using GIoU
下载PDF
导出
摘要 针对单阈值-非极大值抑制算法中出现的目标漏检和重复检测问题,本文提出了一种使用全局交并比指标GIoU(Generalized Intersection over Union)衡量目标相似度的双阈值非极大值抑制算法GDT-NMS(Generalized Dual Threshold NMS,GDT-NMS).使用双阈值改进NMS算法和soft-NMS算法,抑制多余的检测框;在此基础上,使用GIoU替换传统的IoU计算目标间的相似度,使目标的定位更加准确;进一步,使用非线性函数赋予检测框不同比例的权值惩罚,使检测框的得分随距离呈非线性变化,目标区分度更高.改进算法在PASCAL VOC和MSCOCO上的检测精度分别为74.8%和25.9%,与使用NMS算法作为后处理的Faster R-CNN算法相比,性能分别提升了1.6%和1.5%.同时本文算法具有较快的检测速度. Aiming at the problem of missed detection and repeated detection in the single-threshold-non-maximum suppression algorithm,this paper proposes a dual-threshold Non-Maximum Suppression algorithm using GIoU(Generalized Intersection over Union).Using dual thresholds to improve the NMS algorithm and the soft-NMS algorithm,suppressing redundant detection boxes,not only balances the relationship between the object missed detection problem and the object false detection problem caused by the single threshold algorithm,but also reduces the occurrence of the soft-NMS algorithm.Based on the above,using GIoU instead of IoU to calculate the similarity between objects,so that the positioning of the object is more accurate;the non-linear function is used to give different weights to the proposal boxes,which makes the proposal boxes’scores change non-linearly with distance,and the target discrimination is higher,which is more conducive to suppressing the proposal boxes.The detection accuracy of the improved algorithm on PASCAL VOC and MSCOCO is 74.8%and 25.9%,respectively.At the same time,the algorithm in this paper has a fast detection speed.
作者 侯志强 刘晓义 余旺盛 蒲磊 马素刚 范九伦 HOU Zhi-qiang;LIU Xiao-yi;YU Wang-sheng;PU Lei;MA Su-gang;FAN Jiu-lun(School of Computer,Xi’an University of Posts and Telecommunications,Xi’an,Shaanxi 710121,China;School of Information and Navigation,Air Force Engineering University,Xi’an,Shaanxi 710077,China;School of Communication and Information Engineering,Xi’an University of Posts and Telecommunications,Xi’an,Shaanxi 710121,China)
出处 《电子学报》 EI CAS CSCD 北大核心 2021年第4期696-705,共10页 Acta Electronica Sinica
基金 国家自然科学基金(No.61703423,No.61473309,No.62072370)。
关键词 双阈值 非极大值抑制算法 重复检测 后处理 double threshold non-maximum suppression algorithm repeated detection post processing
  • 相关文献

参考文献8

二级参考文献83

  • 1Marr D.Vision:A Computational Investigation Into the Human Representation and Processing of Visual Information.Cambridge:The MIT Press,2010.
  • 2LeCun Y,Bottou L,Bengio Y,Haffner P.Gradient-based learning applied to document recognition.Proceedings of the IEEE,1998,86(11):2278-2324.
  • 3Ferrari V,Jurie F,Schmid C.From images to shape models for object detection.International Journal of Computer Vision,2009,87(3):284-303.
  • 4Latecki L J,Lakamper R,Eckhardt U.Shape descriptors for non rigid shapes with a single closed contour//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Hilton Head,USA,2000,1:424-429.
  • 5Krizhevsky A.Learning Multiple Layers of Features from Tiny Images[M.S.dissertation].University of Toronto,2009.
  • 6Torralba A,Fergus R,Freeman W T.80 million tiny images:A large dataset for non-parametric object and scene recognition.IEEE Transactions on Pattern Analysis and Machine Intelligence,2008,30(11):1958-1970.
  • 7Li FebFei,Fergus R,Perona P.Learning generative visual models from few training examples:An incremental Bayesian approach tested on 101 object categories//Proceedings of the Computer Vision and Pattern Recognition (CVPR),Workshop on Generative-Model Based Vision.Washington,USA,2004:178.
  • 8Griffin G,Holub A D,Perona P.The Caltech 256.Caltech Technical Report CNS-TR-2007-001.
  • 9Lazebnik S,Schmid C,Ponce J.Beyond bags of features:Spatial pyramid matching for recognizing natural scene categories//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).New York,USA,2006:2169-2178.
  • 10Li Fei-Fei,Perona P.A Bayesian hierarchical model for learning natural scene categories//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Washington,USA,2005:524-531.

共引文献332

同被引文献236

引证文献25

二级引证文献100

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部