期刊文献+

基于可学习记忆特征金字塔网络的小样本目标检测 被引量:1

Few-shot Object Detection via Learnable Memory Feature Pyramid Network
下载PDF
导出
摘要 现阶段,部分行业应用场景数据难以获取,从而产生的小样本问题成为制约深度学习技术应用推广的重要因素。本文通过小样本方法来提升模型在数据缺乏情况下的表现,降低深度学习模型对数据的依赖性,提出一种基于可学习记忆特征金字塔网络来保留更干净的多尺度特征信息用于分类器预测。借助自适应特征融合模块,让网络自行选择不同层级特征间的侧重比,最大化保留不同尺度的判别性特征信息。同时还加入回溯特征对齐模块,用于缓解特征层堆叠时引入的特征混淆效应。实验结果表明,通过克服样本依赖性可以有效地提升模型性能,改进后的模型可以在COCO数据集和VOC数据集上超越其他现有同类型的模型。特别地,在VOC数据集中将先验参数k设置为5的情况下,nAP50提高了4.8达到44.7;在COCO数据集中将先验参数k设置为30的情况下,nAP50提高了4.0达到29.4。 At present,it is difficult to obtain the data of some industry application scenarios,and the problem of few shot has become an important factor restricting the application and promotion of deep learning technology.In this paper,few shot method is adopted to improve the performance of the model in the absence of data and reduce the dependence of the deep learning model on data,and few-shot object detection via learnable memory feature pyramid network is proposed to retain cleaner multi-scale feature information for classifier prediction.With the help of the adaptive feature fusion module,the network can choose the emphasis ratio among the features of different levels to maximize the retention of discriminant feature information of different scales.At the same time,we also add a retrospective feature alignment module to alleviate the feature confusion effect introduced by stacking feature layers.The experimental results show that the model performance can be effectively improved by overcoming the dependence on data,and the improved model can surpass other existing models of the same type in the COCO dataset and VOC dataset.In particular,when the prior parameter k is set to 5 in VOC dataset,nAP50 increases by 4.8 to 44.7;when the prior parameter k is set to 30 in COCO dataset,nAP50 increases by 4.0 to 29.4.
作者 夏千涵 何胜煌 吴元清 赵乐乐 XIA Qian-han;HE Sheng-huang;WU Yuan-qing;ZHAO Le-le(School of Computer Science and Technology,Guangdong University of Technology,Guangzhou 510006,China;School of Automation,Shanghai Jiaotong University,Shanghai 200030,China;Concordia University Wisconsin,Mequon WI 53097,USA)
出处 《计算机与现代化》 2023年第12期7-13,23,共8页 Computer and Modernization
基金 国家自然科学基金资助项目(U22A2065,62003100,62276074) 国家重点发展计划项目(2022YFB4701300) 广东省基础和应用基础研究基金资助项目(2021B15120058)。
关键词 小样本 自适应融合 特征对齐 特征金字塔网络 few shot adaptive fusion feature alignment feature pyramid network
  • 相关文献

参考文献6

二级参考文献29

  • 1李楚为,张志龙,杨卫平.结合布尔图和灰度稀缺性的小目标显著性检测[J].中国图象图形学报,2020,0(2):267-281. 被引量:4
  • 2AN S, PEURSUM P, LIU W, et al. Efficient algorithms for subwin* dow search in object detection and localization [ C]// CVPR 2009: Proceedings of the 2009 IEEE Conference on Computer Vision and Patter Recognition. Piscataway: IEEE, 2009:264-271.
  • 3LEHMANN A, LEIBE B, GOOL L. Feature centric efficient sub- window search [ C]// CVPR 2009: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Piscat- away: IEEE, 2009: 940-947.
  • 4MUTCH J, LOWED G, Multiclass object recognition with sparse localized features [ C]//CVPR 2006: Proceedings of the 2006 IEEI] Conference on Computer Vision and Pattern Recognition. Piscat[ away: IEEE, 2006:11-18. [.
  • 5BRUNELLI R. Template matching techniques in computer vision: theory and practice [ M]. Hoboken: John Wiley and Sons, 2009.
  • 6SHECHTMAN E, IRANI M. Matching local self similarities across images and videos [ C]/! CVPR 2007: Proceedings of the 2007 IEEE Conference on Computer Vision and Pattern Recognition. Pis- catawav: IEEE, 2007:1-8.
  • 7SIBIRYAKOV A. Fast and high-performance template matching methods [ C]/! CVPR 2008: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2008:1-8.
  • 8SEO H J, MILANFAR P. Training-free, generic object detection using locally adaptive regression kernels [ J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(9): 1688- 1704.
  • 9XU P, YE M, LI X, et al. Object detection using voting spaces trained by few samples [ J]. Optical Engineering, 2013, 52(9) :093105.
  • 10XU P, YE M. FU M, et al. Object detection based on several samples with training Hough spaces [ C]// CCPR 2012: Chinese Conference of Pattern Recognition, CCIS 321. Berlin: Springer- Verlag, 2012:235-242.

共引文献33

同被引文献7

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部