
Salient Object Detection Based on Deep Fusion of Hand-Crafted Features (Cited by: 34)
Abstract: Natural images often contain a variety of complex content, and saliency detection algorithms based on a single feature can hardly extract salient objects consistent with human vision from complex scenes. Although fusing multiple saliency maps can compensate for or correct the detection defects caused by a single feature, an ill-designed fusion scheme may further degrade detection performance. To solve the problem of effectively fusing multiple saliency maps, the authors propose a deep fusion model for feature maps based on a deep convolutional neural network. The algorithm takes four low-level saliency maps as the network input and learns the salient objects of an image with a dual-channel convolutional network consisting of an anterior fusion channel and a posterior fusion channel. The anterior fusion channel uses a multi-layer fully convolutional network to generate a saliency map that is sensitive to object boundaries, while the posterior fusion channel uses a weight-shared shallow network to obtain four high-level semantic saliency maps that preserve object locations. The feature maps from the two channels are then refined by a four-layer fully convolutional network to produce the final saliency map. Extensive experiments on public datasets demonstrate the effectiveness of the proposed deep fusion algorithm.

Visual saliency detection is an important and fundamental research problem in neuroscience and psychology, which investigates the mechanism by which human visual systems select regions of interest from complex scenes. Recently it has also been an active topic in computer vision, due to its applications in object detection, video summarization, image editing, image retrieval, face detection, and fine-grained visual categorization. Saliency detection is commonly interpreted as a two-stage process: (1) detecting the most salient regions in accordance with human visual attention, and (2) segmenting the accurate boundaries of those regions. In general, saliency detection methods can be categorized as either bottom-up or top-down. The former focuses on the stimulus-driven stage of attention, which is of main interest in the computer vision community; in contrast, top-down approaches usually require supervised learning with manually labeled ground truth. However, natural images often contain a variety of complex content, and methods based on a single visual feature can hardly extract salient objects consistent with the human visual system from complex scenes. Although the fusion of multiple saliency maps can compensate for or correct the defects of a single visual feature, an irrational fusion scheme may further degrade performance.

To solve the problem of effectively fusing saliency maps, we propose a deep fusion model based on a deep convolutional neural network. We use four hand-crafted feature maps, namely Local Contrast (LC), Global Contrast (GC), Spatial Variance (SV) and Center Variance (CV), as the input of the network and learn the saliency values in a dual-estimation process. The anterior fusion estimation is fed with the fused maps to learn saliency values with precise boundaries, while the posterior fusion estimation is fed with each feature map separately to learn object-level semantic saliency. The four feature maps are first exploited independently to detect salient regions, together with six fusion schemes: HF, HF_p and HF_a, proposed in this paper, as well as CRF, CSVM and WA. The evaluations show that HF achieves the highest performance, which demonstrates the benefit of combining the anterior and posterior estimations. HF is therefore selected as the fusion scheme: the two features are concatenated and integrated into a jointly optimized network for final saliency detection.

Extensive experiments are carried out on four benchmark datasets: ASD, PASCAL-S, ECSSD and HKU-IS. Two measurements, MAE and F-score, are used to compare the proposed method with existing methods including HS, GMR, DSR, DRFI, LEGS, MDF, MCDL and ELD, among which HS, GMR, DSR and DRFI are traditional methods based on low-level features, while LEGS, MDF, MCDL and ELD are based on deep learning. The precision-recall curves (PRCs) of the methods are also used to illustrate their performance. The results show that the proposed method gains significant and consistent improvements over representative deep-learning-based saliency detection methods.
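As a concrete illustration of the dual-channel architecture described above, here is a minimal PyTorch sketch. The layer counts in the anterior and posterior channels, the channel widths, and the kernel sizes are illustrative assumptions; the abstract only fixes the four-map input, the weight sharing in the posterior channel, and the four-layer fully convolutional refiner. This is a structural sketch, not the authors' exact network.

```python
# Structural sketch of the anterior/posterior deep fusion model.
# Assumed: layer widths/depths other than the four-layer refiner.
import torch
import torch.nn as nn

class AnteriorFusion(nn.Module):
    """Anterior channel: a multi-layer fully convolutional network fed with
    the four low-level maps stacked together, producing one boundary-sensitive
    saliency map."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(4, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 1, 3, padding=1),
        )

    def forward(self, maps):            # maps: (B, 4, H, W) = LC/GC/SV/CV
        return self.net(maps)           # (B, 1, H, W)

class PosteriorFusion(nn.Module):
    """Posterior channel: one weight-shared shallow network applied to each
    of the four maps separately, yielding four location-preserving
    high-level semantic saliency maps."""
    def __init__(self):
        super().__init__()
        self.shared = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(16, 1, 3, padding=1),
        )

    def forward(self, maps):            # maps: (B, 4, H, W)
        # The same (weight-shared) network scores each map independently.
        outs = [self.shared(maps[:, i:i + 1]) for i in range(4)]
        return torch.cat(outs, dim=1)   # (B, 4, H, W)

class DeepFusionNet(nn.Module):
    """Concatenate both channels and refine with a four-layer FCN."""
    def __init__(self):
        super().__init__()
        self.anterior = AnteriorFusion()
        self.posterior = PosteriorFusion()
        self.refine = nn.Sequential(    # four-layer fully convolutional refiner
            nn.Conv2d(5, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 16, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(16, 1, 3, padding=1),
        )

    def forward(self, maps):
        a = self.anterior(maps)                   # (B, 1, H, W)
        p = self.posterior(maps)                  # (B, 4, H, W)
        fused = torch.cat([a, p], dim=1)          # (B, 5, H, W)
        return torch.sigmoid(self.refine(fused))  # final saliency map

# Usage: four hand-crafted maps (LC, GC, SV, CV) stacked as channels.
x = torch.rand(1, 4, 224, 224)
print(DeepFusionNet()(x).shape)  # torch.Size([1, 1, 224, 224])
```

The weight sharing in the posterior channel is what lets all four maps be scored by one learned notion of object-level saliency while keeping each map's spatial layout intact.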
We believe the proposed method's success comes from the following factors: (1) the four hand-crafted feature maps are derived from features at different levels and are therefore complementary; (2) the deep network is effective at capturing the correlations among the feature maps; (3) compared with shallow fusion models such as SVM and CRF, HF can fuse feature maps from different levels and achieves better performance.
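For reference, the two reported measurements can be computed as in the following sketch. It assumes the definitions conventional in saliency benchmarks (pixel-wise mean absolute error, and the F-measure with beta^2 = 0.3 on a thresholded map); the abstract itself does not spell these out.

```python
# Generic sketch of the MAE and F-score measures used in saliency
# benchmarks; beta2 = 0.3 is the conventional weighting, assumed here.
import numpy as np

def mae(pred, gt):
    """Mean absolute error between a [0,1] saliency map and a binary mask."""
    return np.abs(pred - gt).mean()

def f_score(pred, gt, beta2=0.3, thresh=0.5):
    """F-measure of the thresholded saliency map against the binary mask."""
    binary = pred >= thresh
    positives = gt > 0.5
    tp = np.logical_and(binary, positives).sum()
    precision = tp / max(binary.sum(), 1)
    recall = tp / max(positives.sum(), 1)
    if precision + recall == 0:
        return 0.0
    return (1 + beta2) * precision * recall / (beta2 * precision + recall)

pred = np.random.rand(224, 224)        # predicted saliency map in [0, 1]
gt = np.random.rand(224, 224) > 0.5    # binary ground-truth mask
print(mae(pred, gt), f_score(pred, gt))
```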
Authors: ZHANG Dong-Ming; JIN Guo-Qing; DAI Feng; YUAN Qing-Sheng; BAO Xiu-Guo; ZHANG Yong-Dong (National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029; Intelligent Information Processing Key Lab, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; School of Information Science and Technology, University of Science and Technology of China, Hefei 230026)
Source: Chinese Journal of Computers (计算机学报; indexed in EI, CSCD, Peking University Core), 2019, Issue 9, pp. 2076-2086 (11 pages)
Funding: Supported by the National Key Research and Development Program of China (2018YFB0804202) and the National Natural Science Foundation of China (61672495, 61771458, 61525206)
Keywords: salient object detection; hand-crafted features; deep fusion; deep learning; saliency map
