Survey of visual salient object detection in the context of deep learning (cited by: 7)

Review of deep learning based salient object detection


Abstract: Visual salient object detection simulates the human visual and cognitive system, while deep learning simulates the way the human brain computes; combining the two can effectively advance computer vision. The task of visual salient object detection is to locate and extract salient object instances with clear contours from an image. With the development of deep learning, both the accuracy and the efficiency of salient object detection have improved greatly, but major challenges remain, including improving the performance of mainstream algorithms and reducing the dependence on pixel-level annotated samples. In response to these challenges, this paper classifies and summarizes the relevant literature from the perspective of strategies for integrating salient object detection ideas with deep learning methods. 1) It analyzes the insights offered by traditional salient object detection methods and their shortcomings, and points out that the core idea of salient object detection is the extraction, fusion, and refinement of multi-level features. 2) It analyzes performance improvements in three mainstream families of algorithms (recurrent convolutional neural networks, fully convolutional networks, and generative adversarial networks) from the angles of improving feature encoding and information transfer structures, improving edge localization accuracy, improving attention mechanisms, improving training stability, and controlling noise; it also analyzes methods for reducing the dependence on pixel-level annotated samples from the angle of optimizing weakly supervised sample processing modules. 3) It introduces co-salient object detection, multi-category image salient object detection, and future research problems and directions, and offers possible solutions.

Salient object detection (SOD) simulates the human visual and cognitive system, and deep learning is a computational simulation of the human brain. Traditional SOD methods require complicated hand-crafted features to extract multi-level features, which are then fused and refined by machine learning or other methods. Each of these steps can be internalized in a deep neural network model, which has led to a variety of algorithms. To provide a reference for intelligent SOD methods, recent SOD work is organized in detail by principle, basic idea, and algorithm. First, we briefly review the classic framework of traditional SOD methods and distill the ideas it contributes, such as multi-level feature fusion. The main weaknesses of these methods are time-consuming preprocessing, complex feature design, and a lack of robustness. Traditional methods rely on contrast features, which tend to respond to object boundaries and to internal noise, whereas the SOD task is concerned with the full extent of the object, whose homogeneous interior such features tend to suppress. In addition, features extracted in the spatial domain are disturbed by illumination and complex backgrounds and are difficult to fuse effectively with other features, which makes the results unstable. Next, we analyze fully supervised deep learning architectures for SOD: the early fusion model, the family of recurrent convolutional neural network (CNN) architectures, the family of fully convolutional network architectures, and attention mechanisms that enhance feature extraction and fusion. The internal mechanisms of these methods and the connections between them are discussed. The "early fusion model" refers to the strategy, used around 2015, of fusing traditional features with deep learning features by hand-designed rules such as vector concatenation. These hand-designed fusion rules lack a theoretical basis, the CNN is used only to extract high-level features, and every superpixel must be traversed and fed into the network, which is time-consuming. A recurrent CNN introduces a forgetting mechanism and continually updates its recognition result to identify the target. As a result, multi-level features aggregate naturally with fewer parameters, and the network achieves better results than a single feed-forward network. The fully convolutional network (FCN) handles end-to-end, pixel-level multi-class classification against complex backgrounds, which greatly enhances detection capability. To aggregate multi-level features, current research builds on the FCN and has produced a large number of customized models based on multiple strategies, including improving the fusion method within the network and compensating for the network's limited accuracy in extracting boundary information. Recently, attention modules have become a useful supplement to neural network models. Attention not only improves the precision of salient object detection but also frees the idea of saliency from the assumption that it applies only to pixel-level classification, so that saliency concepts can also be used in object detection and lightweight models. Third, weakly supervised and multi-task strategies show promise because training fully supervised SOD methods requires expensive pixel-level annotation. A CNN trained for classification can attend to and locate objects from image-level label semantics, which provides a starting point for optimizing detailed saliency maps. Meanwhile, a sample updating method must be designed, because weakly supervised samples are insufficient for refinement once the initial saliency map has been generated. Finally, we introduce the application and development of generative adversarial networks (GAN) and graph neural networks (GNN) in SOD; as neural network theory has advanced, both have been applied to the SOD task. Based on this summary of existing methods, we anticipate future directions for SOD, including improved feature fusion, co-salient object detection, weakly supervised and multi-task strategies, and salient object detection in multi-category images. In these scenarios the data have a more complex or fuzzy distribution (for example, no longer following a Euclidean spatial distribution), so solutions will need greater capacity to describe features.

Authors: Wang Ziquan; Zhang Yongsheng; Yu Ying; Min Jie; Tian Hao (College of Geospatial Information, Information Engineering University, Zhengzhou 450001, China; 31434 Troops, Shenyang 110000, China)

Affiliation: College of Geospatial Information, Information Engineering University

Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, PKU Core, 2022, No. 7, pp. 2112-2128 (17 pages)

Funding: National Natural Science Foundation of China (42071340).

Keywords: salient object detection (SOD); deep learning; recurrent convolutional neural network (RCNN); fully convolutional network (FCN); attention mechanism; weakly supervised and multi-task strategy

Classification: TP391.4 [Automation and Computer Technology: Computer Application Technology]




