Salient object detection via deep features and multiple kernel boosting learning

Cited by: 14

Abstract: Objective Existing salient object detection algorithms based on handcrafted features cannot yet effectively suppress irrelevant background regions or completely and uniformly highlight salient objects in complex images with large salient objects, cluttered backgrounds, or multiple salient objects. To address this problem, a salient object detection algorithm using deep semantic information and multiple kernel boosting learning is proposed. Method First, the input image is segmented into superpixels at multiple scales, and weak saliency maps are built with a manifold ranking-based algorithm. Second, a pretrained classic convolutional neural network extracts deep features carrying semantic information from the multiscale image sequence; guided by the weak saliency maps, a reliable training sample set is obtained from the multiscale images, and a strong saliency detection model is learned by multiple kernel boosting. Then, the strong saliency detection model is applied to all test samples of the multiscale images, and the multiscale detection results are fused by linear weighting into a region-level strong saliency map. Finally, the strong saliency map is updated at the pixel level using the position and color information between pixels to further improve its accuracy. Result On the widely used MSRA5K, ECSSD, and SOD datasets, the algorithm is compared with 9 mainstream related algorithms in terms of precision, recall, F-measure, precision-recall (PR) curve, weighted F-measure, and overlapping ratio (OR), as well as visual detection quality. Compared with the second-best-performing non-end-to-end deep neural network model, the average F-measure, weighted F-measure, OR, and mean absolute error (MAE) over the three datasets improve by 1.6%, 22.1%, 5.6%, and 22.9%, respectively. Conclusion Compared with handcrafted feature-based saliency detection algorithms, the proposed algorithm exploits the semantic information contained in images and combines multiple single-kernel support vector machine (SVM) classifiers into a strong classifier, achieving good detection results on complex images.

English abstract: Objective Salient object detection identifies the most conspicuous and eye-attracting objects or regions in images. Results are often expressed as saliency maps, in which the intensity of each pixel represents the probability that the pixel belongs to a salient region. Visual saliency detection has been used as a pre-processing step for a wide range of vision applications, including image and video compression, image retargeting, visual tracking, and robot navigation. Although the performance of salient object detection approaches has improved dramatically in the last few years, the task remains challenging. Most existing methods focus on handcrafted features and use distinct prior knowledge, such as contrast, center, background, and objectness priors, to enhance performance. Recently, convolutional neural network (CNN)-based approaches have proven remarkably effective, overcoming the limits of traditional handcrafted feature-based methods and greatly enhancing the performance of saliency detection. These CNN-based models, especially the end-to-end ones, have shown their superiority in feature extraction and efficiently capture high-level information about objects and their cluttered surroundings. The existing handcrafted feature-based salient object detection algorithms are insufficient in suppressing irrelevant backgrounds and uniformly highlighting the entire salient object on complicated images with large salient objects, cluttered backgrounds, and multiple salient objects. We propose a salient object detection scheme based on multiple kernel boosting learning and deep semantic information to overcome this drawback.

Method First, we segment the input image into multiscale superpixels and obtain weak saliency maps through graph-based manifold ranking. Second, we extract deep features carrying semantic information by using a classic CNN. We obtain reliable training sets through the multiscale weak saliency maps and develop a strong salient object detection model by using multiple kernel boosting learning. Then, saliency maps are produced directly for the samples from the multiscale superpixel images and fused to generate a strong saliency map. Finally, the saliency map is refined at the pixel level in accordance with color and position to improve the detection performance.

Result The proposed model is compared with 11 state-of-the-art methods in terms of precision, recall, F-measure, PR (precision-recall) curve, weighted F-measure, OR (overlapping ratio), and MAE (mean absolute error) scores, and visual effect on three popular public datasets, namely, MSRA5K, ECSSD, and SOD. Experimental results show improvements over the state-of-the-art methods. Compared with the saliency results produced by the second-best-performing non-end-to-end deep learning model on MSRA5K, ECSSD, and SOD, respectively, the F-measure score of our algorithm increased by 0.7%, 2.0%, and 2.1%; the weighted F-measure increased by 18.9%, 27.6%, and 19.8%; the OR scores increased by 2.9%, 6.8%, and 7.2%; and the MAE scores improved by 34.5%, 26.9%, and 7.5%. The experiments on visual effect show that our method performs well on various complex images, such as those with salient objects and backgrounds that share a similar appearance, multiple salient objects, salient objects with complex texture and structure, and cluttered backgrounds. The proposed approach not only uniformly highlights entire salient objects but also preserves their contours under various scenarios. We also conduct experiments on the three datasets in terms of PR curves to evaluate the contribution of each component of the proposed algorithm, and we report the average running time of our algorithm and of the methods based on non-end-to-end CNNs. The implementations are run on the ECSSD dataset by using MATLAB or C, and most of the test images have a resolution of 300 × 400 pixels. An efficient C/C++ implementation based on parallelized components would decrease our model's computation time and render it feasible for real-world application.

Conclusion The proposed salient object detection model, which learns a strong classifier from four single-kernel SVMs (support vector machines) and uses a classic CNN, demonstrates good performance on complicated images compared with salient object detection methods based on handcrafted features. Further improvements of salient object detection algorithms on datasets with complex and confusing backgrounds can be expected. In further research, we plan to utilize additional features from a CNN and construct an end-to-end model, which would improve performance and save computation cost. Our future work will also address the detection of small salient objects in video.
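The Result section reports precision, recall, F-measure, overlapping ratio (OR), and mean absolute error (MAE). As a minimal sketch of how such scores are commonly computed for a single saliency map, the Python function below assumes a saliency map scaled to [0, 1] and a binary ground-truth mask. The function name `saliency_scores`, the fixed binarization threshold of 0.5, and the weight `beta2 = 0.3` (a common convention in the saliency literature) are assumptions for illustration; the abstract does not state which settings the authors used, and the weighted F-measure is not covered here.

```python
import numpy as np

def saliency_scores(sal, gt, threshold=0.5, beta2=0.3):
    """Score one saliency map in [0, 1] against a binary ground-truth mask.

    threshold and beta2 are illustrative assumptions, not values from the paper.
    """
    sal = np.asarray(sal, dtype=float)
    gt = np.asarray(gt, dtype=bool)
    pred = sal >= threshold                      # binarized prediction

    tp = np.logical_and(pred, gt).sum()          # correctly detected salient pixels
    n_pred, n_gt = pred.sum(), gt.sum()
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gt if n_gt else 0.0

    # F-measure: weighted harmonic mean of precision and recall.
    denom = beta2 * precision + recall
    f_measure = (1 + beta2) * precision * recall / denom if denom else 0.0

    # Overlapping ratio: intersection over union of prediction and ground truth.
    union = np.logical_or(pred, gt).sum()
    overlap = tp / union if union else 0.0

    # MAE: mean absolute difference between the continuous map and the mask.
    mae = np.abs(sal - gt.astype(float)).mean()

    return {"precision": float(precision), "recall": float(recall),
            "f_measure": float(f_measure), "or": float(overlap),
            "mae": float(mae)}

if __name__ == "__main__":
    sal_map = [[0.9, 0.8], [0.1, 0.6]]
    mask = [[1, 1], [0, 0]]
    print(saliency_scores(sal_map, mask))
```

Higher precision, recall, F-measure, and OR indicate better detection, whereas a lower MAE is better, so the MAE gains quoted in the abstract correspond to a reduction in error.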

Authors: Zhang Qing; Li Yun; Li Wenju; Lin Jiajun; Xiao Mang; Chen Feiyun (School of Computer Science and Information Engineering, Shanghai Institute of Technology, Shanghai 201418, China; School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China)

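Affiliations: School of Computer Science and Information Engineering, Shanghai Institute of Technology; School of Information Science and Engineering, East China University of Science and Technology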

Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, Peking University Core Journal, 2019, No. 7, pp. 1096-1105 (10 pages)

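Funding: National Natural Science Foundation of China (61401281, 61806126, 41671402); Shanghai Institute of Technology Development Fund for Young and Middle-aged Science and Technology Talents (ZQ2018-23)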

Keywords: salient object detection; saliency detection; deep feature; multiple kernel boosting learning; multiscale detection

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]
