
视觉注意机制下结合语义特征的行人检测

Semantic feature-based visual attention model for pedestrian detection
Abstract: Objective To study pedestrian detection in multiple scenes, this paper proposes a pedestrian detection method based on semantic features under the visual attention mechanism. Method First, on the basis of low-level visual features, the semantic feature of pedestrian skin color is incorporated, and a static visual attention model in the spatial domain is established by organically combining bottom-up, data-driven visual attention with top-down, task-driven visual attention. Then, with the semantic feature of motion information, motion saliency is computed from the entropy of the motion vectors to establish a dynamic visual attention model in the temporal domain. On this basis, a spatial-temporal visual attention model is constructed through weighted feature fusion, yielding the visual saliency map, and pedestrian detection is completed by selecting the focus of visual attention. Result Standard datasets and self-captured videos are used for experimental validation on the MATLAB R2012a platform. Compared with other visual attention models in simulation, the proposed method shows good pedestrian detection performance, reaching a 93% detection accuracy on the experimental videos. Conclusion The proposed method is robust in different scenes and can be used to improve the intelligence of existing video surveillance systems.

Objective Pedestrian detection under video surveillance systems has always been a hot topic in computer vision research. These systems are widely used in train stations, airports, large commercial plazas, and other public places. However, pedestrian detection remains difficult because of complex backgrounds. Given its development in recent years, the visual attention mechanism has attracted increasing attention in object detection and tracking research, and previous studies have achieved substantial progress and breakthroughs. We propose a novel pedestrian detection method based on the semantic features under the visual attention mechanism.

Method The proposed semantic feature-based visual attention model is a spatial-temporal model that consists of two parts: the static visual attention model and the motion visual attention model. The static visual attention model in the spatial domain is constructed by combining bottom-up with top-down attention guidance. Based on the characteristics of pedestrians, the bottom-up visual attention model of Itti is improved by intensifying the orientation vectors of elementary visual features to make the visual saliency map suitable for pedestrian detection. In terms of pedestrian attributes, skin color is selected as a semantic feature for pedestrian detection. The regional and Gaussian models are adopted to construct the skin color model. Skin feature-based visual attention guidance is then proposed to complete the top-down process.
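As a rough illustration of the top-down skin-colour cue (not the paper's implementation), a Gaussian skin model can be evaluated in the Cb/Cr chrominance plane to produce a per-pixel saliency map. The mean and covariance below are placeholder values, not the parameters fitted in the paper:

```python
import numpy as np

def skin_saliency(frame_rgb, mean=(113.0, 155.6),
                  cov=((89.2, 12.1), (12.1, 160.1))):
    """Top-down skin-colour cue: Gaussian likelihood over (Cb, Cr).

    `mean` and `cov` are illustrative placeholders, not the values
    fitted by the authors."""
    frame = frame_rgb.astype(np.float64)
    r, g, b = frame[..., 0], frame[..., 1], frame[..., 2]
    # RGB -> Cb/Cr chrominance (ITU-R BT.601 conversion)
    cb = 128.0 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128.0 + 0.5 * r - 0.418688 * g - 0.081312 * b
    x = np.stack([cb, cr], axis=-1) - np.asarray(mean)
    inv = np.linalg.inv(np.asarray(cov))
    # Mahalanobis distance -> Gaussian likelihood, normalised to [0, 1]
    d2 = np.einsum('...i,ij,...j->...', x, inv, x)
    sal = np.exp(-0.5 * d2)
    return sal / sal.max() if sal.max() > 0 else sal
```

Skin-like pixels fall near the Gaussian mean and receive high saliency, while strongly chromatic background pixels (e.g. foliage) are suppressed.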
The bottom-up and top-down visual attentions are linearly combined using the proper weights obtained from experiments to construct the static visual attention model in the spatial domain. The spatial-temporal visual attention model is then constructed via the motion features in the temporal domain. Based on the static visual attention model in the spatial domain, the frame difference method is combined with optical flow to detect motion vectors. Filtering is applied to process the field of motion vectors. The saliency of motion vectors can be evaluated via motion entropy to make the selected motion feature more suitable for the spatial-temporal visual attention model.

Result Standard datasets and practical videos are selected for the experiments. The experiments are performed on a MATLAB R2012a platform. The experimental results show that our spatial-temporal visual attention model demonstrates favorable robustness under various scenes, including indoor train station surveillance videos and outdoor scenes with swaying leaves. Our proposed model outperforms the visual attention model of Itti, the graph-based visual saliency model, the phase spectrum of quaternion Fourier transform model, and the motion channel model of Liu in terms of pedestrian detection. The proposed model achieves a 93% accuracy rate on the test video.

Conclusion This paper proposes a novel pedestrian detection method based on the visual attention mechanism. A spatial-temporal visual attention model that uses low-level and semantic features is proposed to calculate the saliency map. Based on this model, the pedestrian targets can be detected through focus of attention shifts. The experimental results verify the effectiveness of the proposed attention model for detecting pedestrians.
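The motion-entropy idea can be sketched as follows. Since the abstract does not give the exact formulation, the 8-bin direction quantisation and the histogram-based Shannon entropy below are illustrative assumptions: coherent motion (such as a walking pedestrian) concentrates in few direction bins and yields low entropy, while incoherent motion (such as swaying leaves) spreads across bins and yields high entropy.

```python
import numpy as np

def motion_entropy(vx, vy, bins=8, eps=1e-12):
    """Shannon entropy of the motion-vector direction histogram.

    `bins=8` is an assumed quantisation, not the paper's setting."""
    angles = np.arctan2(vy, vx)                       # direction of each vector
    hist, _ = np.histogram(angles, bins=bins, range=(-np.pi, np.pi))
    p = hist / max(hist.sum(), 1)                     # normalised histogram
    return float(-np.sum(p * np.log2(p + eps)))       # entropy in bits
```

A per-block version of this measure could then be thresholded or inverted to form the temporal-domain saliency component that is fused with the static map.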
Source: Journal of Image and Graphics (《中国图象图形学报》, CSCD, Peking University Core), 2016, No. 6: 723-733 (11 pages).
Funding: National Natural Science Foundation of China (1008-GAA14033); Civil Aviation Administration of China Science and Technology Fund (1004-14000202); Open Fund of the China Civil Aviation Information Technology Research Base (1004-ZBA12016); Nanjing University of Aeronautics and Astronautics Science-Engineering Integration Project (1008-56XZA15009).
Keywords: people detection; visual attention model; semantic features; saliency map; skin color; motion entropy

References (21)

1. Ren Z X, Gao S H, Chia L T, et al. Region-based saliency detection and its application in object recognition [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2014, 24(5): 769-779. [DOI: 10.1109/TCSVT.2013.2280096]
2. Mahadevan V, Vasconcelos N. Biologically inspired object tracking using center-surround saliency mechanisms [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(3): 541-554. [DOI: 10.1109/TPAMI.2012.98]
3. Chang K Y, Liu T L, Chen H T, et al. Fusing generic objectness and visual saliency for salient object detection [C]//Proceedings of the IEEE International Conference on Computer Vision. Barcelona: IEEE, 2011: 914-921. [DOI: 10.1109/ICCV.2011.6126333]
4. Itti L, Koch C. Computational modelling of visual attention [J]. Nature Reviews Neuroscience, 2001, 2(3): 194-203. [DOI: 10.1038/35058500]
5. Itti L. Models of bottom-up and top-down visual attention [D]. California: California Institute of Technology, 2000.
6. Itti L, Koch C, Niebur E. A model of saliency-based visual attention for rapid scene analysis [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998, 20(11): 1254-1259. [DOI: 10.1109/34.730558]
7. Harel J, Koch C, Perona P. Graph-based visual saliency [C]//Advances in Neural Information Processing Systems 19. Cambridge, MA: MIT Press, 2007: 545-552.
8. Hou X D, Zhang L Q. Saliency detection: a spectral residual approach [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Minneapolis, MN: IEEE, 2007: 1-8. [DOI: 10.1109/CVPR.2007.383267]
9. Guo C L, Ma Q, Zhang L M. Spatio-temporal saliency detection using phase spectrum of quaternion Fourier transform [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Anchorage, AK: IEEE, 2008: 1-8. [DOI: 10.1109/CVPR.2008.4587715]
10. Einhauser W, Spain M, Perona P. Objects predict fixations better than early saliency [J]. Journal of Vision, 2008, 8(14): 1-26. [DOI: 10.1167/8.14.18]
