期刊文献+

基于单目视觉与深度神经网络的行为识别研究 被引量:1

Study on Human Behavior Recognition Based on Monocular Vision and Deep Neural Network
下载PDF
导出
摘要 为解决多人场景下单目视觉行为检测难度大的问题,通过将单目视觉提取的多种特征集融合提出了一种新的行为检测模型。该模型通过深度卷积神经网络(Convolutional Neural Network,CNN)提取区域候选集,经过池化层获得单目视觉的感兴趣目标集;搜索决定人体姿势的25个骨架关键点,提取各个点的位置坐标向量。将2种特征融合传入CNN预测单目视觉的行为标签。实验结果表明,所提模型获得了较高的单目视觉行为检测精度,对多人场景的检测性能也优于其他对比模型。 To resolve the problem of behavior detection difficulty of monocular vision in the multi-person scene,multiple feature sets extracted from monocular vision images are fused to propose a new behavior detection model.Firstly,the region candidates are extracted through deep Convolutional Neural Network(CNN),and the regions of interest of monocular vision images are generated through the pooling layer;then,twenty-five skeleton critical points are located,the body pose is determined by the twenty-five points,and the position coordinate vectors of each point are extracted.Finally,both feature sets are fused,and then the results are transmitted to CNN to predict the behavior labels of monocular vision.The experimental results show that the proposed model achieves a high behavior detection precision of monocular vision still images;the detection performance is better than the other models in multi-person scenes.
作者 刘兰淇 刘钟涛 LIU Lanqi;LIU Zhongtao(Center of Education Technology,Henan University of Economics and Law,Zhengzhou 450046,China)
出处 《无线电工程》 北大核心 2022年第11期2072-2080,共9页 Radio Engineering
基金 河南科技攻关项目(18210221022)。
关键词 单目视觉 静态影像 深度神经网络 卷积神经网络 行为检测 目标检测 monocular vision still image deep neural networks convolutional neural network behavior detection target detection
  • 相关文献

参考文献9

二级参考文献83

  • 1Fujiyoshi H, Lipton A J, Kanade T. Real-time human mo- tion analysis by image skeletonization. IEICE Transactions on Information and Systems, 2004, 87-D(1): 113-120.
  • 2Chaudhry R, Ravichandran A, Hager G, Vidal R. His- tograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of hu- man actions. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL: IEEE, 2009. 1932-1939.
  • 3Dalal N, Triggs B. Histograms of oriented gradients for human detection. In: Proceedings of the 2005 IEEE Con- ference on Computer Vision and Pattern Recognition. San Diego, CA, USA: IEEE, 2005. 886-893.
  • 4Lowe D G. Object recognition from local scale-invariant fea- tures. In: Proceedings of the 7th IEEE International Confer- ence on Computer Vision. Kerkyra: IEEE, 1999. 1150-1157.
  • 5Schuldt C, Laptev I, Caputo B. Recognizing human actions: a local SVM approach. In: Proceedings of the 17th In- ternational Conference on Pattern Recognition. Cambridge: IEEE, 2004. 32-36.
  • 6Dollar P, Rabaud V, Cottrell G, Belongie S. Behavior recog- nition via sparse spatio-temporal features. In: Proceedings of the 2005 IEEE International Workshop on Visual Surveil- lance and Performance Evaluation of Tracking and Surveil- lance. Beijing, China: IEEE, 2005.65-72.
  • 7Rapantzikos K, Avrithis Y, Kollias S. Dense saliency-based spatiotemporal feature points for action recognition. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL: IEEE, 2009. 1454-1461.
  • 8Knopp J, Prasad M, Willems G, Timofte R, Van Gool L. Hough transform and 3D SURF for robust three dimensional classification. In: Proceedings of the llth European Confer- ence on Computer Vision (ECCV 2010). Berlin Heidelberg: Springer. 2010. 589-602.
  • 9Klaser A, Marszaeek M, Schmid C. A spatio-temporal de- scriptor based on 3D-gradients. In: Proceedings of the 19th British Machine Vision Conference. Leeds: BMVA Press, 2008. 99.1-99.10.
  • 10Wang H, Ullah M M, Klaser A, Laptev I, Schmid C. Evalua- tion of local spatio-temporal features for action recognition. In: Proceedings of the 2009 British Machine Vision Confer- ence. London, UK: BMVA Press, 2009. 124.1-124.11.

共引文献209

同被引文献7

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部