期刊文献+

基于三维残差稠密网络的人体行为识别算法 被引量:9

Human behavior recognition algorithm based on three-dimensional residual dense network
下载PDF
导出
摘要 针对现有的人体行为识别算法不能充分利用网络多层次时空信息的问题,提出了一种基于三维残差稠密网络的人体行为识别算法。首先,所提算法使用三维残差稠密块作为网络的基础模块,模块通过稠密连接的卷积层提取人体行为的层级特征;其次,经过局部特征聚合自适应方法来学习人体行为的局部稠密特征;然后,应用残差连接模块来促进特征信息流动以及减轻训练的难度;最后,通过级联多个三维残差稠密块实现网络多层局部特征提取,并使用全局特征聚合自适应方法学习所有网络层的特征用以实现人体行为识别。设计的网络算法在结构上增强了对网络多层次时空特征的提取,充分利用局部和全局特征聚合学习到更具辨识力的特征,增强了模型的表达能力。在基准数据集KTH和UCF-101上的大量实验结果表明,所提算法的识别率(top-1精度)分别达到了93.52%和57.35%,与三维卷积神经网络(C3D)算法相比分别提升了3.93和13.91个百分点。所提算法框架有较好的鲁棒性和迁移学习能力,能够有效地处理多种视频行为识别任务。 Concerning the problem that the existing algorithm for human behavior recognition cannot fully utilize the multi-level spatio-temporal information of network, a human behavior recognition algorithm based on three-dimensional residual dense network was proposed. Firstly, the proposed network adopted the three-dimensional residual dense blocks as the building blocks, these blocks extracted the hierarchical features of human behavior through the densely-connected convolutional layer. Secondly, the local dense features of human behavior were learned by the local feature aggregation adaptive method. Thirdly, residual connection module was adopted to facilitate the flow of feature information and mitigate the difficulty of training. Finally, after realizing the multi-level local feature extraction by concatenating multiple three-dimensional residual dense blocks, the aggregation adaptive method for global feature was proposed to learn the features of all network layers for realizing human behavior recognition. In conclusion, the proposed algorithm has improved the extraction of network multi-level spatio-temporal features and the features with high discrimination are learned by local and global feature aggregation, which enhances the expression ability of model. The experimental results on benchmark datasets KTH and UCF-101 show that, the recognition rate(top-1 recognition accuracy) of the proposed algorithm can achieve 93.52% and 57.35% respectively, which outperforms that of Three-Dimensional Convolutional neural network(C3 D) algorithm by 3.93 percentage points and 13.91 percentage points respectively. The proposed algorithm framework has excellent robustness and migration learning ability, and can effectively handle multiple video behavior recognition tasks.
作者 郭明祥 宋全军 徐湛楠 董俊 谢成军 GUO Mingxiang;SONG Quanjun;XU Zhannan;DONG Jun;XIE Chengjun(Institute of Intelligent Machines,Chinese Academy of Sciences,Hefei Anhui 230031,China;University of Science and Technology of China,Hefei Anhui 230026,China)
出处 《计算机应用》 CSCD 北大核心 2019年第12期3482-3489,共8页 journal of Computer Applications
基金 国家重点研发计划项目(2017YFC0806504) 安徽省科技强警项目(201904d07020007)~~
关键词 人体行为识别 视频分类 三维残差稠密网络 深度学习 特征聚合 human behavior recognition video classification three-dimensional residual dense network deep learning feature aggregation
  • 相关文献

参考文献2

二级参考文献143

  • 1BERGTHOLDT M, KAPPES J, SCHMIDT S, et al. A study of parts-based object class detection using complete graphs[J]. International Journal of Computer Vision, 2010, 87(1/2) : 93 - 117.
  • 2LAPTEV I, LINDEBERG T. Local descriptors for spatio-temporal recognition[C] // Proceedings of ECCV Workshop on Spatial Coherence for Visual Motion Analysis. New York: Springer, 2004:1 - 12.
  • 3LAPTEV I, LINDEBERG T. Velocity adaptation of space-time interest points[ C] // Proceedings of the International Conference on Pattern Recognition. Washington, DC: IEEE Computer Society, 2004:52 - 56.
  • 4SINGH M, BASU A, MANDAL M K. Human activity recognition based on silhouette directionality[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2008, 18(9) : 1280 - 1292.
  • 5SCHULDT C , LAPTEV I , CAPUTO B . Recognizing human actions: A local SVM approach [C] // Proceedings of the International Conference on Pattern Recognition. Washington, DC: IEEE Computer Society, 2004:32 -36.
  • 6BUTUROVIC L J. PCP: A program for supervised classification of gene expression profiles[J]. Bioinformatics, 2006, 22(2): 245-247.
  • 7Mokhber A,Achard C,Milgram M. Recognition of Human Behavior by Space-Time Silhouette Characterization[J].Pattern Recognition Let-ters,2008,(01):81-89.
  • 8Polat E,Yeasin M,Sharma R. Robust Tracking of Human Body Parts for Collaborative Human Computer Interaction[J].{H}COMPUTER VISION AND IMAGE UNDERSTANDING,2003,(01):44-69.
  • 9Kjellstr?m H,Romero J,Kragic' D. Visual Object-Action Recogni-tion:Inferring Object Affordances from Human Demonstration[J].{H}COMPUTER VISION AND IMAGE UNDERSTANDING,2011,(01):81-90.
  • 10Suma E A,Krum D M,Lange B. Adapting User Interfaces for Gestural Interaction with the Flexible Action and Articulated Skele-ton Toolkit[J].Computers& Graphics,2012,(03):193-201.

共引文献104

同被引文献33

引证文献9

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部