期刊文献+

基于注意力机制的改进残差网络的人体行为识别方法 被引量:5

Human Action Recognition Method based on Attention Mechanism and Improved ResNeXt Network
下载PDF
导出
摘要 针对ResNeXt网络(残差网络)中存在的对特征提取不充分,以及数据集中背景信息干扰的问题,将ResNeXt网络和注意力机制相结合,提出了一种基于注意力机制的ResNeXt模型。首先,在ResNeXt网络的基础上,将浅层和深层的特征融合生成新型网络结构。其次,将全连接层由全局平均池化层替代,然后在通道空间注意力机制中添加一个条件因子,同时将改进后的注意力机制嵌入上述网络中。最后,在UCF101和HMDB51上分别进行实验,得到了95.2%和65.6%的准确率。研究表明,本文提出的模型可以有效地提取关键特征,充分利用不同层次的特征信息获得较好的准确率。 Aiming at problems of insufficient feature extraction in ResNeXt network and background information interference in the dataset,this paper proposes a ResNeXt model based on attention mechanism,which combines the ResNeXt network and attention mechanism.First,based on ResNeXt network,shallow and deep features are merged to generate a new network structure.Second,the fully connected layer is replaced by a global average pooling layer.Then channel attention mechanism is improved by adding a condition factor.At the same time,the improved attention mechanism is embedded in the above-mentioned network.Finally,experiments are performed on UCF101 and HMDB51 respectively,and the accuracy rates of 95.2% and 65.6% are obtained.Experiments show that the proposed model can effectively extract key features,and make full use of feature information of different layers to achieve better accuracy.
作者 王昊飞 李俊峰 WANG Haofei;LI Junfeng(Faculty of Mechanical Engineering&Automation,Zhejiang Sci-Tech University,Hangzhou 310018,China)
出处 《软件工程》 2021年第11期51-54,46,共5页 Software Engineering
关键词 人体行为识别 注意力机制 ResNeXt 全局平均池化 human action recognition attention mechanism ResNeXt network global average pooling
  • 相关文献

参考文献7

二级参考文献110

  • 1Fujiyoshi H, Lipton A J, Kanade T. Real-time human mo- tion analysis by image skeletonization. IEICE Transactions on Information and Systems, 2004, 87-D(1): 113-120.
  • 2Chaudhry R, Ravichandran A, Hager G, Vidal R. His- tograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of hu- man actions. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL: IEEE, 2009. 1932-1939.
  • 3Dalal N, Triggs B. Histograms of oriented gradients for human detection. In: Proceedings of the 2005 IEEE Con- ference on Computer Vision and Pattern Recognition. San Diego, CA, USA: IEEE, 2005. 886-893.
  • 4Lowe D G. Object recognition from local scale-invariant fea- tures. In: Proceedings of the 7th IEEE International Confer- ence on Computer Vision. Kerkyra: IEEE, 1999. 1150-1157.
  • 5Schuldt C, Laptev I, Caputo B. Recognizing human actions: a local SVM approach. In: Proceedings of the 17th In- ternational Conference on Pattern Recognition. Cambridge: IEEE, 2004. 32-36.
  • 6Dollar P, Rabaud V, Cottrell G, Belongie S. Behavior recog- nition via sparse spatio-temporal features. In: Proceedings of the 2005 IEEE International Workshop on Visual Surveil- lance and Performance Evaluation of Tracking and Surveil- lance. Beijing, China: IEEE, 2005.65-72.
  • 7Rapantzikos K, Avrithis Y, Kollias S. Dense saliency-based spatiotemporal feature points for action recognition. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL: IEEE, 2009. 1454-1461.
  • 8Knopp J, Prasad M, Willems G, Timofte R, Van Gool L. Hough transform and 3D SURF for robust three dimensional classification. In: Proceedings of the llth European Confer- ence on Computer Vision (ECCV 2010). Berlin Heidelberg: Springer. 2010. 589-602.
  • 9Klaser A, Marszaeek M, Schmid C. A spatio-temporal de- scriptor based on 3D-gradients. In: Proceedings of the 19th British Machine Vision Conference. Leeds: BMVA Press, 2008. 99.1-99.10.
  • 10Wang H, Ullah M M, Klaser A, Laptev I, Schmid C. Evalua- tion of local spatio-temporal features for action recognition. In: Proceedings of the 2009 British Machine Vision Confer- ence. London, UK: BMVA Press, 2009. 124.1-124.11.

共引文献223

同被引文献30

引证文献5

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部