期刊文献+

基于改进型C3D神经网络的动作识别技术 被引量:2

Action Recognition Technology Based on Improved C3D Neural Network
下载PDF
导出
摘要 Facebook提出的C3D三维卷积神经网络虽然能达到良好的视频动作识别准确率,但是在速度方面还有很大的改进余地,而且训练得到的模型过大,不便于移动设备使用。本文利用小型卷积核能够减少参数的特点,对已有网络结构进行优化,提出一种新的动作识别方案,将原C3D神经网络常用的3×3×3卷积核分解成深度卷积和点卷积(1×1×1卷积核),并且在UCF101数据集和ActivityNet数据集训练测试。结果表明,与原C3D网络进行对比:改进后的C3D网络准确率比C3D提升了2.4%,在速度方面比C3D提升了12.9%,模型大小压缩到原来的25.8%。 Although the C3D convolutional neural network proposed by Facebook can achieve good video action recognition accuracy, there is still much room for improvement in terms of speed, and the model obtained by training is too large to be used by mobile devices. This paper uses small convolutional kernels to reduce the characteristics of parameters, optimizes the existing network structure, and proposes a new action recognition scheme, which decomposes the 3×3×3 convolutional kernel commonly used in the original C3D neural network into deep convolution and point convolution(1×1×1 convolution kernel), and training tests on the UCF101 dataset and ActivityNet dataset. The results show that compared with the original C3D network, the improved C3D network accuracy is 2.4% higher than C3D, 12.9% faster than C3D in speed, and the model size is compressed to 25.8%.
作者 廖小东 贾晓霞 LIAO Xiao-dong;JIA Xiao-xia(Eigth System Department,North China Institute of Computing Technology,Beijing 100083,China)
出处 《计算机与现代化》 2019年第3期32-38,共7页 Computer and Modernization
关键词 动作识别 卷积分解 识别速度 模型压缩 action recognition convolution decomposition recognition speed model compression
  • 相关文献

参考文献2

二级参考文献213

  • 1王素玉,沈兰荪.智能视觉监控技术研究进展[J].中国图象图形学报,2007,12(9):1505-1514. 被引量:82
  • 2Bouwmans T, El Baf F, Vachon B. Background modeling using mixture of Gaussians for foreground detection: A survey. Recent Patents on Computer Science, 2008, 1(3) 219-237.
  • 3Wojek C, Dollar P, Schiele B, Perona P. Pedestrian detection: An evaluation o{ the state o{ the art. IEEE Pattern Analysis and Machine Intelligence, 2012, 34(4): 743-761.
  • 4Yilmaz A, Javed O, Shah M. Object trackingt A survey. ACM Computing Surveys (CSUR), 2006, 38(4) 1-29.
  • 5Wang X. Intelligent multi-camera video surveillance: A review. Pattern Recognition Letters, 2012, 34 (1) : 3-19.
  • 6Wu Y, Lira J, Yang M H. Online object tracking: A bench- mark//Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. Portland, USA, 2013 2411-2418.
  • 7Andreopoulos A, Tsotsos J K. 50 years of object recognition: Directions forward. Computer Vision and Image Understanding, 2013, 117(8) 827-891.
  • 8Zhang X, Yang Y H, Han Z, et al. Object class detection: A survey. Association for Computing Machinery Computing Surveys (CSUR), 2013, 46(1) : 1311-1325.
  • 9Morris B T, Trivedi M M. A survey of vision-based trajectory learning and analysis for surveillance. IEEE Transactions on Circuits and Systems for Video Technology, 2008, 18(8): 1114-1127.
  • 10Aggarwal J K, Ryoo M S. Human activity analysis: A review. ACM Computing Surveys, 2011, 43(3): 16.

共引文献409

同被引文献18

引证文献2

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部