期刊文献+

时空域深度卷积神经网络及其在行为识别上的应用 被引量:24

Spatiotemporal Convolutional Neural Networks and its Application in Action Recognition
下载PDF
导出
摘要 近年来深度卷积神经网络在静态图像识别上取得了较大进展,但在行为视频上建模运动信息的能力较弱。但是,运动信息是行为识别区别于静态图像识别的关键。基于滤波器响应积提出了时空域深度卷积神经网络。该网络先将相邻帧对应的卷积核分为两组,近似地形成傅里叶基函数对,后续的乘法层将不同帧产生的响应两两相乘后再输入加法层求和,从而将相邻帧映射到变换矩阵的特征值对应的不变子空间上,依靠相邻帧在不变子空间上的旋转角度检测它们之间的运动特征。理论分析证明,网络既对运动敏感,又对内容敏感。实验表明,该网络能对行为视频做出更准确的分类,并与近年出现的其他6种算法进行比较,结果体现了本算法的优越性。 The key thing that distinguishes action recognition from other recognition tasks is to encode motion explicitly.But,so far,most works based on convolutional neural networks(CNN)cannot properly handle the spatiotemporal interaction in video.We developed a spatiotemporal-CNN that explicitly exploits this important cue provided by video.Instead of summing filter responses,responses are multiplied and our approach is based on that.Specifically,the spatiotemporal-CNN divides convolutional kernels into two groups forming sinusoidals of Fourier Transform.Then,the responses of convolutional kernels are multiplied by multiplicative layer as calculating covariance and the outputs are put into sum layer.In this way,the inputs and adjacent frames are mapped into the subspaces spanned by the eigenvectors,and the special geometric transformations or motion features can be checked by the rotating angles in that space.Additional theoretical analysis proves that spatiotemporal-CNN is sensitive to both motion and content.The experiment shows that our approach produces more accurate classification than current algorithms.
出处 《计算机科学》 CSCD 北大核心 2015年第7期245-249,共5页 Computer Science
关键词 时空域 卷积神经网络 深度学习 动作特征 行为识别 Spatiotemporal Convolutional neural networks Deep learning Motion feature Action recognition
  • 相关文献

参考文献15

二级参考文献88

  • 1王福豹,史龙,任丰原.无线传感器网络中的自身定位系统和算法[J].软件学报,2005,16(5):857-868. 被引量:671
  • 2熊忠阳,周亚峰.Web访问挖掘的预处理技术的研究[J].计算机技术与发展,2007,17(8):11-14. 被引量:19
  • 3Wang L, Hu W M, Tan T N. Recent developments in human motion analysis[J]. Pattern Recognition, 2003,36 (3) : 585-601.
  • 4Johnson N, Hogg D. Learning the distribution of object trajectories for event recognition[J].Image and Vision Computing, 1995,14(8) :609-615.
  • 5Brand M, Oliver N, Pentland A. Coupled hidden markov models for complex action recognition [C]///Proceedings of IEEE International Conference on Computer Vision. San Juan, Puerto Rico: IEEE, 1997 : 994-999.
  • 6Dollar P, Rabaud V, Cottrell G. Behavior recognition via sparse spatio-temporal features [C]/Proc. 2^nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance. China, 2005 : 65-72.
  • 7Hongeng S, Nevatia R. Multi-agent event recognition[C]// Proceedings of Eighth International Conference on Computer Vision. Vancouver, BC, Canada: IEEE, 2001 : 84-91.
  • 8Russo R,Shah M,Lobo N. A computer vision system for monitoring production of fast food [C]//Proceedings of The 5th Asian Conference on Computer Vision. Vancouver, Melbourne, Australia, 2002.
  • 9Wren C, Azarbayejani A, Darrell T. Pfinder: Real-time tracking of the human body[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(7) : 780-785.
  • 10Haritaoglu I, Harwood D, Davis L S. W4: Who when where what a real time system for detecting and tracking people[C]//Proceedings of International Conference on Face and Gesture Recognition. Nara,Japan: IEEE, 1998.

共引文献305

同被引文献190

引证文献24

二级引证文献271

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部