期刊文献+

有效视频帧时间序池化的人体行为识别算法 被引量:4

Human Action Recognition Algorithm with Temporal Rank Pooling of Valid Video Frames
下载PDF
导出
摘要 为利用人体行为的时域信息并减少帧间冗余及特征维数,提出一种提取有效视频帧并对其时间序池化的人体行为识别算法。通过对视频帧的稠密轨迹特征进行局部累计描述向量编码,获取视频帧特征表示,对每帧的特征编码进行余弦相似度分析,剔除冗余特征帧得到有效视频帧特征序列。采用时间序池化对有效视频帧特征序列进行排序,得到可表示视频时序动态变化的特征向量,然后训练支持向量机实现人体行为识别。在HMDB51和UCF101数据集上的实验结果表明,与稠密轨迹行为识别算法相比,该算法可有效提高识别准确率。 In order to make full use of the video-wide temporal information and reduce the redundant frames and dimensions of features,a method of extracting valid video frames and performing temporal rank pooling for human action recognition is proposed.Vector of Locally Aggregated Descriptors(VLAD)is used to encode dense trajectory features of every frame of video to get feature representations.The cosine similarity analysis of frame features is employed to remove the redundant features and extract feature sequence of valid video frames.Temporal rank pooling is performed to order feature sequence of valid frames temporally and get the feature vectors capturing the evolution of video-wide temporal information.Support Vector Machine(SVM)is learned to get the results of human action recognition.Experimental results conducted on HMDB51 and UCF101 datasets show that compared with the dense trajectory recognition algorithm,the proposed agorithm has improved the recognition accuracy.
作者 鹿天然 于凤芹 陈莹 LU Tianran;YU Fengqin;CHEN Ying(School of Internet of Things Engineering,Jiangnan University,Wuxi,Jiangsu 214122,China)
出处 《计算机工程》 CAS CSCD 北大核心 2018年第12期271-275,287,共6页 Computer Engineering
基金 国家自然科学基金(61573168) 中央高校基本科研业务费专项资金(JUSRP51733B)
关键词 行为识别 稠密轨迹 局部累计描述向量 余弦相似度分析 时间序池化 action recognition dense trajectory Vector of Locally Aggregated Descriptors(VLAD) cosine similarity analysis temporal rank pooling
  • 相关文献

参考文献2

二级参考文献190

  • 1李妍婷,罗予频,唐光荣.单目视频中的多视角行为识别方法[J].计算机应用,2006,26(7):1592-1594. 被引量:8
  • 2冯波,赵春晖,杨涛,张洪才,程咏梅.基于光流特征与序列比对的实时行为识别[J].计算机应用研究,2007,24(3):194-196. 被引量:6
  • 3Aggarwal J K, Cai Q. Human motion analysis: A review [ J]. Computer Vision and Image Understanding, 1999, 73 (3) : 428-440.
  • 4Gavrila D M. The visual analysis of human movement: A survey [ J]. Computer Vision and Image Understanding, 1999, 73( 1 ): 82-98.
  • 5Moeslund Thomas B, Granum Erik. A survey of computer visionbased human motion capture [ J ]. Computer Vision and Image Understanding, 2001, 81 (3): 231-286.
  • 6Moeslund Thomas B, Hilton Adrian, Kruger Volker. A survey of advances in vision-based human motion capture and analysis [ J]. Computer Vision and Image Understanding, 2006, 104(3) : 90-126.
  • 7Johansson G. Visual motion perception [ J ]. Scientific American, 1975, 232(2) : 76-88.
  • 8Robertson N, Reid I. A general method for human activity recognition in video [ J ]. Computer Proceedings of Vision and Image Understanding, 2006, 104(2-3): 232-248.
  • 9Ryoo M S, Aggarwal J K. Recognition of composite human activities through context-free grammar based representation [ A ]. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition [C], New York, USA, 2006: 1709-1718.
  • 10Wang Liang, Suter David. Informative shape representations for human action recognition [ A ] . In: Proceedings of International Conference on Pattern Recognition [ C ], Hong Kong, 2006: 1266-1269.

共引文献132

同被引文献6

引证文献4

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部