A Hierarchical Approach Based on Deep Learning for Human Interactive-action Recognition
Abstract: This paper addresses the recognition of interaction-level human activities with a hierarchical approach. Human behavior is decomposed into four levels of increasing complexity: pose, atomic action, composite action, and interaction. In the bottom layer, a pyramidal stacked denoising autoencoder is trained to recognize, with high accuracy, the pose of each person in the raw video, producing a pose sequence. In the middle layer, hidden Markov models (HMMs) of the atomic actions are built, and an evaluation-demarcation algorithm is proposed to detect the atomic actions contained in the pose sequence and to speed up the computation. In the top layer, taking the atomic-action sequence as input, a description method based on context-free grammar (CFG) is used to represent and recognize composite actions and interactions; a new set of spatial predicates is proposed, and face orientation is introduced to describe activities. Activity videos were captured with a Kinect sensor, and experimental results on this dataset show that the method recognizes human interactive behavior effectively.
Source: Journal of Xiamen University (Natural Science), 2016, Issue 3: 413-419 (7 pages; indexed in CAS, CSCD, and the Peking University Core Journal list)
Funding: National Natural Science Foundation of China (Grant No. 60975084)
Keywords: human action recognition; deep learning; hidden Markov model (HMM); context-free grammar (CFG); Kinect
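
The bottom layer trains a stacked denoising autoencoder (SDAE) to map each skeleton frame to a pose label. The following is a minimal sketch of that idea in PyTorch, assuming a 60-dimensional skeleton input (20 Kinect joints × 3 coordinates), masking noise, and 8 pose classes; these settings, like the layer sizes, are illustrative assumptions rather than the paper's configuration, and the paper's pyramidal variant is not reproduced.

```python
# Minimal SDAE sketch: greedy layer-wise pretraining, then a pose classifier.
# All dimensions, the noise rate, and the pose count are assumptions.
import torch
import torch.nn as nn

class DenoisingAutoencoder(nn.Module):
    """One layer: corrupt the input with masking noise, then reconstruct it."""
    def __init__(self, in_dim, hid_dim, noise=0.3):
        super().__init__()
        self.noise = noise
        self.encoder = nn.Sequential(nn.Linear(in_dim, hid_dim), nn.Sigmoid())
        self.decoder = nn.Sequential(nn.Linear(hid_dim, in_dim), nn.Sigmoid())

    def forward(self, x):
        # Masking noise: randomly zero a fraction of the inputs.
        corrupted = x * (torch.rand_like(x) > self.noise).float()
        return self.decoder(self.encoder(corrupted))

def pretrain_layer(dae, data, epochs=50, lr=1e-3):
    """Train one autoencoder to reconstruct clean data; keep its encoder."""
    opt = torch.optim.Adam(dae.parameters(), lr=lr)
    loss_fn = nn.MSELoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(dae(data), data).backward()
        opt.step()
    return dae.encoder

frames = torch.rand(256, 60)                       # dummy skeleton frames
enc1 = pretrain_layer(DenoisingAutoencoder(60, 40), frames)
enc2 = pretrain_layer(DenoisingAutoencoder(40, 20), enc1(frames).detach())
pose_net = nn.Sequential(enc1, enc2, nn.Linear(20, 8))  # 8 pose classes assumed
pose_logits = pose_net(frames)                     # fine-tune with pose labels
```

In practice the stacked encoders would be fine-tuned end to end with labeled poses before the classifier's output is passed upward as a pose sequence.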
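
The middle layer builds one HMM per atomic action and decides which action a stretch of the pose sequence contains. A minimal sketch of the evaluation step follows, using the standard forward algorithm in log space; the paper's evaluation-demarcation speedup is not reproduced here, and the two toy models, their parameters, and the action names are assumptions.

```python
# Score a pose-label sequence against one HMM per atomic action and pick the
# best-scoring model. Toy setup: 2 hidden states, 3 observable pose labels.
import numpy as np

def forward_log_likelihood(obs, start, trans, emit):
    """log P(obs | HMM) via the forward algorithm in log space."""
    alpha = np.log(start) + np.log(emit[:, obs[0]])
    for o in obs[1:]:
        # logsumexp over previous states, then add the emission term
        alpha = np.logaddexp.reduce(alpha[:, None] + np.log(trans), axis=0)
        alpha = alpha + np.log(emit[:, o])
    return np.logaddexp.reduce(alpha)

def recognize_atomic_action(pose_seq, models):
    """models: name -> (start, trans, emit); return the best-scoring action."""
    scores = {name: forward_log_likelihood(pose_seq, *params)
              for name, params in models.items()}
    return max(scores, key=scores.get)

models = {
    "raise_arm": (np.array([0.9, 0.1]),
                  np.array([[0.7, 0.3], [0.2, 0.8]]),
                  np.array([[0.8, 0.1, 0.1], [0.1, 0.1, 0.8]])),
    "step_forward": (np.array([0.5, 0.5]),
                     np.array([[0.6, 0.4], [0.4, 0.6]]),
                     np.array([[0.1, 0.8, 0.1], [0.3, 0.3, 0.4]])),
}
print(recognize_atomic_action([0, 0, 2, 2], models))   # -> "raise_arm"
```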
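
The top layer describes composite actions and interactions as CFG productions over the atomic-action tokens produced by the HMM layer, and recognizes them by parsing. The sketch below uses NLTK's chart parser with an invented handshake grammar; the rules and token names are illustrative assumptions, and the paper's spatial predicates and face-orientation cues are omitted for brevity.

```python
# Recognize an interaction by parsing an atomic-action sequence with a CFG.
# The grammar is a made-up example, not the paper's actual rule set.
import nltk

grammar = nltk.CFG.fromstring("""
    Interaction -> Approach Handshake | Approach Handshake Depart
    Approach    -> 'step_forward' | 'step_forward' Approach
    Handshake   -> 'raise_arm' 'grasp_hand' 'lower_arm'
    Depart      -> 'step_back' | 'step_back' Depart
""")
parser = nltk.ChartParser(grammar)

# Atomic-action tokens emitted by the HMM layer (toy example).
tokens = ["step_forward", "step_forward",
          "raise_arm", "grasp_hand", "lower_arm"]
trees = list(parser.parse(tokens))
if trees:
    trees[0].pretty_print()        # the sequence parses as an Interaction
else:
    print("no interaction recognized")
```

If the token stream does not derive from the start symbol, the parser returns no tree, which is how unrecognized behavior is rejected at this layer.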


共引文献3

同被引文献52

引证文献4

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部