面向人体动作预测的对称残差网络

Symmetric Residual Network for Human Motion Prediction

导出

摘要为了研究不同残差连接方式对人体动作预测卷积神经网络的影响,探讨了在保持网络深度一定的情况下,如何利用残差连接构成一个高效捕捉人体动作特征的预测模型。通过观察人体骨骼关节点排列方式,提出一种适用于人体骨骼关节点预测的对称残差连接方法,并基于该方法设计了对称残差块(symmetric residual block,SRB)。所设计的SRB,最后一层卷积核的感受野达到最大,覆盖了人体全部关节信息,采用的对称连接方式高效地利用浅层动态特征,使预测的效果更好、模型使用的参数更少。此外,本文提出一种基于2个SRB和1个解码器的端到端卷积网络——对称残差网络(symmetric residual network,SRNet),取得的预测结果高于基准方法。最后,在TensorFlow框架下利用公开数据集Human3.6M和CMU-Mocap进行了人体动作预测实验。其结果表明,与基准方法相比,本文方法的关节位置平均误差(mean per joint postion error,MPJPE)在各个预测时间点上均有0.2 mm~1 mm的降低,验证了本文提出的SRNet能有效建模人体姿态的全局空间特征。 To study the influence of different residual connection methods on CNN(convolutional neural network) for human motion prediction, this paper investigates how to use residual connection to construct an effective prediction model for capturing the human motion features by the network with a certain depth. Through observing the arrangement of human skeletal joints, a symmetric residual connection method is proposed for predicting the human skeletal joints, and a symmetric residual block(SRB) is designed based on the proposed method. In the designed SRB, the receptive field of the last convolution kernel is maximized, covering all the joint information of the human body. The symmetric connection method is adopted to efficiently utilize the shallow dynamic features, and consequently improve the prediction performance and reduce the model parameters. Based on two SRBs and one decoder, an end-to-end convolutional network is proposed, named as symmetric residual network(SRNet), by which a higher accuracy is achieved comparing with the baseline methods. In the framework of TensorFlow, human motion prediction experiments are carried out on two public datasets, Human3.6M and CMU-Mocap.The results indicate that, the proposed method reduces the mean per joint position error(MPJPE) by 0.2 mm~1 mm at each prediction time point comparing with the baseline methods, which confirms the effectiveness of the proposed SRNet for modeling the human global spatial features.

作者张晋唐进尹建芹 ZHANG Jin;TANG Jin;YIN Jianqin(School of Artificial Intelligence,Beijing University of Posts and Telecommunications,Beijing 100876,China)

机构地区北京邮电大学人工智能学院

出处《机器人》 EI CSCD 北大核心 2022年第3期291-298,共8页 Robot

基金国家自然科学基金(61673192) 中央高校基本科研业务费(2020XD-A04-2)。

关键词人体动作预测对称残差连接卷积神经网络骨骼关节点建模 human motion prediction symmetric residual connection convolutional neural network skeletal joints modeling

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1刘今越,李顺达,陈梦倩,郭士杰.面向移乘搬运护理机器人的人体姿态视觉识别[J].机器人,2019,41(5):601-608. 被引量：14
2林安迪,干旻峰,葛涵,唐宇存,徐海东,匡绍龙,黄立新,孙立宁.基于模糊模型参考学习控制的手术机器人人机交互[J].机器人,2019,41(4):543-550. 被引量：7
3马淼,李贻斌.基于多级动态模型的2维人体姿态估计[J].机器人,2016,38(5):578-587. 被引量：9
4谭嘉崴,丁其川,白忠玉.基于视频帧连贯信息的3维人体姿势优化估计方法[J].机器人,2021,43(1):9-16. 被引量：9

二级参考文献37

1刘今越,李顺达,陈梦倩,郭士杰.面向移乘搬运护理机器人的人体姿态视觉识别[J].机器人,2019,41(5):601-608. 被引量：14
2Dautenhahn K. Socially intelligent robots: Dimensions of human-robot interaction[J]. Philosophical Transactions of the Royal Society of London, B: Biological Sciences, 2007, 362(1480): 679-704.
3Atkeson C G, Hale J G, Pollick F E, et al. Using humanoid robots to study human behavior[J]. IEEE Intelligent Systems and Their Applications, 2000, 15(4): 46-55.
4Yang Y Z, Li Y, Fermtiller C, et al. Robot learning manipula- tion action plans by "watching" unconstrained videos from the World Wide Web[C]//Proceedings of the 29th AAAI Confer- ence on Artificial Intelligence. 2015: 3686-3693.
5Koppula H S, Gupta R, Saxena A. Learning human activities and object affordances from RGB-D videos[J]. International Journal of Robotics Research, 2013, 32(8): 951-970.
6Yang Y, Ramanan D. Articulated human detection with flexible mixtures of parts[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(12): 2878-2890.
7Dantone M, Gall J, Leistner C, et al. Human pose estimation us- ing body parts dependent joint regressors[C]//IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, USA: IEEE, 2013: 3041-3048.
8Fischler M A, Elschlager R A. The representation and match- ing of pictorial structures[J]. IEEE Transactions on Computers, 1973, 22(1): 67-92.
9Freifeld O, Weiss A, Zuffl S, et al. Contour people: A parame- terized model of 2D articulated human shape[C]//IEEE Confer- ence on Computer Vision and Pattern Recognition. Piscataway, USA: IEEE, 2010: 639-646.
10Zuffi S, Freifeld O, Black M J. From pictorial structures to de- formable structures[C]//IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, USA: IEEE, 2012: 3546- 3553.

共引文献34

1费树岷,赵宏涛,杨艺,李春锋.基于时序拓扑非共享图卷积和多尺度时间卷积的骨架行为识别[J].信息与控制,2023,52(6):758-772.
2姚晶晶.体育运动视频人体关节点运动轨迹自动识别方法[J].商丘师范学院学报,2022,38(12):16-20.
3刘今越,李顺达,陈梦倩,郭士杰.面向移乘搬运护理机器人的人体姿态视觉识别[J].机器人,2019,41(5):601-608. 被引量：14
4李庆武,席淑雅,王恬,马云鹏,周亮基.结合位姿约束与轨迹寻优的人体姿态估计[J].光学精密工程,2017,25(4):1060-1069. 被引量：4
5朱凌飞,万旺根.基于骨架模型的人体行为分析[J].电子测量技术,2019,42(8):68-73.
6蒋莹.基于Kinect的体育运动辅助训练研究[J].自动化技术与应用,2019,38(9):151-153. 被引量：4
7刘静.基于大数据的人体行为特征方向估计模型仿真[J].计算机仿真,2019,36(9):422-425. 被引量：1
8陈靓,黄玉平,陶云飞,贾龙飞,郭亚星.基于阻抗模型的下肢康复机器人交互控制系统设计[J].计算机测量与控制,2020,28(4):116-120. 被引量：8
9华丹.基于区块链技术的工业机器人视觉检测及避障系统设计[J].计算机测量与控制,2020,28(7):69-73. 被引量：5
10刘今越,刘彦开,贾晓辉,郭士杰.基于模型约束的人体姿态视觉识别算法研究[J].仪器仪表学报,2020,41(4):208-217. 被引量：7

1赵敬娇,赵志宏,杨绍普.基于残差连接和1D-CNN的滚动轴承故障诊断研究[J].振动与冲击,2021,40(10):1-6. 被引量：32
2庞鸿宇,于龙,高仕斌.基于卷积网络的受电弓图像目标检测与矫正方法研究[J].电气化铁道,2021,32(5):1-5. 被引量：1
3熊中敏,舒贵文,郭怀宇.融合用户偏好的图神经网络推荐模型[J].计算机科学,2022,49(6):165-171. 被引量：2
4宋相兵,季玉龙,俎文强,何扬,杨红雨.基于触觉传感器和强化学习内在奖励的机械臂抓取方法[J].四川大学学报（自然科学版）,2022,59(3):53-62. 被引量：2
5陈成,张皞,李永强,冯远静.关系生成图注意力网络的知识图谱链接预测[J].浙江大学学报（工学版）,2022,56(5):1025-1034. 被引量：5
6Malvika Arya,Almyr S.Sabrosa,Jay S.Duker,Nadia K.Waheed.Choriocapillaris changes in dry age-related macular degeneration and geographic atrophy: a review[J].Eye and Vision,2018,5(1):204-210. 被引量：4
7郭凡,杨操,郭锐,姜炜.超轻碳气凝胶的机械鲁棒性增强策略及其应用[J].应用数学和力学,2022,43(5):499-514. 被引量：1
8徐良,忻俊杰,王恒毅,李文磊.基于变权重拟合的并行组合电价预测模型研究[J].数据通信,2022(2):46-51.
9梁骁,黄文明,姚俊,温雅媛,邓珍荣.结合多注意力和条件变分自编码器的宋词生成模型[J].广西科学,2022,29(2):308-315. 被引量：1
10钱龙,赵静,韩京宇,毛毅.基于标签相关性的K近邻多标签学习[J].计算机工程,2022,48(6):73-78. 被引量：2

机器人

2022年第3期

浏览历史

内容加载中请稍等...

面向人体动作预测的对称残差网络

参考文献4

二级参考文献37

共引文献34

相关作者

相关机构

相关主题

浏览历史