Human Pose Estimation Algorithm Based on Channel Splitting

Abstract: To improve the accuracy and speed of human pose estimation, this paper proposes a channel-splitting algorithm named Channel-Split Residual Steps Network (Channel-Split RSN). First, a channel-split block splits the feature channels, extracts features from each split by convolution, and fuses the results to obtain a rich feature representation. Next, a feature enhancement block divides the feature channels into further groups and applies a different processing strategy to each group, which reduces similar features within the feature channels. Finally, an improved spatial attention mechanism is combined with the pose refine machine to give Context-PRM, a pose refine machine based on feature spatial correlation that yields more accurate pose estimates. On the COCO test-dev dataset, the method reaches 75.9% AP at 55.36 FPS, with a model size (Params) of only 18.3 M. Compared with the original RSN18 and RSN50, AP improves by 5 and 3.4 percentage points, respectively, and FPS is 12.08 higher than that of RSN50. On the more challenging CrowdPose dataset, the method reaches 66.9% AP at 19.16 FPS, an AP improvement of 4.6 percentage points over RSN18. The method therefore improves the accuracy of human pose estimation while keeping recognition fast. The source code is available at https://github.com/qdd1234/Channel-Split-RSN.
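The split, convolve, and fuse idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked above for that); it assumes equal contiguous channel groups, a 1x1 convolution per group, and fusion by concatenation, and the names `split_channels`, `pointwise_conv`, and `channel_split_block` are hypothetical.

```python
from typing import List

Channel = List[float]        # one channel, spatial positions flattened
FeatureMap = List[Channel]   # channels-first layout: [C][H*W]


def split_channels(x: FeatureMap, groups: int) -> List[FeatureMap]:
    """Split the channel axis into `groups` equal, contiguous slices."""
    if groups <= 0 or len(x) % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    size = len(x) // groups
    return [x[i * size:(i + 1) * size] for i in range(groups)]


def pointwise_conv(x: FeatureMap, weight: List[List[float]]) -> FeatureMap:
    """1x1 convolution: each output channel is a weighted sum of input channels."""
    positions = len(x[0])
    return [
        [sum(w * ch[p] for w, ch in zip(row, x)) for p in range(positions)]
        for row in weight
    ]


def channel_split_block(x: FeatureMap,
                        weights: List[List[List[float]]]) -> FeatureMap:
    """Split channels, convolve each group separately, then concatenate (fuse).

    Assumption: fusion is plain concatenation; the paper may fuse differently.
    """
    parts = split_channels(x, len(weights))
    fused: FeatureMap = []
    for part, weight in zip(parts, weights):
        fused.extend(pointwise_conv(part, weight))
    return fused


# Toy input: 4 channels, 2 spatial positions each.
x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
# Two groups of 2 channels: group 0 uses identity weights,
# group 1 swaps its two channels.
weights = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
y = channel_split_block(x, weights)
```

In this toy case the first group passes through unchanged and the second group's channels are swapped, which shows that each group is transformed independently before the outputs are joined back into one feature map.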

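Authors: ZHOU Kun-yang (周昆阳); ZHAO Meng-ting (赵梦婷); ZHANG Hai-chao (张海潮); SHAO Ye-qin (邵叶秦) (School of Zhang Jian, Nantong University, Nantong 226019, China; School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China)

Affiliations: School of Zhang Jian, Nantong University; School of Transportation and Civil Engineering, Nantong University

Source: Computer and Modernization (《计算机与现代化》), 2021, No. 12, pp. 27-36, 42 (11 pages in total)

Funding: National Natural Science Foundation of China, General Program (61671255); Jiangsu Province College Students' Innovation Training Program (201910304158H, 202010304180H, 202010304122Y).

Keywords: Channel-Split RSN; human pose estimation; channel-split block; feature enhancement block; Context-PRM

Classification number: TP391.41 [Automation and Computer Technology: Computer Application Technology]; TH7 [Mechanical Engineering: Precision Instruments and Machinery]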



