期刊文献+

基于通道切分的人体姿态估计算法

Human Pose Estimation Algorithm Based on Channel Splitting
下载PDF
导出
摘要 为了提高人体姿态估计的准确率和识别速度,提出一种基于通道切分的人体姿态估计算法Channel-Split Residual Steps Network(Channel-Split RSN)。首先,提出通道切分模块,对切分后的特征通道通过卷积提取特征再融合起来,以获得丰富的特征表示。接着,引入特征增强模块,对特征通道进一步分组,并对不同的分组采取不同的处理策略,以减少特征通道内的相似特征。最后,结合改进的空间注意力机制,提出一种基于特征空间相关性的姿态修正机Context-PRM,得到更加准确的人体姿态估计。在COCO test-dev数据集上的实验结果表明,本文方法达到75.9%的AP和55.36的FPS,并且模型的大小Params(M)仅为18.3。相较于传统的RSN18和传统的RSN50,模型的AP分别提高了5和3.4个百分点,FPS比传统的RSN50快12.08。在更具挑战性的CrowdPose数据集上,本文方法达到66.9%的AP和19.16的FPS,相较于RSN18,AP提高了4.6个百分点。有效提高了人体姿态估计的准确率,且模型具有较快的识别速度。本文源代码公开在https://github.com/qdd1234/Channel-Split-RSN。 To improve the accuracy and speed of human pose estimation,a channel-split-based human pose estimation algorithm,named Channel-Split Residual Steps Network(Channel-Split RSN),is proposed.First of all,channel-split blocks are proposed to apply convolution operation for split feature in order to obtain rich feature representation.Then,feature enhancement blocks are introduced to further split feature channel and employ different strategies for different groups which can reduce similar features in feature channels.Finally,to further enhance the pose refine machine in Channel-Split RSN,combined with improved spatial attention mechanism,a pose refine machine based on feature spatial correlation,named Context-PRM,is proposed.Experimental results show that on the COCO test-dev dataset,our algorithm reaches 75.9%AP and 55.36 FPS,and the Params(M)of the model is only 18.3.Compared with the traditional RSN18 and RSN50,the AP of the model is improved by 5 and 3.4 percentage points,respectively.FPS is 12.08 faster than the traditional RSN50.On the more challenging CrowdPose dataset,our approach achieves 66.9%AP and 19.16 FPS,an AP improvement of 4.6 percentage points compared to RSN18,which effectively improves the accuracy of human pose estimation and the model has a faster recognition speed.Our source code is available at https://github.com/qdd1234/Channel-Split-RSN.
作者 周昆阳 赵梦婷 张海潮 邵叶秦 ZHOU Kun-yang;ZHAO Meng-ting;ZHANG Hai-chao;SHAO Ye-qin(School of Zhang Jian,Nantong University,Nantong 226019,China;School of Transportation and Civil Engineering,Nantong University,Nantong 226019,China)
出处 《计算机与现代化》 2021年第12期27-36,42,共11页 Computer and Modernization
基金 国家自然科学基金面上项目(61671255) 江苏省大学生创新训练计划项目(201910304158H,202010304180H,202010304122Y)。
关键词 Channel-Split RSN 人体姿态估计 通道切分模块 特征增强模块 Context-PRM Channel-Split RSN human pose estimation channel-split block feature enhancement block Context-PRM
  • 相关文献

参考文献5

二级参考文献28

  • 1MASOUD O, PAPANIKOLOPOULOS N. A method for human action recognition [ J ]. Image and Vision Computing, 2003, 21 (8) : 729-743.
  • 2ADAM A, RIVLIN E, SHIMSHONI I, et al. Robust real-time unusual event detection using multiple fixed- location monitors [ J ]. IEEE Pattern Analysis and Machine Intelligence, 2008, 30(3) : 555-560.
  • 3BENEZETH Y, JODOIN P M, SALIGRAMA V. Abnormality detection using low-level co-occurring events[ J]. Pattern Recognition Letters, 2011, 32 ( 3 ) : 423-431.
  • 4IWASHITA Y, TAKAKI S, MOROOKA K, et al. Abnormal behavior detection using privacy protected videos [ C ]. IEEE Emerging Security Technologies (EST), 2013 : 55-57.
  • 5BOUMA H, BAAN J, BURGHOUTS G J, et al. Automatic detection of suspicious behavior of pickpockets with track-based features in a shopping mall [ C ]. International Society for Optics and Photonics, 2014: 92530F-92530F-9.
  • 6VISHWAKARMA D K, KAPOOR R, MAI-IESHWARI R, et al. Recognition of abnormal human activity using the changes in orientation of silhouette in key frames[C]. IEEE Computing for Sustainable Global Development, 2015: 336-341.
  • 7BENENSON R, MATHIAS M, TUYTELAARS T, et al. Seeking the strongest rigid detector[C]. Computer Vision and Pattern Recognition, 2013: 3666-3673.
  • 8DALAL N, TRIGGS B. Histograms of oriented gradients for human detection [ C ]. IEEE Computer Vision and Pattern Recognition, 2005 : 886-893.
  • 9FREUND Y, SCHAPIRE R E. A decision-theoretic generalization of on-line learning and an application to boosting[ J]. Journal of computer and system sciences, 1997, 55(1) : 119-139.
  • 10RAMANAN D. Learning to parse images of articulated bodies [ C ]. Advances in Neural Information Processing Systems, 2006 : 1129-1136.

共引文献82

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部