期刊文献+

融合权重自适应损失和注意力的人体姿态估计 被引量:2

Human Pose Estimation FusingWeight Adaptive Loss and Attention
下载PDF
导出
摘要 在自底向上人体姿态估计方法中存在前景和背景样本不平衡的问题,同时高分辨率网络在特征提取和特征融合时不能有效获得通道信息和空间位置信息。针对以上问题,提出以高分辨率网络(HigherHRNet)为基础融合权重自适应和注意力的自底向上人体姿态估计网络WA-HRNet(weight-adaptive fusing attention HRNet)。提出权重自适应损失函数,自适应调整不同区域的损失权重,使得HigherHRNet训练时更加关注人体关键点中心区域;同时为了获取丰富的全局信息进一步定位关键点区域,提出高效全局注意力,加强关键点中心区域的表征;引入热力图分布调制,提高热力图解码关键点位置的准确性。在CrowdPose数据集以及COCO2017数据集上的实验表明,与基线HigherHRNet相比,WA-HRNet在CrowdPose测试集上AP值提升了5.8个百分点,在COCO2017测试集上AP值提升了1.8个百分点达到了72.3%,优于其他自底向上人体姿态估计主流算法。 There is an imbalance between foreground and background samples in the bottom-up human pose estimation method.Meanwhile,the high-resolution network cannot effectively obtain channel information and spatial location infor-mation during feature extraction and feature fusion.To address these problems,this paper presents WA-HRNet(weight-adaptive fusing attention HRNet):a bottom-up human pose estimation network based on the high-resolution network(HigherHRNet).Firstly,a weight-adaptive loss function is proposed to adaptively adjust the loss weight of different regions,so that HigherHRNet pays more attention to the central region of human key points during training.At the same time,in order to obtain rich global information and further locate the keypoint area,efficient global attention is proposed to strengthen the representation of the central area of the keypoint.Finally,heatmap distribution modulation is introduced to improve the accuracy of decoding keypoint locations in the heatmap.Experiments conducted on the CrowdPose dataset as well as the COCO2017 dataset show that WA-HRNet improves its AP value by 5.8 percentage points on the CrowdPose test set and 1.8 percentage points on the COCO2017 test-dev set to 72.3%compared to the baseline HigherHRNet,outper-forming other mainstream algorithms for bottom-up human pose estimation.
作者 江春灵 曾碧 姚壮泽 邓斌 JIANG Chunling;ZENG Bi;YAO Zhuangze;DENG Bin(School of Computer Science,Guangdong University of Technology,Guangzhou 510006,China)
出处 《计算机工程与应用》 CSCD 北大核心 2023年第18期145-153,共9页 Computer Engineering and Applications
基金 国家自然科学基金(62172111) 广东省自然科学基金(2021A1515012233)。
关键词 人体姿态估计 自底向上 注意力 高分辨率网络 human pose estimation bottom-up attention high-resolution network
  • 相关文献

参考文献4

二级参考文献9

共引文献68

同被引文献11

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部