摘要
为解决人体姿态估计任务的准确率和实时性问题,提出一个卷积宽接收域、检测实时的人体姿态估计网络。构建稠密残差步进网络(dense residual steps network,DRSN),提高模型对输入图像空间信息的提取和全局特征的把握。在激活函数上,以改进的FReLU激活函数替换原始的激活函数,通过采用二维卷积的方式改变ReLU函数中的激活条件,扩大模型的接收域,关键点分类更加准确。该网络在标准MPII数据集上进行测试,在满足较高定位精度的条件下,模型在NVIDIA RTX 2080Ti GPU上的检测速度达到38 FPS,可有效解决检测实时性问题。
To solve the real-time and accuracy problem of human pose estimation task,a human body pose estimation network with wide convolutional receiving domain and real-time detection model was proposed.A dense residual step network(DRSN)was constructed to improve the model’s ability to extract the spatial information of the input image and grasp the global features.For the activation function,the original activation function was replaced using the improved FReLU activation function,and the activation conditions in the ReLU function were changed using a two-dimensional convolution method,which expanded the receiving domain of the model,and the key point classification was more accurate.This network was tested on the standard MPII data set.Under the condition of high positioning accuracy,the detection speed of the model on NVIDIA RTX 2080Ti GPU reaches 38 FPS,which can effectively solve the real-time detection problem.
作者
苟先太
陶明江
李欣
康立烨
金炜东
GOU Xian-tai;TAO Ming-jiang;LI Xin;KANG Li-ye;JIN Wei-dong(School of Electrical Engineering,Southwest Jiaotong University,Chengdu 611756,China;Institute of Atomic and Molecular Physics,Sichuan University,Chengdu 610065,China;China-ASEAN International Joint Laboratory of Integrated Transport,Nanning University,Nanning 530200,China)
出处
《计算机工程与设计》
北大核心
2023年第1期247-254,共8页
Computer Engineering and Design
基金
广西科技基地和人才专项基金项目(桂科AD20297125)。
关键词
姿态估计
FReLU激活函数
宽接收域
稠密残差步进网络
二维卷积激活
pose estimation
FReLU activation function
wide receiving domain
dense residual step network
two-dimensional convolution activation