基于特征融合和注意力机制的物体6D姿态估计算法

Object 6D Pose Estimation Algorithm Based on Feature Fusion andAttention Mechanism

下载PDF

导出

摘要针对物体6D姿态估计易受目标物体的弱纹理和小体积特性、复杂背景、遮挡的影响,提出一种结合特征融合和注意力机制的物体6D姿态估计算法。首先,在RGB图像特征提取网络的首个卷积块中加入卷积注意力模块,提升弱纹理小物体的区域显著度;其次,在基于编解码结构的RGB图像特征提取网络中引入基于卷积注意力模块的跳跃连接,有效地将编码阶段的颜色、纹理等细节外观特征融合到解码阶段的姿态语义特征中,弥补姿态语义特征缺乏细节外观特征的问题;然后,使用通道注意力模块改进池化金字塔模块,增强目标物体可见区域与遮挡区域的联系,提升遮挡鲁棒性;最后,使用卷积注意力模块重构解码阶段输出的姿态语义特征,增强相似表面特征的区分度,从而降低外观相似物体对物体6D姿态估计的干扰。实验结果表明,该算法在Occlusion LINEMOD数据集和LINEMOD数据集上ADD(-S)指标分别达到73.4%和99.8%,与FFB6D相比,分别提升7.8百分点和0.1百分点,验证了该算法的可行性。 Object 6D pose estimation is easily affected by the weak texture and small volume characteristics of the target object,complex background,and occlusion.To solve the above problems,an object 6D pose estimation algorithm combining feature fusion and attention mechanism is proposed.First of all,the Convolutional Block Attention Module is added to the first convolution module of the RGB image feature extraction network to improve the regional saliency of small objects with weak texture.Secondly,the skip connection based on Convolutional Block Attention Module is introduced into the RGB image feature extraction network based on the encoder-decoder structure,which effectively fuses the detailed appearance features containing color,texture and others in the coding stage into the pose semantic features in the decoding stage to make up for the lack of detailed appearance features in the pose semantic features.Then,the Channel Attention Module is used to improve the Pyramid Pooling Module to enhance the connection between the visible area of the target object and the occluded area,and improve the occlusion robustness.Finally,the Convolutional Block Attention Module is used to reconstruct the features in the decoding stage rich in pose semantic information,so as to enhance the discrimination of similar surface features,thus reducing the interference of similar appearance objects on object 6D pose estimation.The experimental results show that the ADD(-S)index of the algorithm on Occlusion LINEMOD dataset and LINEMOD dataset reaches 73.4%and 99.8%respectively,which are 7.8 percentage points and 0.1 percentage points higher than that of FFB6D respectively,verifying the feasibility of the algorithm.

作者高维东林琳刘贤梅赵娅 GAO Wei-dong;LIN Lin;LIU Xian-mei;ZHAO Ya(School of Computer and Information Technology,Northeast Petroleum University,Daqing 163318,China)

机构地区东北石油大学计算机与信息技术学院

出处《计算机技术与发展》 2023年第12期92-100,共9页 Computer Technology and Development

基金黑龙江省教育科学“十四五”规划重点课题(GJB1421114) 黑龙江省自然科学基金项目(LH2020F003) 黑龙江省高等教育教学改革重点委托项目(SJGZ20200037)。

关键词物体6D姿态估计深度学习特征融合注意力机制跳跃连接 object 6D pose estimation deep learning feature fusion attention mechanism skip connection

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1马天,蒙鑫,牟琦,李占利,何志强.基于特征融合的6D目标位姿估计算法[J].计算机工程与设计,2023,44(2):563-569. 被引量：4
2王太勇,孙浩文.基于关键点特征融合的六自由度位姿估计方法[J].天津大学学报（自然科学与工程技术版）,2022,55(5):543-551. 被引量：7
3马康哲,皮家甜,熊周兵,吕佳.融合注意力特征的遮挡物体6D姿态估计[J].计算机应用,2022,42(12):3715-3722. 被引量：2
4黄榕彬.基于位置依赖的密集融合的6D位姿估计方法[J].现代信息科技,2020,4(22):16-19. 被引量：1

二级参考文献4

1骆健,蒋旻,刘星,周龙.基于多模态深度学习的RGB-D物体识别[J].计算机工程与设计,2017,38(6):1624-1629. 被引量：6
2苏杰,张云洲,房立金,李奇,王帅.基于多重几何约束的未知物体抓取位姿估计[J].机器人,2020,42(2):129-138. 被引量：17
3梁达勇,陈俊洪,朱展模,黄可思,刘文印.多特征像素级融合的遮挡物体6DoF姿态估计研究[J].计算机科学与探索,2020,14(12):2072-2082. 被引量：2
4李坤,侯庆.基于注意力机制的轻量型人体姿态估计[J].计算机应用,2022,42(8):2407-2414. 被引量：7

共引文献9

1李雨龙,陈松,李鑫,李昌龙,赵耀耀,李顺.基于深度学习的管件识别与位姿估计研究[J].制造技术与机床,2022(12):70-75. 被引量：3
2薛珊,卢涛,吕琼莹,曹国华.基于多尺度融合和轻量化网络的无人机目标检测算法[J].湖南大学学报（自然科学版）,2023,50(8):82-93. 被引量：2
3王太勇,于恩霖.基于三维关键点投票的物体位姿估计方法[J].天津大学学报（自然科学与工程技术版）,2024,57(3):291-300.
4于建均,刘耕源,于乃功,龚道雄,冯新悦.轻量化改进XYZNet的RGB-D特征提取网络[J].计算机应用研究,2024,41(2):616-622.
5余娜,何国荣,晁阳,李培东.工业机器人装配中基于相机位姿估计算法的单目视觉定位研究[J].微型电脑应用,2024,40(4):85-88.
6邴雅星,王阳萍,雍玖,白浩谋.基于筛选学习网络的六自由度目标位姿估计算法[J].计算机应用,2024,44(6):1920-1926.
7蒋珺阳,吴晶华,赵娜娜.基于大核注意力改进的工业管件位姿估计[J].武汉工程大学学报,2024,46(3):304-309.
8葛泉波,李凯,张兴国.基于多关键点检测加权融合的无人机相对位姿估计算法[J].自动化学报,2024,50(7):1402-1416.
9张亚炜,付东翔.基于双向融合纹理和深度信息的目标位姿检测[J].数据采集与处理,2024,39(5):1214-1227.

1牛艺婷,郭超,卢俊,郭海涛,林雨准.联合密集连接与注意力的遥感影像变化检测[J].测绘与空间地理信息,2023,46(12):33-37. 被引量：1
2眼镜店自制视力表:哗众取宠,不可取[J].中国眼镜科技杂志,2023(12):86-86.
3Wen WU,Fei JI,Shujuan HU,Yongli HE.Asymmetric Drying and Wetting Trends in Eastern and Western China[J].Advances in Atmospheric Sciences,2024,41(2):221-232. 被引量：1
4文渊博,高涛,陈婷,张千禧.频率引导的双稀疏自注意力单图像去雨算法[J].电子学报,2023,51(10):2812-2820. 被引量：1

计算机技术与发展

2023年第12期

浏览历史

内容加载中请稍等...

基于特征融合和注意力机制的物体6D姿态估计算法

参考文献4

二级参考文献4

共引文献9

相关作者

相关机构

相关主题

浏览历史