基于局部注意力和位姿迭代优化的自监督单目深度估计算法被引量：3

A Self-supervised Monocular Depth Estimation Algorithm Based on Local Attention and Iterative Pose Refinement

下载PDF

导出

摘要自监督单目深度估计在自动驾驶、智能制造等领域有着广泛的应用。然而由于自监督训练存在大量训练噪声,其估计精度受到了极大限制。针对自监督单目深度估计算法中深度估计精度有限的问题,本文提出了一种基于局部注意力机制和迭代调优的自监督单目深度估计框架。首先,对于深度估计网络,基于局部像素间深度值的高度相关性,本文设计了一种局部注意力机制来融合高分辨率特征图的局部特征,提升深度估计的准确性;其次,对于位姿估计网络,本文设计了一种迭代调优的位姿估计结构,利用残差优化的方式降低位姿估计难度,提升位姿估计的准确性进而提升深度估计网络的性能。实验表明,本文提出的改进自监督单目深度估计算法有效提升了深度估计的精度。 Self-supervised monocular depth estimation is widely used in many areas,such as autonomous driving and intelligent manufacturing. However,due to the large amount of training noise in self-supervised training,the accuracy of self-supervised monocular depth estimation is limited. To improve the performance of self-supervised monocular depth estimation algorithm,we proposed a modified self-supervised monocular depth estimation algorithm based on local attention mechanism and iterative pose refinement. First,for the depth estimation network,we proposed a local attention mechanism,which is based on the high correlation between the depth of pixels in a local patch,to fuse features of highresolution feature map. Second,for the pose estimation network,we proposed an iterative refinement based architecture,which decreases the pose estimation difficulty with residual optimization and improves the pose estimation accuracy to benefit the depth estimation network. Experiments shown that,the proposed modified self-supervised monocular depth estimation algorithm significantly improves the depth estimation accuracy.

作者赵霖赵滟靳捷 ZHAO Lin;ZHAO Yan;JIN Jie(China Aerospace Academy of Systems Science and Engineering,Beijing 100048,China)

机构地区中国航天系统科学与工程研究院

出处《信号处理》 CSCD 北大核心 2022年第5期1088-1097,共10页 Journal of Signal Processing

基金装备发展部快速转化项目(8091C21)。

关键词单目深度估计自监督学习深度学习 monocular depth estimation self-supervised learning deep learning

分类号 TP751 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献1

1方智文,曹治国,肖阳.深度图像的目标潜在区域提取算法[J].信号处理,2016,32(2):193-202. 被引量：10

二级参考文献20

1Dalai N, Triggs B. Histograms of oriented gradients fi~r hu- man detection [ C 1//Computer Vision and Pattern Rec.<~li- tion, 2005 IEEE Conference on. IEEE 2005 : 886-893.
2Felzenszwalb P, McAllester D, Ramanan D. A discrimina- tively trained, muhiscale, deformable part model ~ C ] // C~mlputer Vision and Pattern Recognition, ;2008. CVPR2008. IEEE Conference on. IEEE, 2008: 1-8.
3(;'all J, LempitskyV. Class-specilic hough forests for object detection[ C ]//Computer Vision and Pattern Recognition, 2009 IEEE Conference on. IEEE, 2009: 1022-1029.
4S('hulter S, l,eistner C, ~'ohlhart P, el al. Accurate Ob- ject Detection with Joint Classification-Regression Ran- dom Forests[ C] S/Computer Vision and Pattern Ree~gni- tion. IEEE, 2014: 923-930.
5Xia L, Chen C C, Aggarwal J K. Human detection using depth inf()rmation by kinect [ C ]//Computer Visinn and Pattern Recognition Workshops (CVPRW), 2011 IEEE Conference on. IEEE, 2011 : 15-22.
6Tang S, Wang X, Lv X, et al. Histogram of oriented norn'.al vectors for object recognition with a depth sensor[ C]//Asi- an Conference on Computer Vision, 2013: 525-538.
7Oreifej O, Liu Z. Hon4d: Histogram of oriented 4(t nor- mals for activity recognition fi'om depth sequences [ C ]// Computer Vision and Pattern Recognition, 2013 IEEE Conference on. IEEE, 2013: 716-723.
8Gupta S, Girshick R, Arl)el~iez P, et al. l,earning rich fea- tures from RGB-D images for object detection and segmen- tation [ C ] //European Conference oll Computer Vision 2014. Springer International Publishing, 2014: 345-360.
9Alexe B, Deselaers T, [ C] //Computer Vision IEEE Conference on. IE Hongwen Kang, Hebert, Driven Objectness[ J ]. P telligence, 2015, 37(1.
10Uijlings J R R. van de Ferrari V. What is an o[).iect? and Pattern Recognition, 2010 EE, 2010: 73-80.

共引文献9

1赵欣,周海英.一种结合深度信息的人体行为识别方法[J].科学技术与工程,2017,17(1):244-249. 被引量：4
2何晓军,徐爱功,李玉.基于VGA聚类的遥感影像道路提取[J].计算机仿真,2018,35(5):288-293. 被引量：2
3何晓军,徐爱功,李玉.利用HSI空间相似性的彩色形态学图像处理方法[J].计算机科学,2019,46(4):285-292. 被引量：10
4何晓军,徐爱功,李玉.基于CM的高分辨率遥感影像目标边缘提取[J].计算机仿真,2019,36(3):333-338. 被引量：6
5何晓军,徐爱功,李玉.基于矢量相似性的多元滤波方法研究[J].计算机应用研究,2019,36(10):3132-3136.
6何晓军,李玉,徐爱功.基于主成分分析的多光谱形态学遥感影像解译方法[J].辽宁工程技术大学学报（自然科学版）,2018,37(6):913-919. 被引量：1
7鲁光男.基于交互式视景的虚拟现实单目深度信息提取[J].计算机仿真,2020,37(12):382-385. 被引量：1
8刘瑶,赵慧,伍世虔,陈彬.Bin-Picking中无纹理工件的分割[J].机械设计与制造,2022(9):278-281.
9何晓军,徐爱功,李玉.基于模糊相似性的彩色形态学图像处理方法[J].计算机应用研究,2019,36(1):258-263. 被引量：12

同被引文献18

1吴凡路,刘建军,任鑫,李春来.基于圆形标志点的深空探测全景相机标定方法[J].光学学报,2013,33(11):139-145. 被引量：27
2吴泽俊,吴庆阳,张佰春.一种新的基于球面模型的鱼眼镜头标定方法[J].中国激光,2015,42(5):226-233. 被引量：11
3潘德伦,冀隽,张跃进.基于运动矢量空间编码的视频监控动态目标检测方法[J].吉林大学学报（工学版）,2021,51(4):1370-1374. 被引量：10
4张志远,杨帆.结合多注意力机制的自监督目标跟踪[J].计算机工程与设计,2021,42(12):3502-3509. 被引量：2
5董桂官,吴双彤,张汉琦.基于卷积神经网络的全景图像超分辨率算法[J].电脑与信息技术,2022,30(2):1-4. 被引量：1
6张平,关丽红.基于概率统计的多维关联数据动态挖掘仿真[J].计算机仿真,2022,39(3):402-406. 被引量：1
7杨静,张灿龙,李志欣,唐艳平.集成空间注意力和姿态估计的遮挡行人再辨识[J].计算机研究与发展,2022,59(7):1522-1532. 被引量：3
8吴岸聪,林城梽,郑伟诗.面向跨模态行人重识别的单模态自监督信息挖掘[J].中国图象图形学报,2022,27(10):2843-2859. 被引量：7
9张方方,曹家晖,王海静,赵鹏博.基于多特征自适应融合的抗遮挡目标跟踪算法[J].红外技术,2023,45(2):150-160. 被引量：1
10张涛,张晓利,任彦.Transformer与CNN融合的单目图像深度估计[J].哈尔滨理工大学学报,2022,27(6):88-94. 被引量：3

引证文献3

1余伟群,刘佳涛,张亚萍.融合注意力的拉普拉斯金字塔单目深度估计[J].图学学报,2023,44(4):728-738.
2周艳秋,高宏伟,何婷,辛春花.电子监控部分遮挡目标单模态自监督信息挖掘技术[J].现代电子技术,2024,47(10):47-51.
3陈思喜,张延吉,李建微.基于自监督深度学习的全景图像深度估计研究[J].电视技术,2024,48(3):34-38.

1陈正升,王雪松,程玉虎.考虑扰动与输入饱和的机械臂连续非奇异快速终端滑模控制[J].控制与决策,2022,37(4):903-912. 被引量：11
2张书颖,陈适之,韩万水,吴刚.基于集成学习的FRP加固混凝土梁抗弯承载力预测研究[J].工程力学,2022,39(8):245-256. 被引量：14
3刘艳辉,黄俊宝,肖锐铧,方然可.基于随机森林的福建省区域滑坡灾害预警模型研究[J].工程地质学报,2022,30(3):944-955. 被引量：8

信号处理

2022年第5期

浏览历史

内容加载中请稍等...

基于局部注意力和位姿迭代优化的自监督单目深度估计算法被引量：3

参考文献1

二级参考文献20

共引文献9

同被引文献18

引证文献3

相关作者

相关机构

相关主题

浏览历史

基于局部注意力和位姿迭代优化的自监督单目深度估计算法 被引量：3

参考文献1

二级参考文献20

共引文献9

同被引文献18

引证文献3

相关作者

相关机构

相关主题

浏览历史

基于局部注意力和位姿迭代优化的自监督单目深度估计算法被引量：3