

Deep Visual Odometry Based on Image Alignment and Uncertainty Estimation
Abstract: Deep visual odometry (DVO) methods based on deep learning use neural networks to directly estimate the depth of monocular images and the camera motion between adjacent frames, greatly improving running speed while maintaining accuracy. However, these methods rely on the grayscale-invariance (brightness-constancy) assumption, which is strong and often violated in real scenes. To address this, a self-supervised direct visual odometry method based on image alignment (IA), named AUDVO (aligned U-CNN deep VO), is proposed: an uncertainty estimation network (uncertainty CNN, U-CNN) introduces regularization terms that make the estimated results more robust. To handle holes caused by inaccurate estimation in large texture-less regions, a super-resolution network is embedded in the depth estimation module for upsampling, instead of a simple interpolation operation. Experiments on the public KITTI dataset demonstrate the effectiveness of AUDVO for single-view depth estimation and camera pose estimation.
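As an illustrative sketch of the idea described in the abstract (not the paper's actual loss, which is not given here; the Gaussian negative-log-likelihood form and all constants below are assumptions), a per-pixel uncertainty map can down-weight photometric residuals at pixels that violate brightness constancy, while a log-sigma term keeps the network from inflating uncertainty everywhere:

```python
import numpy as np

def photometric_nll(target, warped, sigma):
    """Gaussian negative log-likelihood of the brightness-constancy
    residual: each pixel's squared error is down-weighted by its
    predicted uncertainty sigma, while log(sigma) penalizes blanket
    uncertainty inflation. Illustrative form only."""
    residual = target - warped
    return (residual**2 / (2.0 * sigma**2) + np.log(sigma)).mean()

# Toy example: one pixel violates brightness constancy (e.g. a
# specular highlight). Letting the uncertainty map absorb it lowers
# the total loss, so a U-CNN-style network is rewarded for flagging
# such pixels instead of forcing pose/depth to explain them.
target = np.zeros((4, 4))
warped = target.copy()
warped[0, 0] = 0.5                  # outlier residual at one pixel
sigma_flat = np.full((4, 4), 0.05)  # uniform: no uncertainty model
sigma_adapt = sigma_flat.copy()
sigma_adapt[0, 0] = 0.5             # high uncertainty at the outlier
assert photometric_nll(target, warped, sigma_adapt) < \
       photometric_nll(target, warped, sigma_flat)
```

In training, `sigma` would be a network output and the same weighting acts as the regularization term: pixels the model marks as unreliable contribute less photometric error, at the cost of the `log(sigma)` penalty.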
Authors: QIN Chao, YAN Zifei (Department of Media Technology and Art, School of Architecture, Harbin Institute of Technology; Key Laboratory of Interactive Media Design and Equipment Service Innovation, Ministry of Culture and Tourism, Harbin 150001, China)
Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD and Peking University core journal, 2022, No. 22, pp. 101-107
Funding: National Natural Science Foundation of China General Program (61872118); Key Laboratory Project of the Ministry of Culture and Tourism
Keywords: visual odometry; deep learning; uncertainty estimation network

