Abstract
In recent years, estimating camera pose from visual information to localize unmanned vehicles has become a research hotspot, and visual odometry is a key component of this task. Traditional visual odometry requires a complex pipeline of feature extraction, feature matching, and back-end optimization, which makes it difficult to reach an optimal solution. Therefore, a visual odometry method combining attention and a long short-term memory (LSTM) network is proposed. A convolutional network enhanced by an attention mechanism extracts motion features from inter-frame changes, and an LSTM network then performs temporal modeling; given an input sequence of RGB images, the model outputs poses end to end. Experiments were conducted on the public KITTI autonomous-driving dataset and compared against other algorithms. Results show that the method's pose-estimation error is lower than that of other monocular algorithms, and qualitative analysis shows that the algorithm generalizes well.
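The pipeline the abstract describes (an attention-enhanced convolutional network over stacked consecutive frames, followed by an LSTM for temporal modeling and a pose regressor) can be sketched in simplified NumPy form. All layer sizes, weights, and the squeeze-and-excitation-style attention gate below are illustrative placeholders, not the paper's actual architecture:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat):
    """Toy channel-attention gate: reweight feature channels by their
    global average response (a stand-in for the paper's attention module)."""
    squeeze = feat.mean(axis=(1, 2))          # (C,) per-channel summary
    gate = sigmoid(squeeze - squeeze.mean())  # (C,) weights in (0, 1)
    return feat * gate[:, None, None]

def extract_motion_features(frame_pair, conv_w):
    """Toy 'CNN': one 3x3 valid convolution over a stacked frame pair,
    then ReLU, channel attention, and global average pooling."""
    C_out, C_in, k, _ = conv_w.shape
    H = frame_pair.shape[1] - k + 1
    W = frame_pair.shape[2] - k + 1
    out = np.zeros((C_out, H, W))
    for c in range(C_out):
        for i in range(H):
            for j in range(W):
                out[c, i, j] = np.sum(conv_w[c] * frame_pair[:, i:i+k, j:j+k])
    out = np.maximum(out, 0.0)                # ReLU
    out = channel_attention(out)
    return out.mean(axis=(1, 2))              # (C_out,) motion-feature vector

def lstm_step(x, h, c, W, U, b):
    """One step of a standard LSTM cell (gate order: i, f, o, g)."""
    z = W @ x + U @ h + b
    n = h.size
    i, f, o = sigmoid(z[:n]), sigmoid(z[n:2*n]), sigmoid(z[2*n:3*n])
    g = np.tanh(z[3*n:])
    c = f * c + i * g
    h = o * np.tanh(c)
    return h, c

# Hypothetical dimensions: 4 grayscale frames of 8x8, 6-DoF pose output.
frames = rng.standard_normal((4, 8, 8))
conv_w = rng.standard_normal((8, 2, 3, 3)) * 0.1  # 8 filters over a 2-frame stack
n_hidden = 16
W = rng.standard_normal((4 * n_hidden, 8)) * 0.1
U = rng.standard_normal((4 * n_hidden, n_hidden)) * 0.1
b = np.zeros(4 * n_hidden)
head = rng.standard_normal((6, n_hidden)) * 0.1   # linear head to a 6-DoF pose

h = np.zeros(n_hidden)
c = np.zeros(n_hidden)
poses = []
for t in range(len(frames) - 1):
    pair = np.stack([frames[t], frames[t + 1]])   # stack consecutive frames
    feat = extract_motion_features(pair, conv_w)
    h, c = lstm_step(feat, h, c, W, U, b)
    poses.append(head @ h)                        # one relative pose per pair

print(len(poses), poses[0].shape)
```

The LSTM carries hidden state across frame pairs, so each predicted relative pose can depend on the whole motion history rather than a single pair, which is the motivation for the temporal-modeling stage.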
Authors
RUAN Xiaogang; YU Pengcheng; ZHU Xiaoqing (Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100124, China)
Source
Journal of Beijing University of Technology
Indexed in: CAS, CSCD, Peking University Core Journals
2021, Issue 8, pp. 815-823, 924 (10 pages in total)
Funding
National Natural Science Foundation of China (61773027)
Beijing Natural Science Foundation (4202005)
Keywords
deep learning
attention mechanism
sequence modeling
visual odometry
pose estimation
symmetric network