Monocular depth estimation algorithm based on multi-scale feature fusion (基于多尺度特征融合的单目深度估计算法)

Abstract: In current monocular depth estimation algorithms, stacked convolutional layers and excessive downsampling operations cause a loss of feature-map resolution and high-level information, degrading the overall accuracy of the depth map. To address this problem, this paper proposes a monocular depth estimation algorithm based on multi-scale feature fusion. The model adopts a progressive encoder-decoder structure that extracts information at different scales from shallow to deep levels, and features of different resolutions from different levels are connected together to form a multi-scale feature fusion structure. The encoder follows the U^(2)-Net design and incorporates Vision Transformer modules internally, giving the model a global receptive field during encoding while avoiding downsampling operations, thereby reducing the loss of feature-map resolution and high-level information. The decoder is built from U-shaped residual blocks, which better fuse the multi-scale features within each stage. Experiments on the KITTI and NYU-Depth V2 datasets show that the proposed algorithm outperforms most algorithms of the same type across the evaluation metrics.
Author: ZHOU Xiaoji (周晓吉), School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou 310018, China
Source: Intelligent Computer and Applications (《智能计算机与应用》), 2024, No. 9, pp. 34-40 (7 pages)
Keywords: monocular depth estimation; encoder-decoder structure; Vision Transformer; U^(2)-Net
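
The abstract describes two reusable ideas: concatenating features of different resolutions from different stages, and decoding with U-shaped residual blocks in the style of U^(2)-Net. The paper's own implementation is not reproduced here; the following PyTorch sketch is only an illustration under assumed layer sizes and channel counts, with the internal pooling of the block modeled on the original U^(2)-Net RSU design rather than on this paper.

```python
# Minimal sketch (not the authors' code) of multi-scale feature fusion and a
# U-shaped residual block. All module names and dimensions are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ConvBNReLU(nn.Module):
    """3x3 convolution + batch norm + ReLU, the basic unit of the block."""
    def __init__(self, in_ch, out_ch, dilation=1):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, 3, padding=dilation, dilation=dilation)
        self.bn = nn.BatchNorm2d(out_ch)

    def forward(self, x):
        return F.relu(self.bn(self.conv(x)))


class UShapedResidualBlock(nn.Module):
    """A small encoder-decoder inside one block, with the block input added
    back to the output (residual connection), as in U^(2)-Net-style RSU blocks."""
    def __init__(self, in_ch, mid_ch, out_ch):
        super().__init__()
        self.conv_in = ConvBNReLU(in_ch, out_ch)
        self.enc1 = ConvBNReLU(out_ch, mid_ch)
        self.enc2 = ConvBNReLU(mid_ch, mid_ch)
        self.bottleneck = ConvBNReLU(mid_ch, mid_ch, dilation=2)  # dilated conv, keeps resolution
        self.dec2 = ConvBNReLU(mid_ch * 2, mid_ch)
        self.dec1 = ConvBNReLU(mid_ch * 2, out_ch)

    def forward(self, x):
        x_in = self.conv_in(x)
        e1 = self.enc1(x_in)
        e2 = self.enc2(F.max_pool2d(e1, 2))
        b = self.bottleneck(e2)
        d2 = self.dec2(torch.cat([b, e2], dim=1))
        d2_up = F.interpolate(d2, size=e1.shape[2:], mode="bilinear", align_corners=False)
        d1 = self.dec1(torch.cat([d2_up, e1], dim=1))
        return d1 + x_in  # residual connection


def fuse_multi_scale(features, target_size):
    """Resize feature maps from different stages to one resolution and concatenate."""
    resized = [F.interpolate(f, size=target_size, mode="bilinear", align_corners=False)
               for f in features]
    return torch.cat(resized, dim=1)


if __name__ == "__main__":
    # Toy feature maps standing in for three encoder stages of different resolution.
    f1 = torch.randn(1, 16, 64, 64)
    f2 = torch.randn(1, 32, 32, 32)
    f3 = torch.randn(1, 64, 16, 16)
    fused = fuse_multi_scale([f1, f2, f3], target_size=(64, 64))   # (1, 112, 64, 64)
    block = UShapedResidualBlock(in_ch=112, mid_ch=32, out_ch=112)
    print(block(fused).shape)  # torch.Size([1, 112, 64, 64])
```

The fusion step upsamples coarser features instead of downsampling finer ones, which matches the abstract's emphasis on preserving feature-map resolution; the exact fusion resolution and channel widths used in the paper are not specified here.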