期刊文献+

基于场景对象注意与深度图融合的深度估计 被引量:1

Depth Estimation Based on Scene Object Attention and Depth Map Fusion
下载PDF
导出
摘要 现有单目深度估计算法主要从单幅图像中获取立体信息,存在相邻深度边缘细节模糊、明显的对象缺失问题。提出一种基于场景对象注意机制与加权深度图融合的单目深度估计算法。通过特征矩阵相乘的方式计算特征图任意两个位置之间的相似特征向量,以快速捕获长距离依赖关系,增强用于估计相似深度区域的上下文信息,从而解决自然场景中对象深度信息不完整的问题。基于多尺度特征图融合的优点,设计加权深度图融合模块,为具有不同深度信息的多视觉粒度的深度图赋予不同的权值并进行融合,融合后的深度图包含深度信息和丰富的场景对象信息,有效地解决细节模糊问题。在KITTI数据集上的实验结果表明,该算法对目标图像预估时σ<1.25的准确率为0.879,绝对相对误差、平方相对误差和对数均方根误差分别为0.110、0.765和0.185,预测得到的深度图具有更加完整的场景对象轮廓和精确的深度信息。 The existing monocular depth estimation algorithm mainly obtains stereo information from a single image.This approach leads to blurred details of adjacent depth edges and apparent missing objects.A monocular depth estimation algorithm based on scene object attention mechanism and weighted depth map fusion is proposed.The similarity feature vector between any two positions of feature map is calculated by multiplying the feature matrix to rapidly capture the long-distance dependency relationship.The dependency between any two positions in the image can enhance the context information used to estimate the similar depth area,thus,solving the incomplete object depth information in the natural scene.Based on the advantages of multi-scale feature map fusion,weighted depth map fusion module is designed.The multi-vision granularity depth map with different depth information data is assigned different weights for fusion.The fused depth map contains depth information and rich-scene object information for effectively solving the problem of fuzzy details.The experimental results on the KITTI dataset show that an accuracy rate of the proposed algorithm for target image prediction is 0.879 atσ<1.25,and the absolute relative error,square relative error,and logarithmic root mean square error are 0.110,0.765,and 0.185,respectively.The predicted depth map has a complete scene object contour and accurate depth information.
作者 温静 杨洁 WEN Jing;YANG Jie(School of Computer and Information Technology,Shanxi University,Taiyuan 030006,China)
出处 《计算机工程》 CAS CSCD 北大核心 2023年第2期222-230,共9页 Computer Engineering
基金 山西省基础性研究计划(201901D211176)。
关键词 场景对象注意 加权深度图融合 上下文信息 深度估计 三维重建 scene object attention weighted depth map fusion context information depth estimation three-dimensional reconstruction
  • 相关文献

参考文献4

二级参考文献29

共引文献40

同被引文献5

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部