期刊文献+

基于深度学习的双目立体匹配方法综述 被引量:11

Survey of Binocular Stereo-matching Methods Based on Deep Learning
下载PDF
导出
摘要 双目立体匹配是计算机视觉领域的经典问题,在自动驾驶、遥感、机器人感知等诸多任务中得到广泛应用。双目立体匹配的主要目标是寻找双目图像对中同名点的对应关系,并利用三角测量原理恢复图像深度信息。近年来,基于深度学习的立体匹配方法在匹配精度和匹配效率上均取得了远超传统方法的性能表现。将现有基于深度学习的立体匹配方法分为非端到端方法和端到端方法。基于深度学习的非端到端方法利用深度神经网络取代传统立体匹配方法中的某一步骤,根据被取代步骤的不同,该类方法被分为基于代价计算网络、基于代价聚合网络和基于视差优化网络的3类方法。基于深度学习的端到端方法根据代价体维度的不同可分为基于3D代价体和基于4D代价体的方法。从匹配精度、时间复杂度、应用场景等多个角度对非端到端和端到端方法中的代表性成果进行分析,并归纳各类方法的优点以及存在的局限性。在此基础上,总结基于深度学习的立体匹配方法当前面临的主要挑战并展望该领域未来的研究方向。 Binocular stereo matching is a classical problem in the field of computer vision and has been widely used in many tasks such as automated driving,remote sensing,and robot perception.The main goal of binocular stereo matching is to identify the corresponding relationship of same-named points in a binocular image pair and to recover image depth information based on the triangulation principle.In recent years,stereo-matching methods based on deep learning have achieved much better performance than traditional methods in terms of matching accuracy and efficiency.Existing stereo-matching methods based on deep learning are divided into non-end-to-end and end-to-end methods.The non-end-to-end methods based on deep learning use deep neural networks to replace steps in traditional stereo-matching methods.Based on these different steps,these methods can be divided into three types of networks:cost-based computing,cost-based aggregation,and disparity-based optimization.The end-to-end methods based on deep learning can be divided into 3D and 4D cost-volume-based methods according to different cost-volume dimensions.The representative methods of non-and end-to-end methods are analyzed in terms of matching accuracy,time complexity,and application scenarios,and the advantages and limitations of various methods are summarized.Accordingly,the main challenges of stereo-matching methods based on deep learning are summarized and future research directions in the field are prospected.
作者 尹晨阳 职恒辉 李慧斌 YIN Chenyang;ZHI Henghui;LI Huibin(School of Mathematics and Statistics,Xi’an Jiaotong University,Xi’an 710049,China)
出处 《计算机工程》 CAS CSCD 北大核心 2022年第10期1-12,共12页 Computer Engineering
基金 国家自然科学基金面上项目(61976173) 教育部-中国移动人工智能建设项目(MCM20190701)。
关键词 计算机视觉 深度学习 双目图像 立体匹配方法 图像深度 computer vision deep learning binocular images stereo-matching method image depth
  • 相关文献

参考文献4

二级参考文献22

  • 1吴翊 李永乐 等.应用数理统计[M].长沙:国防科技大学出版社,1997.135-144.
  • 2Kanade T.,Okutomi M..A stereo matching algorithm with an adaptive window:Theory and experiment.IEEE Transactions on Pattern Analysis and Machine Intelligence,1994,16(9):920~932
  • 3Veksler O..Fast variable window for stereo correspondence using integral images.In:Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition,Madison,WI,USA,2003,556~561
  • 4Daniel S.,Szeliski R..Stereo matching with nonlinear diffusion.International Journal of Computer Vision,1998,28(2):155~174
  • 5Veksler O..Stereo matching by compact windows via minimum ratio cycle.In:Proceedings of the International Conference on Computer Vision,Vancouver,Canada,2001,540 ~547
  • 6Scharstein D.,Szeliski R..A taxonomy and evaluation of dense two-frame stereo correspondence algorithms.International Journal of Computer Vision,2002,47(1):7~42
  • 7Boykov Y.,Veksler O.,Zabih R..Fast approximate energy minimization via graph cuts.IEEE Transactions on Pattern Analysis and Machine Intelligence,2001,23(11):1222~ 1239
  • 8Sun J.,Zheng N.-N.,Shum H.-Y..Stereo matching using belief propagation.IEEE Transactions on Pattern Analysis and Machine Intelligence,2003,25(7):787~800
  • 9狄红卫,柴颖,李逵.一种快速双目视觉立体匹配算法[J].光学学报,2009,29(8):2180-2184. 被引量:39
  • 10徐彦君,杜利民,侯自强,金贵昌.基于相位的尺度自适应立体匹配方法[J].电子学报,1999,27(7):38-41. 被引量:15

共引文献72

同被引文献64

引证文献11

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部