期刊文献+

深度学习的三维模型识别研究综述

Survey of 3D Model Recognition Based on Deep Learning
下载PDF
导出
摘要 随着三维扫描仪、LiDAR等三维视觉感知设备的快速发展,三维模型识别方向正逐渐引起越来越多的研究者的关注。该领域的核心任务是三维模型的分类与检索。深度学习技术在二维视觉任务方面已经取得显著的成就,将这一技术引入三维视觉领域不仅突破了传统方法的限制,还在自动驾驶、智能机器人等领域取得了引人瞩目的进展。然而,将深度学习技术应用于三维模型识别任务仍然面临着多项挑战。鉴于此,对深度学习在三维模型识别任务中的应用进行综述。首先,论述了常用的评价指标和公开数据集,介绍每个数据集的相关信息和来源。接着,从多个角度出发,包括点云、视图、体素以及多模态融合等,详细介绍现有具有代表性的方法,并梳理了近年来的相关研究工作。通过在数据集上对这些方法的性能进行对比,分析各个方法的优势和局限性。最后,基于各类方法的利弊,总结当前亟待解决的三维模型识别任务中的挑战,并展望了未来在该领域的发展趋势。 With the rapid advancement of three-dimensional visual perception devices such as 3D scanners and LiDAR,the field of 3D model recognition is gradually gaining the attention of a growing number of researchers.This domain encompasses two core tasks:3D model classification and retrieval.Since deep learning has already achieved significant success in two-dimensional visual tasks,its introduction into the realm of three-dimensional visual perception not only breaks free from the constraints of traditional methods but also makes notable strides in areas such as autonomous driving and intelligent robotics.However,the application of deep learning techniques to 3D model recognition tasks still faces several challenges.In light of this,there is a need for a comprehensive review of the application of deep learning in 3D model recognition.This review begins by discussing commonly used evaluation metrics and public datasets,providing relevant information and sources for each dataset.Subsequently,it delves into representative methods from various angles,including point clouds,views,voxels,and multimodal fusion.It also summarizes recent research development in the field.Through performance comparison on these datasets,the strengths and limitations of each method are analyzed.Finally,based on the merits and demerits of these approaches,the review outlines the challenges currently faced by 3D model recognition tasks and provides an outlook on future trends in this field.
作者 周燕 李文俊 党兆龙 曾凡智 叶德旺 ZHOU Yan;LI Wenjun;DANG Zhaolong;ZENG Fanzhi;YE Dewang(School of Electronic Information Engineering,Foshan University,Foshan,Guangdong 528000,China;School of Computer Science and Engineering,South China University of Technology,Guangzhou 510641,China)
出处 《计算机科学与探索》 CSCD 北大核心 2024年第4期916-929,共14页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金(61972091) 广东省自然科学基金(2022A1515010101,2021A1515012639) 广东省普通高校重点研究项目(2020ZDZX3049) 佛山市科技创新项目(2020001003285) 广东省教育科学规划课题(2021GXJK445)。
关键词 三维视觉 深度学习 点云 视图 体素 多模态 three-dimensional vision deep learning point clouds views voxels multimodal
  • 相关文献

参考文献6

二级参考文献39

  • 1杨育彬,林珲,朱庆.基于内容的三维模型检索综述[J].计算机学报,2004,27(10):1297-1310. 被引量:95
  • 2杨育彬,陈世福,林珲.一种基于颜色连通的图像纹理检索新方法[J].电子学报,2005,33(1):57-62. 被引量:16
  • 3Mohsen Zand,Shyamala Doraisamy,Alfian Abdul Halin.Texture classification and discrimination for region-based image retrieval[J].J.Vis.Comm.Image R,2015,26:305-316.
  • 4Xiaoyu Wang,Ming Yang,Timothee Cour,et al.Contextual Weighting for Vocabulary Tree based Image Retrieval[C].ICCV2011,pp.209-216,in Barcelona,Spain,November,2011.
  • 5Liang Zheng,Shengjin Wang,Ziqiong Liu,et al.Packing and padding:coupled multi-index for accurate image retrieval[C].CVPR2014,pp.1947-1954,in Columbus,Ohio,USA,June,2014.
  • 6Yong Xu,Hui Ji.Viewpoint invariant texture description using fractal analysis[J].Int.J Comput Vis,2009,83:85-100.
  • 7Yong Xu,Sibin Huang,Hui Ji.Scale-space texture description on SIFT-like textons[J].Computer vision and image understanding,2012,116:999-1013.
  • 8Donoho D.Compressed sensing[J].IEEE Transactions on Information Theory,2006,52(4):1289-1306.
  • 9Candes E,Wakin M.An introduction to compressive sampling[J].IEEE Signal Processing Magazine,2008,25(2):21-30.
  • 10Mahdi Cheraghchi,Venkatesan Guruswami,Ameya Velingker.Restricted isometry of fourier matrices and list decodability of random linear codes[J].Proceedings of the ACM-SIAM Symposium on Discrete Algorithms(SODA),2013.

共引文献180

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部