深度学习的三维模型识别研究综述

Survey of 3D Model Recognition Based on Deep Learning

下载PDF

导出

摘要随着三维扫描仪、LiDAR等三维视觉感知设备的快速发展,三维模型识别方向正逐渐引起越来越多的研究者的关注。该领域的核心任务是三维模型的分类与检索。深度学习技术在二维视觉任务方面已经取得显著的成就,将这一技术引入三维视觉领域不仅突破了传统方法的限制,还在自动驾驶、智能机器人等领域取得了引人瞩目的进展。然而,将深度学习技术应用于三维模型识别任务仍然面临着多项挑战。鉴于此,对深度学习在三维模型识别任务中的应用进行综述。首先,论述了常用的评价指标和公开数据集,介绍每个数据集的相关信息和来源。接着,从多个角度出发,包括点云、视图、体素以及多模态融合等,详细介绍现有具有代表性的方法,并梳理了近年来的相关研究工作。通过在数据集上对这些方法的性能进行对比,分析各个方法的优势和局限性。最后,基于各类方法的利弊,总结当前亟待解决的三维模型识别任务中的挑战,并展望了未来在该领域的发展趋势。 With the rapid advancement of three-dimensional visual perception devices such as 3D scanners and LiDAR,the field of 3D model recognition is gradually gaining the attention of a growing number of researchers.This domain encompasses two core tasks:3D model classification and retrieval.Since deep learning has already achieved significant success in two-dimensional visual tasks,its introduction into the realm of three-dimensional visual perception not only breaks free from the constraints of traditional methods but also makes notable strides in areas such as autonomous driving and intelligent robotics.However,the application of deep learning techniques to 3D model recognition tasks still faces several challenges.In light of this,there is a need for a comprehensive review of the application of deep learning in 3D model recognition.This review begins by discussing commonly used evaluation metrics and public datasets,providing relevant information and sources for each dataset.Subsequently,it delves into representative methods from various angles,including point clouds,views,voxels,and multimodal fusion.It also summarizes recent research development in the field.Through performance comparison on these datasets,the strengths and limitations of each method are analyzed.Finally,based on the merits and demerits of these approaches,the review outlines the challenges currently faced by 3D model recognition tasks and provides an outlook on future trends in this field.

作者周燕李文俊党兆龙曾凡智叶德旺 ZHOU Yan;LI Wenjun;DANG Zhaolong;ZENG Fanzhi;YE Dewang(School of Electronic Information Engineering,Foshan University,Foshan,Guangdong 528000,China;School of Computer Science and Engineering,South China University of Technology,Guangzhou 510641,China)

机构地区佛山科学技术学院电子信息工程学院华南理工大学计算机科学与工程学院

出处《计算机科学与探索》 CSCD 北大核心 2024年第4期916-929,共14页 Journal of Frontiers of Computer Science and Technology

基金国家自然科学基金(61972091) 广东省自然科学基金(2022A1515010101,2021A1515012639) 广东省普通高校重点研究项目(2020ZDZX3049) 佛山市科技创新项目(2020001003285) 广东省教育科学规划课题(2021GXJK445)。

关键词三维视觉深度学习点云视图体素多模态 three-dimensional vision deep learning point clouds views voxels multimodal

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献6

1周燕,曾凡智.基于二维压缩感知和分层特征的图像检索算法[J].电子学报,2016,44(2):453-460. 被引量：14
2汤磊,丁博,何勇军.基于卷积神经网络的高效三维模型检索方法[J].电子学报,2021,49(1):64-71. 被引量：10
3张满囤,燕明晓,马英石,王红,刘伟,黄向生.基于八叉树结构的三维体素模型检索[J].计算机学报,2021,44(2):334-346. 被引量：19
4李海生,孙莉,武玉娟,吴晓群,蔡强,杜军平.非刚性三维模型检索特征提取技术研究[J].软件学报,2018,29(2):483-505. 被引量：17
5李海生,武玉娟,郑艳萍,吴晓群,蔡强,杜军平.基于深度学习的三维数据分析理解方法研究综述[J].计算机学报,2020,43(1):41-63. 被引量：25
6Meng-Hao Guo,Jun-Xiong Cai,Zheng-Ning Liu,Tai-Jiang Mu,Ralph R.Martin,Shi-Min Hu.PCT:Point cloud transformer[J].Computational Visual Media,2021,7(2):187-199. 被引量：109

二级参考文献39

1杨育彬,林珲,朱庆.基于内容的三维模型检索综述[J].计算机学报,2004,27(10):1297-1310. 被引量：95
2杨育彬,陈世福,林珲.一种基于颜色连通的图像纹理检索新方法[J].电子学报,2005,33(1):57-62. 被引量：16
3Mohsen Zand,Shyamala Doraisamy,Alfian Abdul Halin.Texture classification and discrimination for region-based image retrieval[J].J.Vis.Comm.Image R,2015,26:305-316.
4Xiaoyu Wang,Ming Yang,Timothee Cour,et al.Contextual Weighting for Vocabulary Tree based Image Retrieval[C].ICCV2011,pp.209-216,in Barcelona,Spain,November,2011.
5Liang Zheng,Shengjin Wang,Ziqiong Liu,et al.Packing and padding:coupled multi-index for accurate image retrieval[C].CVPR2014,pp.1947-1954,in Columbus,Ohio,USA,June,2014.
6Yong Xu,Hui Ji.Viewpoint invariant texture description using fractal analysis[J].Int.J Comput Vis,2009,83:85-100.
7Yong Xu,Sibin Huang,Hui Ji.Scale-space texture description on SIFT-like textons[J].Computer vision and image understanding,2012,116:999-1013.
8Donoho D.Compressed sensing[J].IEEE Transactions on Information Theory,2006,52(4):1289-1306.
9Candes E,Wakin M.An introduction to compressive sampling[J].IEEE Signal Processing Magazine,2008,25(2):21-30.
10Mahdi Cheraghchi,Venkatesan Guruswami,Ameya Velingker.Restricted isometry of fourier matrices and list decodability of random linear codes[J].Proceedings of the ACM-SIAM Symposium on Discrete Algorithms(SODA),2013.

共引文献180

1ZHANG Ying,SUN Yue,WU Lin,ZHANG Lulu,MENG Bumin.3D Point Cloud Semantic Segmentation Based PAConv and SE_variant[J].Instrumentation,2023,10(4):27-38.
2吕坤朋,孙斌,赵玉晓.基于鸟鸣声及深度学习的鸟类识别方法研究[J].科技通报,2021,37(10):24-30. 被引量：5
3钟侠骄,张绍兵,郭静,王胜朝,成苗,何莲,赵铱民.基于RandLA-Net的3D点云牙颌分割与身份识别[J].计算机应用,2023,43(S01):269-275.
4王丽欢,任雨,刘建,李军阔,宫世杰.基于B-PointNet++的地下电缆工井点云语义分割模型[J].国外电子测量技术,2023,42(2):88-94. 被引量：4
5周燕,曾凡智,杨跃武.基于多特征融合的三维模型检索算法[J].计算机科学,2016,43(7):303-309. 被引量：4
6陆钊,朱晓姝.基于压缩感知的图像处理算法研究[J].计算机科学,2017,44(6):312-316. 被引量：2
7弓云峰,崔得龙.综合PHOG和LWT的图像检索算法[J].包装工程,2017,38(15):202-206.
8许德刚,廉飞宇.一种粮粒图像快速重构方法[J].河南工业大学学报（自然科学版）,2017,38(6):74-79.
9包晓安,詹秀娟,张俊为,王强,胡玲玲,桂江生.基于稀疏结构的图像特征匹配算法[J].计算机系统应用,2018,27(4):178-183. 被引量：2
10孙娜,刘继文,肖东亮.基于BFGS拟牛顿法的压缩感知SL0重构算法[J].电子与信息学报,2018,40(10):2408-2414. 被引量：10

1郑智鸿,宋海川.基于组对比学习的弱监督三维点云语义分割方法[J].华东师范大学学报（自然科学版）,2024(2):108-118.
2刘鹏,丁爱华,窦新宇.基于注意力机制和多级校正的单目室内场景深度估计[J].现代信息科技,2024,8(5):106-110.

计算机科学与探索

2024年第4期

浏览历史

内容加载中请稍等...

深度学习的三维模型识别研究综述

参考文献6

二级参考文献39

共引文献180

相关作者

相关机构

相关主题

浏览历史