Research Progress in Three-Dimensional Object Detection Technology Based on Point Cloud Data (cited by: 5)

Three-Dimensional Object Detection Technology Based on Point Cloud Data


Abstract: In recent years, with the spread of depth sensors and three-dimensional laser scanning equipment, point cloud data have attracted wide attention. Compared with two-dimensional images, point cloud data not only contain the depth information of a scene but are also unaffected by environmental factors such as illumination, so they support more accurate object recognition and three-dimensional localization. Point cloud-based three-dimensional object detection has therefore become a key technology for intelligent spatial perception and scene understanding. This paper first introduces the characteristics of point cloud data and discusses the different types of point cloud feature extraction methods. It then details the principles and development of point cloud object detection methods based on voxels, points, graphs, and hybrids of voxels and points. Next, it introduces common indoor and outdoor point cloud object detection datasets and evaluation metrics, and compares and analyzes the performance of the various methods on the KITTI and Waymo datasets. Finally, it summarizes research progress in point cloud object detection and offers an outlook.

Significance: In recent years, self-driving technology has garnered considerable attention from both academia and industry. Autonomous perception, which covers perception of the vehicle's own state and of the surrounding environment, is a critical component of self-driving technology and guides the decision-making and planning modules. Perceiving the environment accurately requires detecting objects in three-dimensional (3D) scenes. However, traditional 3D object detection techniques are typically based on image data, which lack depth information, and this makes image-based object detection difficult to apply to 3D scene tasks. 3D object detection therefore relies predominantly on point cloud data obtained from devices such as lidar and 3D scanners. Point cloud data consist of a collection of points, each containing coordinate information and additional attributes such as color, normal vector, and intensity. Point cloud data are rich in depth information. In contrast to two-dimensional images, however, they are sparse and unordered and have a complex, irregular structure, which poses challenges for feature extraction. Traditional methods rely on local point cloud information such as curvature, normal vector, and density, combined with methods such as the Gaussian model, to design descriptors manually. These methods depend heavily on a priori knowledge and fail to account for the relationships between neighboring points, resulting in low robustness and susceptibility to noise. In recent years, deep learning methods have gained significant attention from researchers because of their robust feature representation and generalization capabilities. The effectiveness of deep learning methods depends heavily on high-quality datasets. To advance the field of point cloud object detection, numerous companies such as Waymo and Baidu, as well as research institutes, have produced large-scale point cloud datasets. With the help of such datasets, point cloud object detection combined with deep learning has developed rapidly and demonstrated powerful performance. Despite this progress, challenges related to accuracy and real-time performance remain. This paper therefore reviews the research conducted in point cloud object detection and looks ahead to future developments in order to promote the advancement of the field.

Progress: The development of point cloud object detection has been significantly promoted by the recent emergence of large-scale open-source datasets. Several standard datasets have been released for outdoor scenes, including KITTI, Waymo, and nuScenes, and for indoor scenes, including NYU-Depth, SUN RGB-D, and ScanNet, and they have greatly facilitated research in this field. The relevant properties of these datasets are summarized in Table 1. Point cloud data are characterized by sparsity, non-uniformity, and disorder, which distinguish them from image data. To address these properties, researchers have developed a range of object detection algorithms designed specifically for this type of data. Based on the feature extraction method, point cloud-based single-modal methods can be categorized into four groups: voxel-based, point-based, graph-based, and point+voxel-based methods. Voxel-based methods divide the point cloud into regular voxel grids and aggregate point cloud features within each voxel to generate regular four-dimensional feature maps. VoxelNet, SECOND, and PointPillars are classic architectures of this kind. Point-based methods process the point cloud directly and use symmetric functions to aggregate point cloud features while retaining the geometric information of the point cloud to the greatest extent. PointNet, PointNet++, and PointRCNN are their classic architectures. Graph-based methods convert the point cloud into a graph representation and process it with a graph neural network. Point-GNN and Graph R-CNN are classic architectures of this approach. Point+voxel-based methods combine the point-based and voxel-based approaches, with STD and PV-RCNN as classic architectures. In addition, to enhance the semantic information of point cloud data, researchers have used image data as supplementary information to design multi-modal methods. MV3D, AVOD, and MMF are classic architectures of multi-modal methods. A chronological summary of classical methods for object detection from point clouds is presented in Fig. 4.

Conclusions and Prospects: 3D object detection from point clouds is a significant research area in computer vision that is gaining increasing attention from scholars. The foundational branch of 3D object detection from point clouds has flourished, and future research may focus on several areas. These include multi-branch and multi-modal fusion, the integration of two-dimensional detection methods, weakly supervised and self-supervised learning, and the creation and use of complex datasets.
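
The voxel-based and point-based aggregation steps described in the Progress section can be illustrated with a short NumPy sketch. It is a simplified illustration under stated assumptions, not the implementation of any architecture named above: the function names `voxelize_mean` and `global_max_feature` are hypothetical, the per-voxel aggregation is a plain mean rather than a learned feature encoder, and the symmetric function is a channel-wise max applied to raw coordinates rather than to learned per-point features.

```python
import numpy as np

def voxelize_mean(points, voxel_size):
    """Assign points to a regular voxel grid and average the points in each voxel.

    Assumption: a plain mean stands in for the learned per-voxel encoder
    that real voxel-based detectors use.
    """
    # Integer voxel index of every point along x, y, z.
    idx = np.floor(points[:, :3] / voxel_size).astype(np.int64)
    # Occupied voxels (sorted) and, for each point, the voxel it falls in.
    voxels, inverse = np.unique(idx, axis=0, return_inverse=True)
    inverse = inverse.reshape(-1)  # flatten; shape differs across NumPy versions
    sums = np.zeros((len(voxels), points.shape[1]))
    np.add.at(sums, inverse, points)  # accumulate point values per voxel
    counts = np.bincount(inverse, minlength=len(voxels))
    return voxels, sums / counts[:, None]

def global_max_feature(points):
    """Symmetric aggregation: the channel-wise max does not depend on point order."""
    return points.max(axis=0)

pts = np.array([[0.1, 0.2, 0.3],
                [0.4, 0.1, 0.2],
                [1.2, 0.1, 0.1]])
voxels, feats = voxelize_mean(pts, voxel_size=1.0)
same = np.array_equal(global_max_feature(pts), global_max_feature(pts[::-1]))
```

In this toy input the first two points share a voxel and are averaged, while the third occupies its own voxel. Reversing the point order leaves the max-pooled feature unchanged, which is the property that lets point-based methods handle unordered input.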

Authors: Li Jianan; Wang Ze; Xu Tingfa (School of Optoelectronics, Beijing Institute of Technology, Beijing 100081, China; Key Laboratory of Photoelectronic Imaging Technology and System, Ministry of Education, Beijing Institute of Technology, Beijing 100081, China; Chongqing Innovation Center, Beijing Institute of Technology, Chongqing 401135, China)

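Affiliations: School of Optoelectronics, Beijing Institute of Technology; Key Laboratory of Photoelectronic Imaging Technology and System, Ministry of Education, Beijing Institute of Technology; Chongqing Innovation Center, Beijing Institute of Technology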

Source: Acta Optica Sinica (光学学报), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2023, No. 15, pp. 286-302 (17 pages)

Funding: National Natural Science Foundation of China, Young Scientists Fund (62101032).

Keywords: point cloud; 3D object detection; single modality; multi-modality

Classification number: TP121 [Automation and Computer Technology: Control Theory and Control Engineering]

