Abstract
A high-performance two-stage 3D object detection algorithm based on the fusion of deep semantic and position information (DSPF-RCNN) is proposed. In the first stage, a deep feature extraction-region proposal network (DFE-RPN) is proposed, enabling the network to extract deeper texture and semantic features of targets in the bird's-eye view. In the second stage, an aware-point semantics and position feature fusion (ASPF) module is proposed, enabling the network to adaptively extract the most discriminative features of targets and strengthening the aggregation ability of the center points during feature extraction. The algorithm is evaluated on the KITTI dataset. On the test set, the detection accuracies for the Car class at the Easy, Moderate, and Hard levels surpass those of existing mainstream algorithms, reaching 89.90%, 81.04%, and 76.45%, respectively. On the validation set, the detection accuracies for the Car and Cyclist classes at the Moderate level are 84.40% and 73.90%, respectively, an improvement of approximately 4% over mainstream algorithms, with an inference time of 64 ms. Finally, the algorithm is deployed on a real-vehicle platform for online detection, verifying its engineering value.
Objective  Precise perception of the surrounding environment is the basis for realizing various functions in autonomous driving. Accurate identification of the locations of 3D targets in real scenes is key to improving the overall performance of autonomous driving. Lidar has become pivotal in this field because of its superiority in sensing richer 3D spatial information while being less affected by weather and other environmental factors. Current 3D target detection methods are mainly based on deep learning, which can achieve higher detection accuracy than traditional clustering and segmentation algorithms. The key to target detection based on deep learning is the in-depth extraction and utilization of point-cloud feature information. If feature information cannot be fully utilized, targets are misdetected or missed (Fig. 1), which has a significant impact on the safety of automatic driving functions. Therefore, deep extraction and utilization of point-cloud information are key to improving the accuracy of 3D target detection.

Methods  This study proposes a two-stage 3D target detection network (DSPF-RCNN, Fig. 1). In the first stage, the unordered original point cloud is divided into a regular voxel space, and point-wise features are converted into voxel-wise features by a convolutional neural network. The down-sampled output of the last layer is transformed into a 2D bird's-eye view (BEV), which is input into the deep feature extraction-region proposal network (DFE-RPN, Fig. 2) for deep extraction of 2D features. By fusing deep and shallow texture features with deep semantic features, the ability of the network to capture 2D image features is enhanced. In the second stage, some points are selected as center points from the latter two 3D down-sampled voxel spaces through farthest point sampling, and these center points are input into the aware-point semantics and position feature fusion (ASPF) module (Fig. 3), which integrates the 3D semantic features and location information of the surrounding point clouds. In this manner, the network can adaptively extract more diverse features of the target, because the center points have a stronger feature aggregation ability when aggregating neighboring points, which improves the network's ability to aggregate different feature information of the target. The center points are then used to aggregate the features of the surrounding points in the 3D voxel space (Fig. 4). Subsequently, region-of-interest pooling is applied to the aggregated features and the target candidate boxes generated in the first stage. Finally, more refined classification and bounding-box regression are performed for the target through fully connected layers.
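The conversion from the final 3D down-sampled voxel volume to the 2D BEV map can be illustrated with a minimal sketch. The abstract does not give implementation details, so the tensor shapes and the simple reshape-based collapse below (the approach commonly used in voxel-based detectors such as SECOND and PV-RCNN) are assumptions rather than the authors' code.

```python
# Minimal sketch (assumed shapes): collapse a dense 3D voxel feature volume into a
# 2D bird's-eye-view (BEV) map by folding the height dimension into the channels,
# so that a standard 2D region proposal network (here, DFE-RPN) can operate on it.
import torch

def voxel_volume_to_bev(voxel_features: torch.Tensor) -> torch.Tensor:
    """voxel_features: (B, C, D, H, W) dense feature volume from the last 3D stage.
    Returns a (B, C*D, H, W) BEV feature map."""
    b, c, d, h, w = voxel_features.shape
    return voxel_features.reshape(b, c * d, h, w)

if __name__ == "__main__":
    x = torch.randn(1, 64, 2, 200, 176)   # hypothetical output of the last 3D conv stage
    print(voxel_volume_to_bev(x).shape)   # torch.Size([1, 128, 200, 176])
```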
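The center points in the second stage are chosen by farthest point sampling, a standard routine; the sketch below shows one straightforward PyTorch implementation, with tensor names and shapes assumed for illustration.

```python
# Standard farthest point sampling (FPS): iteratively pick the point farthest from
# the set already selected, so the chosen center points cover the scene evenly.
import torch

def farthest_point_sampling(points: torch.Tensor, n_samples: int) -> torch.Tensor:
    """points: (N, 3) coordinates of candidate points (e.g., non-empty voxel centers).
    Returns indices of shape (n_samples,) of the selected center points."""
    n = points.shape[0]
    selected = torch.zeros(n_samples, dtype=torch.long)
    dist = torch.full((n,), float("inf"))            # distance to nearest selected center
    farthest = int(torch.randint(0, n, (1,)))        # arbitrary starting point
    for i in range(n_samples):
        selected[i] = farthest
        d = ((points - points[farthest]) ** 2).sum(dim=1)
        dist = torch.minimum(dist, d)                # update nearest-center distances
        farthest = int(torch.argmax(dist))           # next center = farthest remaining point
    return selected
```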
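The ASPF module is described here only at a high level. As a hedged illustration of the idea, the sketch below fuses per-neighbor semantic features with an encoding of their positions relative to a center point through a learned, adaptive channel weighting, and then aggregates the result onto the center point; the layer sizes, gating scheme, and max-pooling aggregation are assumptions, not the published design.

```python
# Hedged sketch of a point-wise semantics-and-position fusion block (not the authors'
# exact ASPF design): encode relative positions, adaptively blend them with semantic
# features via a learned gate, and aggregate neighbors onto each center point.
import torch
import torch.nn as nn

class PointSemanticsPositionFusion(nn.Module):
    def __init__(self, sem_channels: int, out_channels: int):
        super().__init__()
        self.pos_mlp = nn.Sequential(               # encode (dx, dy, dz) offsets
            nn.Linear(3, out_channels), nn.ReLU(),
            nn.Linear(out_channels, out_channels))
        self.sem_mlp = nn.Sequential(               # project semantic features
            nn.Linear(sem_channels, out_channels), nn.ReLU())
        self.gate = nn.Sequential(                  # adaptive per-channel weighting
            nn.Linear(2 * out_channels, out_channels), nn.Sigmoid())

    def forward(self, sem_feat: torch.Tensor, rel_xyz: torch.Tensor) -> torch.Tensor:
        """sem_feat: (N, K, C) semantic features of K neighbors per center point.
        rel_xyz:  (N, K, 3) neighbor coordinates relative to each center point."""
        sem = self.sem_mlp(sem_feat)
        pos = self.pos_mlp(rel_xyz)
        w = self.gate(torch.cat([sem, pos], dim=-1))
        fused = w * sem + (1.0 - w) * pos           # adaptively blend the two cues
        return fused.max(dim=1).values              # aggregate neighbors per center point

if __name__ == "__main__":
    fuse = PointSemanticsPositionFusion(sem_channels=64, out_channels=128)
    out = fuse(torch.randn(256, 16, 64), torch.randn(256, 16, 3))
    print(out.shape)                                # torch.Size([256, 128])
```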
Results and Discussions  The DSPF-RCNN is tested and evaluated on the official KITTI test and validation sets. On the test set (Table 1), the detection results for Car are better than those of existing mainstream algorithms, with detection accuracies of 89.90%, 81.04%, and 76.45% at the three difficulty levels. On the KITTI validation set (Table 2), at the 11 recall positions, the detection accuracy at the moderate level for Car and Cyclist is improved by 4% compared with those of the SVGA-Net and Part-A2 networks. The DSPF-RCNN accurately detects the three types of targets (Fig. 5). The effectiveness of the proposed modules is further compared and analyzed (Table 5). The results show that, after integrating the 3D semantic and position features of the surrounding point cloud, the center point better aggregates the feature information of the surrounding point cloud in the feature aggregation stage. Furthermore, when the DFE-RPN module is added, the network's ability to capture features increases further, and the extraction of feature information of small targets, such as cyclists and pedestrians, is significantly improved. A comparative analysis of the network's time consumption is also performed, including the time consumed by each module when reasoning over one frame of point-cloud data (Table 6). The comparison between DSPF-RCNN and other two-stage algorithms (Table 7) shows that the total inference time of DSPF-RCNN is 64 ms, which is advantageous in terms of the inference speed of two-stage algorithms. Finally, the algorithm is deployed on a real-vehicle platform to realize online detection (Fig. 7).

Conclusions  In this study, a two-stage target detection algorithm based on laser point clouds, DSPF-RCNN, is proposed. In the first stage, the proposed DFE-RPN module extracts abundant target feature information from 2D images. In the second stage, the proposed ASPF module allows the center points to aggregate the salient features of different targets. Testing on the KITTI test and validation sets and comparison with mainstream methods show that DSPF-RCNN is more advantageous in accurately detecting targets of different sizes, including small targets. At the moderate level on the KITTI validation set, the detection accuracies for Car and Cyclist are improved by approximately 4%, and the total network inference time is 64 ms. Finally, the DSPF-RCNN is applied to a local dataset to verify its engineering value.
Authors
Hu Jie
An Yongpeng
Xu Wencai
Xiong Zongquan
Liu Han
Hu Jie; An Yongpeng; Xu Wencai; Xiong Zongquan; Liu Han (Hubei Key Laboratory of Advanced Technology for Automotive Components, Wuhan University of Technology, Wuhan 430070, Hubei, China; Hubei Collaborative Innovation Center for Automotive Components Technology, Wuhan University of Technology, Wuhan 430070, Hubei, China; Hubei Research Center for New Energy & Intelligent Connected Vehicle, Wuhan University of Technology, Wuhan 430070, Hubei, China)
Source
《中国激光》
EI
CAS
CSCD
Peking University Core Journals
2023, Issue 10, pp. 192-202 (11 pages)
Chinese Journal of Lasers
Funding
Major Science and Technology Project of Hubei Province (2020AAA001, 2022AAA001).
Keywords
remote sensing
autonomous driving
LIDAR
3D target detection
feature fusion