融合附加神经网络的激光雷达点云单目标跟踪被引量：6

Single Object Tracking of LiDAR Point Cloud Combined with Auxiliary Deep Neural Network

导出

摘要已有激光雷达点云单目标跟踪工作对分布稀疏、小规模点云目标的跟踪性能不佳。针对该问题,提出了一种融合附加神经网络的点云单目标跟踪算法。所提算法在网络训练时,利用附加网络执行前景点云分割和中心坐标偏移回归的附加任务,引导骨干网络学习稀疏、小规模目标点云的高鉴别力特征,供后续网络在逐帧点云中完成对目标的定位和与背景的区分;网络推断时,则绕过附加网络,直接由骨干网络提取目标点云特征,在保证跟踪任务准确性的同时满足任务实时性的要求。在KITTI数据集上的测试结果表明:相较于已有工作,所提算法在相同参数设定下平均跟踪成功率提高了0.89个百分点,平均跟踪准确率提高了2.51个百分点;并且通过对参数设定的进一步研究与调整,最终所提算法平均跟踪成功率提高了4.54个百分点,平均跟踪准确率提高了7.83个百分点,且对分布稀疏、点数较少的点云目标有明显的性能提升。 Objective Point cloud can be selected as an ideal data format for tasks such as object classification,detection,segmentation,reconstruction,and tracking in a three-dimensional(3D)scene.In the case of a single object tracking task,the approach of considering point clouds as data outperforms that of selecting two-dimensional(2D)picture or video sequences for two reasons.First,the point cloud can better describe the 3D geometric information of the object in real scenes,such as the position,scale,and posture of the object.Second,different from the passive optical imaging principle of cameras,information is collected using light detection and ranging(LiDAR)following an active imaging approach,which is not prone to be affected by natural light conditions.Therefore,the point cloud can adapt to different conditions involving visual degradation or illumination and is robust to glare,reflections,and shadows.Based on this discussion,the single object tracking of 3D point cloud is a topic worth investigating.Generally,single object tracking tasks aim to use the information in the given initial frame to determine the tracked object and predict the locating bounding box of the object in each subsequent frame.However,existing single object trackers of LiDAR point clouds exhibit a poor tracking performance of sparsely distributed and small-scale point cloud objects.This is mainly attributed to the downscaling operation applied to features extracted from the point cloud,leading to the insufficient application of object’s structural information;this distracts the tracker from performing accurate bounding box predictions of sparsely distributed and small-scale point cloud objects.Methods To address this problem,a single object tracking network combined with auxiliary deep neural network is proposed herein.During the training stage,we attach a modified auxiliary network to the backbone network,which accomplishes two auxiliary tasks:1)foreground point cloud segmentation,which guides the backbone network to focus on pointwise semantic information;2)pointwise center coordinate offset regression,which leads the features to be aware of the intrastructural information of the object.These two tasks are jointly supervised using the backbone network such that the semantic and structural features are naturally stored in the object features extracted using the backbone network.However,during the inference stage,the auxiliary network is bypassed in this process because the trained backbone network is already optimized to be structure aware and detaching the auxiliary network can avoid extra computational cost,which is essential for retaining the real-time performance of the tracker.Moreover,we notice that the latest work follows the same manner as the dataset organization.In particular,the number of input points in the search area point cloud and template point cloud is fixed,irrespective of the class of point cloud data.However,as the KITTI dataset presents,the point cloud of some classes is dense and comprises a large number of points,while the point cloud of other classes suffers from scarce points,providing insufficient and limited object information.A fixed number of input points may be unsuitable for all data classes.Hence,we propose setting different input quantities for each class during both the training and inference stages,which is accomplished without changing the network structure.In general,the structure of different network modules is shown in Fig.1 to Fig.6 respectively.Results and Discussions Both qualitative and quantitative experiments are conducted to prove the superiority of our propositions.Table 1 shows the result of extensive comparisons between our proposed tracker and other two former trackers;our tracker achieves better results in three of four data classes and shows higher mean performance than the other two former trackers.Test results on the KITTI dataset show that our network increases the average tracking success by 0.89 percent and the average tracking accuracy by 2.51 percent under the same parameter settings as the existing work.Second,some specific tracking results of four data classes are depicted in Fig.7--Fig.10.These figures show that our tracker can predict bounding boxes closely similar to the ground truth.Furthermore,we present some results of the tasks processed using the auxiliary network.Concretely,Fig.11--Fig.14 show the results of foreground segmentation task,in which our auxiliary network can accurately segment the surface of the object from the background points.Moreover,details of comparisons on tracking performance with different numbers of the search area point cloud and template point cloud are discussed in Table 2,which prove the efficiency of adjusting a suitable quantity of input points of different data classes.Compared with proposed algorithm,the average tracking success increases by 4.54 percent,and the average tracking accuracy increases by 7.83 percent.In particular,for class Cyclist,the average tracking success increases by 2.38 percent and the average tracking accuracy increases by 2.86 percent;for class Pedestrian,the average tracking success increases by 8.47 percent and the average tracking accuracy increases by 13.71 percent.These findings imply that our tracker achieves improvements in terms of the tracking performance of sparse and small-scale objects.Finally,Table 3 and Table 4 show that we succeed in maintaining a balance between performance and computational costs.Conclusions In summary,the proposed method achieves reasonable results in addressing the problem of tracking sparsely distributed and small-scale point cloud objects as expected,and can be applied to solve other tasks.Inspired by our experiment results,to achieve further improvement,we will seek new approaches on data augmentation and extract more useful clues using the background information for search area updates.

作者周笑宇王玲马燕新陈沛铂 Zhou Xiaoyu;Wang Ling;Ma Yanxin;Chen Peibo(College of Electronic Science,National University of Defense Technology,Changsha,Hunan 410073,China;College of Meteorology and Oceanography,National University of Defense Technology,Changsha,Hunan 410073,China)

机构地区国防科技大学电子科学学院国防科技大学气象海洋学院

出处《中国激光》 EI CAS CSCD 北大核心 2021年第21期152-164,共13页 Chinese Journal of Lasers

关键词图像处理激光雷达点云附加网络单目标跟踪 image processing LiDAR point cloud auxiliary network single object tracking

分类号 TN249 [电子电信—物理电子学]

引文网络
相关文献

参考文献5

1顾尚泰,王玲,马燕新,马超.基于分层墨卡托投影的激光雷达点云数据局部特征描述[J].光学学报,2020,40(20):120-126. 被引量：9
2胡海瑛,惠振阳,李娜.基于多基元特征向量融合的机载LiDAR点云分类[J].中国激光,2020,47(8):229-239. 被引量：21
3易倩,钟浩宇,刘龙,刘文龙,易兵.基于ROI-RSICP算法的车轮廓形动态检测[J].中国激光,2020,47(11):147-158. 被引量：5
4范小辉,许国良,李万林,王茜竹,常亮亮.基于深度图的三维激光雷达点云目标分割方法[J].中国激光,2019,46(7):284-291. 被引量：47
5张子健,程效军,曹宇杰,王峰,喻月.结合激光与视觉点云的古遗迹三维重建应用[J].中国激光,2020,47(11):266-275. 被引量：35

二级参考文献45

1张红波,叶海建.基于图像处理的轮对磨耗值检测方法的研究[J].机械,2004,31(8):51-53. 被引量：2
2邓非,张祖勋,张剑清.利用激光扫描和数码相机进行古建筑三维重建研究[J].测绘科学,2007,32(2):29-30. 被引量：50
3邓非,徐国杰,冯晨,管海燕.LiDAR数据与航空影像结合的建筑物重建[J].测绘信息与工程,2010,35(1):35-37. 被引量：18
4程亮,龚健雅,李满春,刘永学,宋小刚.集成多视航空影像与LiDAR数据重建3维建筑物模型[J].测绘学报,2009,38(6):494-501. 被引量：32
5袁夏,赵春霞.一种应用于机器人导航的激光点云聚类算法[J].机器人,2011,33(1):90-96. 被引量：11
6常永敏,张帆,黄先锋,刘刚.基于激光扫描和高精度数字影像的敦煌石窟第196、285窟球幕图像制作[J].敦煌研究,2011(6):96-100. 被引量：7
7左志权,张祖勋,张剑清.区域回波比率与拓扑识别模型结合的城区激光雷达点云分类方法[J].中国激光,2012,39(4):189-194. 被引量：16
8杨飞,朱株,龚小谨,刘济林.基于三维激光雷达的动态障碍实时检测与跟踪[J].浙江大学学报（工学版）,2012,46(9):1565-1571. 被引量：24
9高岩,邵双运,冯其波.一种激光扫描自动测量轮对几何参数的方法[J].中国激光,2013,40(7):176-181. 被引量：17
10郭波,黄先锋,张帆,王晏民.顾及空间上下文关系的JointBoost点云分类及特征降维[J].测绘学报,2013,42(5):715-721. 被引量：33

共引文献111

1陈西江,安庆,班亚,王德欣,李坤,刘海鹏.融合高斯核及指数函数聚类的点云目标物提取[J].应用科学学报,2022,40(3):411-422.
2吴冬,阎卫东,王井利.基于特征重要性加权的随机森林点云分类研究[J].电子测量技术,2023,46(20):120-127.
3倪风岳,于长海.线性预测编码系数分类雷达反射截面积目标的方法及实现[J].光学与光电技术,2020,18(2):42-46.
4武泽永,岳维平.高速冲剪模的设计与制造[J].防爆电机,2000,35(1):28-29.
5张心睿,潘新福.基于局部凸性的三维激光雷达点云分割算法[J].现代信息科技,2019,3(21):165-166.
6蒋剑飞,李其仲,黄妙华,龚杰.基于三维激光雷达的障碍物及可通行区域实时检测[J].激光与光电子学进展,2019,56(24):241-250. 被引量：12
7钱其姝,胡以华,赵楠翔,李敏乐,邵福才.基于激光点云全局特征匹配处理的目标跟踪算法[J].激光与光电子学进展,2020,57(6):149-156. 被引量：15
8张爱武,刘路路,张希珍.道路三维点云多特征卷积神经网络语义分割方法[J].中国激光,2020,47(4):261-269. 被引量：18
9杜艺,葛帅,单萌蕾.一种基于LiDAR点云的河道纵横断面获取方法[J].北京测绘,2020,34(8):1114-1118. 被引量：4
10刘路,潘艳娟,陈志健,王玉伟,李亚伟,陈黎卿.高遮挡环境下玉米植保机器人作物行间导航研究[J].农业机械学报,2020,51(10):11-17. 被引量：20

同被引文献81

1李笑宇,林虎,薛梓,杨国梁.激光跟踪多边测量自标定优化方法[J].仪器仪表学报,2021,42(2):10-17. 被引量：7
2郑少武,李巍华,胡坚耀.基于激光点云与图像信息融合的交通环境车辆检测[J].仪器仪表学报,2019,40(12):143-151. 被引量：37
3袁建英,王琼,李柏林.利用标志点多视图约束实现结构光扫描高精度粗拼接[J].计算机辅助设计与图形学学报,2015,27(4):674-683. 被引量：7
4王鑫,唐振民.基于特征融合的粒子滤波在红外小目标跟踪中的应用[J].中国图象图形学报,2010,15(1):91-97. 被引量：17
5丁欢,张文生.融合SPA遮挡分割的多目标跟踪方法[J].中国图象图形学报,2012,17(1):90-98. 被引量：3
6任仙怡,廖云涛,张桂林,张天序.一种新的相关跟踪方法研究[J].中国图象图形学报（A辑）,2002,7(6):553-557. 被引量：56
7宁纪锋,赵耀博,石武祯.多通道Haar-like特征多示例学习目标跟踪[J].中国图象图形学报,2014,19(7):1038-1045. 被引量：11
8闫利,魏峰.利用密集匹配点云的建筑单体提取算法研究[J].中国激光,2018,45(7):264-271. 被引量：13
9宫海洋,任红格,史涛,李福进.基于改进粒子滤波的稀疏子空间单目标跟踪算法[J].现代电子技术,2018,41(13):10-13. 被引量：4
10余洪山,付强,孙健,吴司良,陈昱名.面向室内移动机器人的改进3D-NDT点云配准算法[J].仪器仪表学报,2019,40(9):151-161. 被引量：22

引证文献6

1陈慧娴,吴一全,张耀.基于深度学习的三维点云分析方法研究进展[J].仪器仪表学报,2023,44(11):130-158.
2王蒙蒙,杨小倩,刘勇.利用时空特征编码的单目标跟踪网络[J].中国图象图形学报,2022,27(9):2733-2748. 被引量：2
3杨治,彭蕾,涂起龙.基于激光雷达的无人机导航图像分水岭快速分割方法[J].激光杂志,2023,44(8):125-129.
4刘丹,廖俊东.基于位置探测器的高精度激光跟踪系统[J].激光杂志,2023,44(8):216-220.
5龙科军,余娟,费怡,向凌云,骆嫚,杨双辉.激光雷达和相机的决策级融合目标检测方法[J].长沙理工大学学报（自然科学版）,2024,21(1):133-140.
6贾冕茜.面向无人驾驶车避障的光跳频激光雷达探测目标智能主动跟踪方法[J].河南工程学院学报（自然科学版）,2024,36(1):55-59.

二级引证文献2

1杜彦东,冯林,陶鹏,龚勋,王俊.元迁移学习在少样本跨域图像分类中的研究[J].中国图象图形学报,2023,28(9):2899-2912. 被引量：2
2赵洁,袁永胜,张鹏宇,王栋.轻量化Transformer目标跟踪数据标注算法[J].中国图象图形学报,2023,28(10):3176-3190.

1高银,杨崇一,何建华,杨丰强,王伟根,应碧伟.多层螺旋CT增强扫描与MRI用于肾实性肿瘤鉴别诊断的临床研究[J].中华全科医学,2021,19(11):1912-1915. 被引量：5
2苏建,李在娟.融合视觉和以太网技术的工业机器人分拣装配控制系统设计[J].机床与液压,2021,49(24):119-123. 被引量：19
3王慧玲,谢卓辰,梁旭文.单粒子翻转对神经网络的影响分析与优化[J].中国科学院大学学报（中英文）,2021,38(6):832-840.
4曾敏,袁松,石永华,胡子鑫,王卓然.基于异构多核的焊接集控器人机交互设计[J].焊接,2021(11):38-41.

中国激光

2021年第21期

浏览历史

内容加载中请稍等...

融合附加神经网络的激光雷达点云单目标跟踪被引量：6

参考文献5

二级参考文献45

共引文献111

同被引文献81

引证文献6

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

融合附加神经网络的激光雷达点云单目标跟踪 被引量：6

参考文献5

二级参考文献45

共引文献111

同被引文献81

引证文献6

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

融合附加神经网络的激光雷达点云单目标跟踪被引量：6