点云多尺度编码的单阶段3D目标检测网络

Multiscale encoding for single-stage 3D object detector from point clouds

导出

摘要目的自动引导运输小车(automatic guided vehicles,AGV)在工厂中搬运货物时会沿着规定路线运行,但是在靠近障碍物时只会简单地自动停止,无法感知障碍物的具体位置和大小,为了让AGV小车在复杂的工业场景中检测出各种障碍物,提出了一个点云多尺度编码的单阶段3D目标检测网络(multi-scale encoding for single-stage 3D object detector from point clouds,MSE-SSD)。方法首先,该网络通过可学习的前景点下采样模块来对原始点云进行下采样,以精确地分割出前景点。其次,将这些前景点送入多抽象尺度特征提取模块进行处理,该模块能够分离出不同抽象尺度的特征图并对它们进行自适应地融合,以减少特征信息的丢失。然后,从特征图中预测出中心点,通过多距离尺度特征聚合模块将中心点周围的前景点按不同距离尺度进行聚合编码,得到语义特征向量。最后,利用中心点和语义特征向量一起预测包围框。结果MSE-SSD在自定义数据集中进行实验,多个目标的平均精度(average precision,AP)达到了最优,其中,在困难级别下空AGV分类、简单级别下载货AGV分类比排名第2的IASSD(learning highly efficient point-based detectors for 3D LiDAR point clouds)高出1.27%、0.08%,在简单级别下工人分类比排名第2的SA-SSD(structure aware single-stage 3D object detection from point cloud)高出0.71%。网络运行在单个RTX 2080Ti GPU上检测速度高达77帧/s,该速度在所有主流网络中排名第2。将训练好的网络部署在AGV小车搭载的开发板TXR上,检测速度达到了8.6帧/s。结论MSE-SSD在AGV小车避障检测方面具有较高的精确性和实时性。 scale feature maps can provide the computer with almost all target semantic information.The final feature map is used to predict the heatmap,which is sent to the next module.The multi-distance scale feature aggregation module then obtains the center point of each target from the heatmap and aggregates the foreground points near each center point in the voxel space.The module quickly obtains the foreground points through a voxel query and groups them according to the different distances between them and the center point.When the probability that the foreground point close to the center point belongs to this target is high,the probability that the foreground point far away belongs to the center point target is low.Therefore,networks with different weights are used to encode the groups of foreground points to obtain distance-sensitive multiscale semantic features.Finally,the semantic feature and the center point jointly predict the bounding box,where the center point represents the center coordinate of the bounding box and the semantic feature predicts the confidence,size,and deflection angle of the bounding box.Result The official data sets KITTI and Waymo are used to evaluate the performance of the model,and the custom data set is then utilized to evaluate the final combat effect of the model.In the KITTI test set,the nine most popular methods at present are compared.MSE-SSD ranked third in detection speed,and the frames per second reached 34.Simultaneously,in the comparison of average precision(AP),MSE-SSD and the most advanced singlestage detector at present were almost the same.In the Waymo verification set,compared with other single-stage detectors,the average accuracy of multiple indicators(pedestrians and bicycles)of MSE-SSD for relatively complex targets ranked first.In the customized data set,the following three targets are detected:empty AGV,loaded AGV,and pedestrian.Under the simple level,the AP of MSE-SSD in the cargo AGV and pedestrian targets is 0.08%and 0.71%higher than the second,respectively.At this difficulty level,the AP of MSE-SSD is 1.27%higher than the second in the empty AGV target.Simultaneously,the detection speed of MSE-SSD reached the second level at 65 frame/s.The trained network is deployed on the TXR demoboard carried by the AGV car,and the detection speed reached 7.3 frame/s.Conclusion Considering the transportation problem in the industrial scene,an obstacle avoidance detection method for AGV is introduced based on two point cloud scales.This method has high detection accuracy and speed and provides a detection guarantee for AGV when running on mobile devices.

作者韩俊博胡海洋李忠金潘开来王利红 Han Junbo;Hu Haiyang;Li Zhongjin;Pan Kailai;Wang Lihong(School of Computer Science and Technology,Hangzhou Dianzi University,Hangzhou 310018,China;Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province,Hangzhou 310018,China)

机构地区杭州电子科技大学计算机学院浙江省脑机协同智能重点实验室

出处《中国图象图形学报》 CSCD 北大核心 2024年第11期3417-3432,共16页 Journal of Image and Graphics

基金浙江省重点研发计划“领雁”项目(2023C01145)。

关键词 3D目标检测单阶段检测网络点云下采样点云特征提取点云特征聚合 3D object detection single-stage detector point cloud down-sampling point cloud feature extraction point cloud feature aggregation

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1刘梦涵,王坚,马运涛,柳根,鲍王雨莎,孙昱.一种融合自适应点云特征提取的激光SLAM方法[J].测绘科学,2024,49(9):155-163.
2梁晓洪.面向5G移动通信技术的网络优化分析[J].移动信息,2024,46(11):1-3.
3刘琦.AI自动识别在计算机网络安全中的应用[J].中国宽带,2024,20(2):133-135.
4卢成浩,陈秀宏.基于隐式分区学习深度特征融合重建曲面网络[J].数据与计算发展前沿（中英文）,2024,6(6):19-31.
5李峰,孟飙.基于K4PCS与ICP算法在点云配准中的应用[J].智能计算机与应用,2024,14(11):138-143.
6敖永曦,向婷婷.静脉注射用免疫球蛋白在利妥昔单抗治疗非霍奇金淋巴瘤防治作用的研究进展[J].吉林医学,2024,45(12):3108-3110.
7秦军,谭峻雄,孙瑜,吕俊德,朱可佳,李月琴,孙剑,缪旻.硅基微环调制器耦合状态对高速PAM4通信系统的性能影响分析[J].光学学报,2024,44(20):129-142.
8王涌,曾祥国,肖桂林,温昕,周鑫鑫,邓江丽,韩永超.草莓白化伴随病毒RT-LAMP检测方法的建立[J].植物保护,2024,50(6):237-245.
9Shangjun Gao,Yang Yang,Man Chen,Jian Zheng,Luqi Qin,Xiangyu Liu,Jianying Yang.Hydraulic Fracture Parameter Inversion Method for Shale GasWells Based on Transient Pressure-Drop Analysis during Hydraulic Fracturing Shut-in Period[J].Energy Engineering,2024,121(11):3305-3329.
10王凌波,石国祥,吴蓓蓓,姚文武,吴卓颖,吴亦斐,杨章女.杭州市2023年污水环境中霍乱弧菌的毒力及耐药特征[J].中国热带医学,2024,24(11):1306-1311.

中国图象图形学报

2024年第11期

浏览历史

内容加载中请稍等...

点云多尺度编码的单阶段3D目标检测网络

相关作者

相关机构

相关主题

浏览历史