摘要
目的自动引导运输小车(automatic guided vehicles,AGV)在工厂中搬运货物时会沿着规定路线运行,但是在靠近障碍物时只会简单地自动停止,无法感知障碍物的具体位置和大小,为了让AGV小车在复杂的工业场景中检测出各种障碍物,提出了一个点云多尺度编码的单阶段3D目标检测网络(multi-scale encoding for single-stage 3D object detector from point clouds,MSE-SSD)。方法首先,该网络通过可学习的前景点下采样模块来对原始点云进行下采样,以精确地分割出前景点。其次,将这些前景点送入多抽象尺度特征提取模块进行处理,该模块能够分离出不同抽象尺度的特征图并对它们进行自适应地融合,以减少特征信息的丢失。然后,从特征图中预测出中心点,通过多距离尺度特征聚合模块将中心点周围的前景点按不同距离尺度进行聚合编码,得到语义特征向量。最后,利用中心点和语义特征向量一起预测包围框。结果MSE-SSD在自定义数据集中进行实验,多个目标的平均精度(average precision,AP)达到了最优,其中,在困难级别下空AGV分类、简单级别下载货AGV分类比排名第2的IASSD(learning highly efficient point-based detectors for 3D LiDAR point clouds)高出1.27%、0.08%,在简单级别下工人分类比排名第2的SA-SSD(structure aware single-stage 3D object detection from point cloud)高出0.71%。网络运行在单个RTX 2080Ti GPU上检测速度高达77帧/s,该速度在所有主流网络中排名第2。将训练好的网络部署在AGV小车搭载的开发板TXR上,检测速度达到了8.6帧/s。结论MSE-SSD在AGV小车避障检测方面具有较高的精确性和实时性。
scale feature maps can provide the computer with almost all target semantic information.The final feature map is used to predict the heatmap,which is sent to the next module.The multi-distance scale feature aggregation module then obtains the center point of each target from the heatmap and aggregates the foreground points near each center point in the voxel space.The module quickly obtains the foreground points through a voxel query and groups them according to the different distances between them and the center point.When the probability that the foreground point close to the center point belongs to this target is high,the probability that the foreground point far away belongs to the center point target is low.Therefore,networks with different weights are used to encode the groups of foreground points to obtain distance-sensitive multiscale semantic features.Finally,the semantic feature and the center point jointly predict the bounding box,where the center point represents the center coordinate of the bounding box and the semantic feature predicts the confidence,size,and deflection angle of the bounding box.Result The official data sets KITTI and Waymo are used to evaluate the performance of the model,and the custom data set is then utilized to evaluate the final combat effect of the model.In the KITTI test set,the nine most popular methods at present are compared.MSE-SSD ranked third in detection speed,and the frames per second reached 34.Simultaneously,in the comparison of average precision(AP),MSE-SSD and the most advanced singlestage detector at present were almost the same.In the Waymo verification set,compared with other single-stage detectors,the average accuracy of multiple indicators(pedestrians and bicycles)of MSE-SSD for relatively complex targets ranked first.In the customized data set,the following three targets are detected:empty AGV,loaded AGV,and pedestrian.Under the simple level,the AP of MSE-SSD in the cargo AGV and pedestrian targets is 0.08%and 0.71%higher than the second,respectively.At this difficulty level,the AP of MSE-SSD is 1.27%higher than the second in the empty AGV target.Simultaneously,the detection speed of MSE-SSD reached the second level at 65 frame/s.The trained network is deployed on the TXR demoboard carried by the AGV car,and the detection speed reached 7.3 frame/s.Conclusion Considering the transportation problem in the industrial scene,an obstacle avoidance detection method for AGV is introduced based on two point cloud scales.This method has high detection accuracy and speed and provides a detection guarantee for AGV when running on mobile devices.
作者
韩俊博
胡海洋
李忠金
潘开来
王利红
Han Junbo;Hu Haiyang;Li Zhongjin;Pan Kailai;Wang Lihong(School of Computer Science and Technology,Hangzhou Dianzi University,Hangzhou 310018,China;Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province,Hangzhou 310018,China)
出处
《中国图象图形学报》
CSCD
北大核心
2024年第11期3417-3432,共16页
Journal of Image and Graphics
基金
浙江省重点研发计划“领雁”项目(2023C01145)。
关键词
3D目标检测
单阶段检测网络
点云下采样
点云特征提取
点云特征聚合
3D object detection
single-stage detector
point cloud down-sampling
point cloud feature extraction
point cloud feature aggregation