Abstract
Objective: Outdoor-scene point clouds collected by LiDAR are large in scale and contain rich spatial structural detail, but most current point cloud segmentation methods cannot properly balance the extraction of structural detail against computational cost. Some methods transform the point cloud into dense representations such as multi-view images or voxel grids; although this greatly reduces computation, it ignores the information loss and occlusion caused by the LiDAR imaging characteristics and by the point cloud transformation, which degrades segmentation performance, especially on small-sample data and for small objects such as pedestrians and cyclists. To address the loss of spatial detail during projection, a scene viewpoint offset method inspired by the human observation mechanism is proposed to improve three-dimensional (3D) LiDAR point cloud segmentation. Method: Spherical projection is used to transform the 3D point cloud into a two-dimensional (2D) spherical front view (SFV). The original viewpoint of the SFV is shifted horizontally to generate a multi-viewpoint sequence, which alleviates the information loss and occlusion caused by the point cloud transformation. Considering the redundancy within the multi-view sequence, a scene viewpoint offset prediction module is built with convolutional neural networks (CNN) to predict the optimal scene viewpoint offset. Result: On the small-sample dataset, adding the scene viewpoint offset module clearly improves the segmentation of pedestrians and cyclists: their intersection over union (IoU) under different offset distances increases by up to 6.5% and 15.5%, respectively, compared with the original method. After adding both the scene viewpoint offset module and the offset prediction module, the IoU of each category increases by 1.6% to 3%. On the KITTI (Karlsruhe Institute of Technology and Toyota Technological Institute) dataset, the segmentation of pedestrians and cyclists improves considerably compared with other algorithms, with pedestrian IoU increasing by up to 9.1%. Conclusion: The proposed scene viewpoint offset and offset prediction method, which combines the human observation mechanism with the imaging characteristics of LiDAR point clouds, is easy to adapt to different point cloud segmentation methods and makes the segmentation results more accurate.
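To make the projection step concrete, the following is a minimal NumPy sketch of how a LiDAR point cloud might be mapped to a spherical front view after a horizontal viewpoint offset. The image size, vertical field of view, and the `offset_x` parameter are illustrative assumptions, not the exact values used in the paper.

```python
import numpy as np

def spherical_front_view(points, offset_x=0.0, H=64, W=512,
                         fov_up=3.0, fov_down=-25.0):
    """Project an (N, 4) point cloud (x, y, z, intensity) into an H x W
    spherical front view (SFV) after shifting the viewpoint by offset_x
    metres along the horizontal axis. All sizes are illustrative."""
    xyz = points[:, :3].copy()
    xyz[:, 0] -= offset_x                     # shift the projection center horizontally
    x, y, z = xyz[:, 0], xyz[:, 1], xyz[:, 2]
    r = np.linalg.norm(xyz, axis=1) + 1e-8    # range measured from the shifted viewpoint

    yaw = np.arctan2(y, x)                    # azimuth angle
    pitch = np.arcsin(z / r)                  # elevation angle
    fov_rad = np.radians(fov_up - fov_down)

    u = 0.5 * (1.0 - yaw / np.pi) * W                           # column from azimuth
    v = (1.0 - (pitch - np.radians(fov_down)) / fov_rad) * H    # row from elevation
    u = np.clip(np.floor(u), 0, W - 1).astype(np.int32)
    v = np.clip(np.floor(v), 0, H - 1).astype(np.int32)

    # channels: x, y, z, intensity, range
    sfv = np.zeros((H, W, 5), dtype=np.float32)
    order = np.argsort(r)[::-1]               # draw far points first so the nearest wins
    sfv[v[order], u[order]] = np.concatenate(
        [xyz[order], points[order, 3:4], r[order, None]], axis=1)
    return sfv
```

Rendering the same cloud with several `offset_x` values yields the multi-viewpoint SFV sequence described above.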
Objective The point cloud data of outdoor scenes collected by LiDAR is large in scale and contains rich spatial structural detail. Many current point cloud segmentation methods cannot properly balance the extraction of structural detail against the amount of computation. Current point cloud learning approaches fall mainly into direct methods and transformation methods. Direct methods extract features from all points and can capture more spatial structure information, but the scale of point clouds they can process is usually limited, so they require additional auxiliary processing for outdoor scenes with large data volumes. Transformation methods use projection or voxelization to convert the point cloud into a dense representation. Images generated by projecting the point cloud into 2D are denser, more consistent with human cognition, and easier to combine with mature 2D convolutional neural networks (CNN). However, real spatial structure information is inevitably lost in the transformation. In addition, for small-sample data and small-object scenes (such as pedestrians and cyclists), segmentation performance decreases; the main reasons are the information loss caused by LiDAR imaging characteristics and by the transformation, together with more serious occlusion. A scene viewpoint offset method based on the human observation mechanism is proposed in this paper to improve 3D LiDAR point cloud segmentation performance and alleviate the loss of spatial detail during projection.
Method First, spherical projection is exploited to transform the 3D point cloud into a 2D spherical front view (SFV). This projection is consistent with LiDAR imaging, which minimizes the loss introduced when generating the new representation. Moreover, the generated images are denser, more in line with human cognition, and easy to combine with mature 2D convolutional neural networks. The projection also removes part of the point cloud and reduces the amount of computation. Then, to address information loss and occlusion, the original viewpoint of the SFV is moved horizontally to generate a multi-view sequence. Although SFV projection handles the sparsity of the raw point cloud, many spatial details are inevitably lost in the projection. A 3D object can be observed from different angles, each revealing different shape characteristics; based on this, a multi-view observation sequence is formed by moving the projection center, providing a more reliable sample sequence for point cloud segmentation. In the segmentation network, the SFV features are downsampled with the Fire convolutional layers and max-pooling layers of the SqueezeSeg series of networks. To obtain full-resolution label features for each point, deconvolution is used for upsampling to obtain the decoded features. A skip-layer connection structure adds the upsampled feature maps to low-level feature maps of the same size, better combining the low-level features and the high-level semantic features of the network. Although the viewpoint offset improves the segmentation results to some extent, blindly increasing the offset adds unnecessary computation to the system. Considering the redundancy of the multi-viewpoint sequence, finding the optimal offset in practice is important. Finally, a CNN is used to construct the scene viewpoint offset prediction module and predict the optimal scene viewpoint offset.
Result The dataset adopted in this paper is the converted Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI) dataset. To show that the proposed method is suitable for relatively small datasets, a smaller dataset (containing a training set of 1,182 frames and a validation set of 169 frames) is extracted for ablation experiments. On the small-sample dataset, after adding the scene viewpoint offset module, the segmentation results of pedestrians and cyclists improve, and their intersection over union (IoU) at different offset distances increases by up to 6.5% and 15.5%, respectively, compared with the original method. After adding both the scene viewpoint offset module and the offset prediction module, the IoU of each category increases by 1.6% to 3%. On the KITTI raw dataset, compared with other methods, several categories achieve the best IoU, and the pedestrian IoU increases by 9.1%.
Conclusion Combining the human observation mechanism with the imaging characteristics of LiDAR point clouds, the proposed method greatly reduces computation while retaining certain 3D spatial information, efficiently realizes high-precision segmentation, and adapts easily to different point cloud segmentation methods. Although the viewpoint offset and offset prediction method can improve LiDAR point cloud segmentation to a certain extent, further improvement remains possible, especially when the images in the sequence are strongly correlated. Global and local offset fusion architectures for objects of different types and sizes can further be designed to exploit the correlation between images and make more accurate and effective predictions for objects in the view.
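As a rough illustration of the SqueezeSeg-style backbone described in the Method section, the sketch below shows a Fire module and a tiny encoder-decoder with one additive skip connection in PyTorch. Channel widths, the network depth, and the class count are placeholders; this is not the authors' exact architecture.

```python
import torch
import torch.nn as nn

class Fire(nn.Module):
    """SqueezeNet-style Fire module: 1x1 squeeze, then parallel 1x1/3x3 expand branches."""
    def __init__(self, in_ch, squeeze_ch, expand_ch):
        super().__init__()
        self.squeeze = nn.Sequential(nn.Conv2d(in_ch, squeeze_ch, 1), nn.ReLU(inplace=True))
        self.expand1 = nn.Sequential(nn.Conv2d(squeeze_ch, expand_ch, 1), nn.ReLU(inplace=True))
        self.expand3 = nn.Sequential(nn.Conv2d(squeeze_ch, expand_ch, 3, padding=1), nn.ReLU(inplace=True))

    def forward(self, x):
        s = self.squeeze(x)
        return torch.cat([self.expand1(s), self.expand3(s)], dim=1)

class TinySFVSegNet(nn.Module):
    """Toy encoder-decoder over an SFV tensor (B, 5, H, W) with a skip connection."""
    def __init__(self, in_ch=5, num_classes=4):
        super().__init__()
        self.enc1 = Fire(in_ch, 16, 32)                     # low-level features, 64 channels
        self.pool = nn.MaxPool2d(kernel_size=2, stride=2)   # downsample
        self.enc2 = Fire(64, 32, 64)                        # high-level features, 128 channels
        self.up = nn.ConvTranspose2d(128, 64, kernel_size=2, stride=2)  # deconvolution upsampling
        self.head = nn.Conv2d(64, num_classes, kernel_size=1)

    def forward(self, x):
        f1 = self.enc1(x)               # full-resolution low-level features
        f2 = self.enc2(self.pool(f1))   # half-resolution semantic features
        up = self.up(f2)                # decode back to full resolution
        return self.head(up + f1)       # skip-layer addition, per-pixel class scores
```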
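The offset prediction module can likewise be sketched as a small CNN that scores a set of candidate horizontal offsets from the original SFV, so that only the predicted best view needs to be rendered and segmented. The candidate offsets, layer sizes, and the classification formulation below are hypothetical assumptions for illustration only.

```python
import torch
import torch.nn as nn

class OffsetPredictor(nn.Module):
    """Toy classifier over candidate viewpoint offsets for an SFV tensor (B, 5, H, W)."""
    def __init__(self, in_ch=5, candidate_offsets=(0.0, 0.5, 1.0, 1.5, 2.0)):
        super().__init__()
        self.candidate_offsets = candidate_offsets
        self.features = nn.Sequential(
            nn.Conv2d(in_ch, 16, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1),                 # global feature vector
        )
        self.classifier = nn.Linear(32, len(candidate_offsets))

    def forward(self, sfv):
        return self.classifier(self.features(sfv).flatten(1))  # one score per candidate

    def best_offset(self, sfv):
        with torch.no_grad():
            idx = self.forward(sfv).argmax(dim=1)    # highest-scoring candidate per sample
        return [self.candidate_offsets[i] for i in idx.tolist()]
```

In a full pipeline, the predicted offset would be fed back into the projection step to render the single shifted SFV that is actually segmented, avoiding the cost of processing the whole multi-view sequence.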
Authors
Zheng Yang, Lin Chunyu, Liao Kang, Zhao Yao, Xue Song
(Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China; China Railway Rolling Stock Corporation Qingdao Sifang Rolling Stock Research Institute Co., Ltd., Qingdao 266031, China)
Source
Journal of Image and Graphics (《中国图象图形学报》)
CSCD
Peking University Core Journals (北大核心)
2021, Issue 10, pp. 2514-2523 (10 pages)
Funding
National Key Research and Development Program of China (2018YFB1201601)
National Natural Science Foundation of China (61772066, 61972028).