Map construction method for unknown environments combining semantic information with 3D point cloud technology (cited by: 1)

The 3D point cloud based semantic information-relevant map construction method for unrecognized scenario


Abstract: Objective During simultaneous localization and mapping (SLAM), a robot needs to make effective use of scene information in unknown, complex environments. To address the insufficient understanding of scene details and the missing map detail in existing SLAM algorithms, this paper constructs a map construction method for unknown environments that combines SLAM point cloud localization with a semantic segmentation network, achieving high-precision 3D map reconstruction. Method First, real-time color information of the scene is used to estimate the camera pose, and a deep learning network that fuses spatial multi-scale sparse and dense features, HieSemNet (hierarchical semantic network), is constructed to semantically segment the unknown scene and obtain real-time 2D semantic information. Second, depth information and the camera pose are used to estimate the spatial point cloud, and the 2D semantic segmentation is fused with the 3D point cloud so that the segmentation results map to the corresponding spatial positions in the point cloud, yielding a high-precision point cloud map with semantic information and realizing 3D map reconstruction. Result To verify the method, validation experiments were run on both the HieSemNet network and the semantic SLAM system. The results show that the network achieves good mean pixel accuracy and mean intersection over union: its MPA (mean pixel accuracy) is higher than that of the other networks by 17.47%, 11.67%, 4.86%, 2.90% and 0.44%, respectively, and its MIoU (mean intersection over union) is higher by 13.94%, 1.10%, 6.28%, 2.28% and 0.62%, respectively. The proposed SLAM algorithm obtains more mapping information, and the maps it builds are more precise and accurate. Conclusion The method takes full account of segmentation quality for objects of different sizes, and the proposed HieSemNet effectively improves scene semantic segmentation accuracy. In addition, compared with existing state-of-the-art semantic SLAM systems, the method markedly improves the precision and accuracy of mapping and produces higher-quality maps.

Extended abstract: Objective With the continuing development of computer technology and artificial intelligence, intelligent robots have advanced rapidly. Simultaneous localization and mapping (SLAM) is an effective technique for a robot to perceive scene information. In SLAM, a robot starts moving from an unknown position in an unknown environment, localizes itself from the map features it observes, and then builds a complete map of the scene from its own pose and trajectory. The environment map built by traditional SLAM lacks semantic information, which limits the robot's ability to recognize the scene. To perceive increasingly complex scenes, some researchers have introduced deep learning methods into SLAM systems to recognize objects in the scene. However, insufficient scene recognition and incomplete map building remain challenging problems. Robots performing SLAM tasks must explore unknown environments and make effective use of the scene information in complex environments. Existing SLAM algorithms understand scene details insufficiently and lose detail during map building, while existing semantic segmentation algorithms perform poorly on multi-scale objects, segment slowly and produce indistinct segmentation images. Our research objectives are therefore to improve the ability of the semantic segmentation algorithm to recognize multi-scale objects and to improve the accuracy and precision of map construction with semantic SLAM. We construct a map construction method for unknown environments that combines SLAM point cloud localization with a semantic segmentation network; it effectively identifies objects of different sizes in the scene and realizes high-precision 3D map reconstruction.

Method We design a deep learning semantic segmentation network that fuses spatial multi-scale sparse and dense features, called the hierarchical semantic network (HieSemNet). A spatial pyramid module uses dilated convolutions with different dilation rates, and this multi-scale structure extracts features that capture global contextual information. The network consists of two branches: the feature extraction base network and the spatial pyramid module. The semantic labels are used separately at the different scales of the two branches to supervise training and compute the loss function. The final feature map is generated by weighted fusion of the feature maps of the two branches. The semantic segmentation network is then applied to the SLAM system, where map construction is completed by three modules: tracking, local mapping and loop closing. The tracking module extracts ORB (oriented FAST and rotated BRIEF) features from the image sequences acquired by the RGB-D camera, determines key frames based on the ORB feature point pairs between frames and performs camera pose estimation. The local mapping module further filters the inserted key frames, then calculates and filters the map points associated with the key frames. The loop closing module optimizes and updates the generated maps. The algorithm proceeds as follows. First, it uses the real-time color information of the scene captured by the RGB-D camera to estimate the camera pose and compute the trajectory. Next, HieSemNet semantically segments the unknown scene to obtain real-time 2D semantic information. Then, the spatial point cloud is estimated from depth information and camera poses, and an octree of the spatial relations of the point cloud is constructed. Finally, the 2D semantic segmentation information is fused with the 3D point cloud information so that the segmentation results correspond to the matching spatial positions in the octree, which builds a high-precision point cloud map with semantic information and realizes 3D map reconstruction.

Result To verify the effectiveness of the proposed method, validation experiments are conducted on HieSemNet and on the semantic SLAM system. HieSemNet is compared with other state-of-the-art networks, namely the fully convolutional network (FCN), segmentation network (SegNet), pyramid scene parsing network (PSPNet), DeepLabv3 and segmentation transformer (SETR), in terms of segmentation accuracy on the classical semantic segmentation dataset ADE20k. The experimental results show that the proposed network performs well in mean pixel accuracy (MPA) and mean intersection over union (MIoU). Because HieSemNet obtains a large receptive field through dilated convolution without losing much detail, it segments both large and small objects more accurately. Compared with the networks above, its MPA is higher by 17.47%, 11.67%, 4.86%, 2.90% and 0.44%, respectively, and its MIoU is higher by 13.94%, 1.10%, 6.28%, 2.28% and 0.62%, respectively. The proposed SLAM algorithm is tested on office scenes and warehouse scenes of the TUM RGB-D dataset and on a natural environment. The paper presents the map building process, the trajectory accuracy and the absolute trajectory error for the three scenes. The comparative results show that the constructed maps contain more mapping information and fewer blank or erroneous regions, that object contours and positions in the maps are more accurate, and that small, cluttered objects have less adverse effect. The maps therefore represent the actual scene more accurately.

Conclusion The method takes full account of the segmentation quality for objects of different sizes, and the proposed HieSemNet can improve scene semantic segmentation accuracy.
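The fusion step described in the abstract (lifting each labelled pixel into 3D using depth and the camera pose) can be illustrated with a minimal sketch. It assumes a pinhole camera model and a camera-to-world pose given as a rotation matrix and translation vector; the function name `backproject_with_labels` and all parameter names are hypothetical, and the octree indexing used in the paper is not reproduced here.

```python
from typing import List, Sequence, Tuple

# (x, y, z, class_id): a world-frame point carrying its semantic label.
Point = Tuple[float, float, float, int]


def backproject_with_labels(
    depth: Sequence[Sequence[float]],
    labels: Sequence[Sequence[int]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    rotation: Sequence[Sequence[float]],
    translation: Sequence[float],
) -> List[Point]:
    """Lift every pixel with valid depth to a labelled 3D world point.

    Assumptions (not taken from the paper): pinhole intrinsics
    (fx, fy, cx, cy), depth in metric units, and a camera-to-world
    pose expressed as a 3x3 rotation and a 3-vector translation.
    """
    points: List[Point] = []
    for v, (depth_row, label_row) in enumerate(zip(depth, labels)):
        for u, (z, class_id) in enumerate(zip(depth_row, label_row)):
            if z <= 0.0:
                # Invalid or missing depth reading: no 3D point for this pixel.
                continue
            # Pixel (u, v) with depth z -> point in the camera frame.
            cam = ((u - cx) * z / fx, (v - cy) * z / fy, z)
            # Camera frame -> world frame: p_w = R * p_c + t.
            world = [
                sum(rotation[i][j] * cam[j] for j in range(3)) + translation[i]
                for i in range(3)
            ]
            points.append((world[0], world[1], world[2], class_id))
    return points
```

Calling the function once per key frame and concatenating the results would give a labelled point cloud; a real system would additionally merge duplicate observations, for example by voting on the label stored in each octree leaf.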
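The two segmentation metrics reported in the abstract, MPA and MIoU, are conventionally computed from a confusion matrix. The sketch below follows those standard definitions; the function names are hypothetical, and the choice to skip classes that never occur is an assumption, since the abstract does not state how the paper handles them.

```python
from typing import List, Sequence


def confusion_matrix(
    y_true: Sequence[int], y_pred: Sequence[int], num_classes: int
) -> List[List[int]]:
    """cm[t][p] counts pixels of true class t predicted as class p."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_pixel_accuracy(cm: Sequence[Sequence[int]]) -> float:
    """Average over classes of (correct pixels / ground-truth pixels)."""
    per_class = [row[i] / sum(row) for i, row in enumerate(cm) if sum(row) > 0]
    return sum(per_class) / len(per_class) if per_class else 0.0


def mean_iou(cm: Sequence[Sequence[int]]) -> float:
    """Average over classes of TP / (TP + FP + FN)."""
    n = len(cm)
    ious = []
    for i in range(n):
        tp = cm[i][i]
        fn = sum(cm[i]) - tp                       # true class i, predicted otherwise
        fp = sum(cm[j][i] for j in range(n)) - tp  # predicted i, true class differs
        if tp + fp + fn > 0:                       # skip classes absent from both
            ious.append(tp / (tp + fp + fn))
    return sum(ious) / len(ious) if ious else 0.0
```

Both functions take flattened label lists, so a 2D segmentation mask would be flattened row by row before calling `confusion_matrix`.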

Authors: Ma Miao; Liu Peimin; Pan Haipeng (School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou 310018, China)

Affiliation: School of Information Science and Engineering, Zhejiang Sci-Tech University

Source: Journal of Image and Graphics (CSCD; Peking University Core Journal), 2023, No. 8, pp. 2432-2446 (15 pages)

Funding: Natural Science Foundation of Zhejiang Province (LQ19F030014).

Keywords: simultaneous localization and mapping (SLAM); semantic segmentation; semantic three-dimensional map; spatial multiscale features

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

