融合PointNet和3D-LMNet的单幅图像三维重建及语义分割被引量：2

3D Reconstruction and Semantic Segmentation Method Combining PointNet and 3D-LMNet from Single Image

导出

摘要由单幅图像重建三维结构并感知三维对象的语义理解极具挑战性。针对单幅图像难以直接生成三维重建点云问题,提出一种融合PointNet与3D-LMNet的联合优化网络模型进行三维重建并完成语义分割。基于3D-LMNet网络进行训练生成三维点云,并完成局部分割,同时,对网络损失函数进行联合优化来预测分割点云。通过分割点云的语义信息改善重建效果,生成带有语义分割信息的三维点云重建模型。针对联合训练中真值点云和预测点云类别标签无点对点的对应关系问题,引入联合优化损失函数来提高重建和分割效果,生成最终三维重建模型。通过在ShapeNet数据集上实验验证,并与PointNet和3D-LMNet单独训练相比,所提模型在平均交并比(mIoU)上提高了4.23%,在倒角距离(CD)和EMD(earth mover’s distance)上分别降低了7.97%和6.04%,联合优化网络明显改善了重建和分割的点云模型。 It is very challenging to reconstruct the 3D structure from a single image and perceive the semantic information of 3D objects.Aiming at the problem that it is difficult to directly generate a 3D reconstruction model from a single image input,a joint optimization network model combining PointNet and 3DLMNet is proposed for single image 3D reconstruction and semantic segmentation.First,a 3D point cloud is generated by training based on the 3DLMNet network,and then local segmentation is performed.Meanwhile,the network loss function is jointly optimized to predict the segmented 3D point cloud.Then,the reconstruction effect is improved through the semantics information of segmented point cloud,and a 3D point cloud reconstruction model is generated with semantic segmentation information.Finally,in view of the problem that there is no pointtopoint correspondence between the true value point cloud and the predicted point cloud category label during the joint training,the joint optimization loss function is introduced into the joint optimization network to improve the reconstruction and segmentation effect,and the 3D reconstructed model is made.Through verification on the ShapeNet dataset,and comparation with PointNet and 3DLMNet training,the model in this paper improves mean intersection over union(mIoU)by 4.23%,and reduces chamfer distance(CD)and earth mover’s distance(EMD)by 7.97%and 6.04%,respectively.The joint optimization network significantly improves the reconstruction and segmented point cloud model.

作者陈辉童勇朱莉梁维斌 Chen Hui;Tong Yong;Zhu Li;Liang Weibin(School of Automation Engineering,Shanghai University of Electric Power,Shanghai 200090,China;Open AI Lab(Shanghai)Co.,Ltd.,Shanghai 200233,China)

机构地区上海电力大学自动化工程学院开放智能机器(上海)有限公司

出处《激光与光电子学进展》 CSCD 北大核心 2022年第18期304-311,共8页 Laser & Optoelectronics Progress

基金国家自然科学基金(51705304) 上海市自然科学基金面上项目(20ZR1421300) 上海市浦江人才计划项目(21PJD025)。

关键词深度学习单幅图像联合优化三维重建语义分割 deep learning single image joint optimization 3D reconstruction semantic segmentation

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1佟帅,徐晓刚,易成涛,邵承永.基于视觉的三维重建技术综述[J].计算机应用研究,2011,28(7):2411-2417. 被引量：116
2郑远攀,李广阳,李晔.深度学习在图像识别中的应用研究综述[J].计算机工程与应用,2019,55(12):20-36. 被引量：404
3杨斌,钟金英.卷积神经网络的研究进展综述[J].南华大学学报（自然科学版）,2016,30(3):66-72. 被引量：34
4陈加,张玉麒,宋鹏,魏艳涛,王煜.深度学习在基于单幅图像的物体三维重建中的应用[J].自动化学报,2019,45(4):657-668. 被引量：27
5董鹏辉,柯良军.基于图像的三维重建技术综述[J].无线电通信技术,2019,45(2):115-119. 被引量：22
6朱育正,张亚萍,冯乔生.基于深度学习的单视图彩色三维重建[J].激光与光电子学进展,2021,58(14):199-207. 被引量：8
7张爱武,刘路路,张希珍.道路三维点云多特征卷积神经网络语义分割方法[J].中国激光,2020,47(4):261-269. 被引量：19
8胡涛,李卫华,秦先祥.基于多层深度特征融合的极化合成孔径雷达图像语义分割[J].中国激光,2019,46(2):244-250. 被引量：13
9徐聪,王丽.基于改进DeepLabv3+网络的图像语义分割方法[J].激光与光电子学进展,2021,58(16):217-224. 被引量：21
10鲍海龙,万敏,刘忠祥,秦勉,崔浩宇.基于区域自我注意力的实时语义分割网络[J].激光与光电子学进展,2021,58(8):196-202. 被引量：8

二级参考文献159

1贺美芳,周来水,神会存.散乱点云数据的曲率估算及应用[J].南京航空航天大学学报,2005,37(4):515-519. 被引量：27
2李秀智,张广军.一种基于边缘线的三目立体匹配方法[J].光电工程,2007,34(2):22-26. 被引量：3
3HORN B. Shape from shading: a method for obtaining the shape of a smooth opaque object from one view[ D ]. Cambridge:[ s. n. ], 1970.
4BELHUMEUR P, KRIEGMAN D, YUILLE A. The bas-relief ambiguity[ J ]. International ,Journal of Computer Vision, 1999,35 ( 1 ) : 33-44.
5BAKSHI S,YANG Y. Shape from shading for non-lambertian surfaces [ C]//Proc of International Conference on Image Processing. 1994: 130-134.
6PENNA M. A shape from shading analysis for a single perspective image of a polyhedron [ J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 1989,11 ( 6 ) :545-554.
7VOGEL O, BREUB M, WEICKERT J. Perspective shape from shading with non-lambertian reflectance [ C ]//Proc of DAGM Symposium on Pattern Recognition. Berlin : Springer, 2008 : 517-526.
8ECKER A, JEPSON A D. Polynomial shape from shading [ C ]//Proc of IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2010.
9WOODHAM R J. Photometric method for determining surface orientation from multiple images [ J ]. Optical Engineering, 1980,19 ( 1 ) :139-144.
10NOAKES L, KOZERA R. Nonlinearities and noise reduction in 3source photometric stereo [ J ]. Journal of Mathematical Imaging and Vision,2003,18 ( 2 ) : 119-127.