With the increasing popularity of 3D sensors(e.g.,Kinect)and light field cameras,technologies such as driverless,smart home and virtual reality have become hot spots for engineering applications.As an important part o...With the increasing popularity of 3D sensors(e.g.,Kinect)and light field cameras,technologies such as driverless,smart home and virtual reality have become hot spots for engineering applications.As an important part of 3D vision tasks,point cloud semantic segmentation has received a lot of attention from researchers.In this work,we focus on realistically collected indoor point cloud data and propose a point cloud semantic segmentation method based on PAConv and SE_variant.The SE_variant module captures global perception from a broad perspective of feature space by fusing different pooling methods,which fully utilize the channel information of point clouds.The effectiveness of the method is verified by comparing with other methods on S3DIS and ScanNetV2 semantic tagging benchmarks,and achieving 65.3%mIoU in S3DIS,47.6%mIoU in ScanNetV2.The results of the ablation experiments verify the effectiveness of the key modules and analyze how to use the attention mechanism to improve the 3D semantic segmentation performance.展开更多
文摘With the increasing popularity of 3D sensors(e.g.,Kinect)and light field cameras,technologies such as driverless,smart home and virtual reality have become hot spots for engineering applications.As an important part of 3D vision tasks,point cloud semantic segmentation has received a lot of attention from researchers.In this work,we focus on realistically collected indoor point cloud data and propose a point cloud semantic segmentation method based on PAConv and SE_variant.The SE_variant module captures global perception from a broad perspective of feature space by fusing different pooling methods,which fully utilize the channel information of point clouds.The effectiveness of the method is verified by comparing with other methods on S3DIS and ScanNetV2 semantic tagging benchmarks,and achieving 65.3%mIoU in S3DIS,47.6%mIoU in ScanNetV2.The results of the ablation experiments verify the effectiveness of the key modules and analyze how to use the attention mechanism to improve the 3D semantic segmentation performance.