基于深度与实例分割融合的单目3D目标检测方法

Monocular 3D object detection method integrating depth and instance segmentation

下载PDF

导出

摘要针对单目3D目标检测在视角变化引起的物体大小变化以及物体遮挡等情况下效果不佳的问题,提出一种融合深度信息和实例分割掩码的新型单目3D目标检测方法。首先,通过深度-掩码注意力融合(DMAF)模块,将深度信息与实例分割掩码结合,以提供更准确的物体边界;其次,引入动态卷积,并利用DMAF模块得到的融合特征引导动态卷积核的生成,以处理不同尺度的物体;再次,在损失函数中引入2D-3D边界框一致性损失函数,调整预测的3D边界框与对应的2D检测框高度一致,以提高实例分割和3D目标检测任务的效果;最后,通过消融实验验证该方法的有效性,并在KITTI测试集上对该方法进行验证。实验结果表明,与仅使用深度估计图和实例分割掩码的方法相比,在中等难度下对车辆类别检测的平均精度提高了6.36个百分点,且3D目标检测和鸟瞰图目标检测任务的效果均优于D4LCN(Depth-guided Dynamic-Depthwise-Dilated Local Convolutional Network)、M3D-RPN(Monocular 3D Region Proposal Network)等对比方法。 To address the limitations of monocular 3D object detection,when encountering changing object size due to changing perspective and occlusion,a new monocular 3D object detection method was proposed fusing depth information with instance segmentation masks.Firstly,with the help of the Depth-Mask Attention Fusion(DMAF)module,depth information was combined with instance segmentation masks to provide more accurate object boundaries.Secondly,dynamic convolution was introduced,and the fused features obtained from the DMAF module were used to guide the generation of dynamic convolution kernels for dealing with objects of different scales.Moreover,a 2D-3D bounding box consistency loss function was introduced into loss function,adjusting the predicted 3D bounding box to highly coincide with corresponding 2D detection box,thereby enhancing performance in instance segmentation and 3D object detection tasks.Lastly,the effectiveness of the proposed method was confirmed through ablation studies and validated on the KITTI test set.The results indicate that,compared to methods using only depth estimation maps and instance segmentation masks,the proposed method improves the average accuracy of vehicle detection under medium difficulty by 6.36 percentage points,and it outperforms comparative techniques like D4LCN(Depth-guided Dynamic-Depthwise-Dilated Local Convolutional Network)and M3D-RPN(Monocular 3D Region Proposal Network)in both 3D object detection and aerial view object detection tasks.

作者孙逊冯睿锋陈彦如 SUN Xun;FENG Ruifeng;CHEN Yanru(Line Station Design and Research Institute,China Railway Siyuan Survey and Design Group Company Limited,Wuhan Hubei 430063,China;College of Economics and Management,Southwest Jiaotong University,Chengdu Sichuan 610031,China)

机构地区中铁第四勘察设计院集团有限公司线路站场设计研究院西南交通大学经济管理学院

出处《计算机应用》 CSCD 北大核心 2024年第7期2208-2215,共8页 journal of Computer Applications

基金国家自然科学基金资助项目(62173279)。

关键词单目3D目标检测深度学习动态卷积实例分割 monocular 3D object detection deep learning dynamic convolution instance segmentation

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1周静,胡怡宇,胡成玉,王天江.基于点云补全和多分辨Transformer的弱感知目标检测方法[J].计算机应用,2023,43(7):2155-2165. 被引量：2
2王凤随,熊磊,钱亚萍.联合实例深度的多尺度单目3D目标检测算法[J].激光与光电子学进展,2023,60(16):230-238. 被引量：2

二级参考文献7

1刘芳,吴志威,杨安喆,韩笑.基于多尺度特征融合的自适应无人机目标检测[J].光学学报,2020,40(10):127-136. 被引量：33
2鞠默然,罗江宁,王仲博,罗海波.融合注意力机制的多尺度目标检测算法[J].光学学报,2020,40(13):126-134. 被引量：44
3裴仪瑶,郭会明,张丹普,陈文博.基于定位不确定性的鲁棒3D目标检测方法[J].计算机应用,2021,41(10):2979-2984. 被引量：5
4赵亮,胡杰,刘汉,安永鹏,熊宗权,王宇.基于语义分割的深度学习激光点云三维目标检测[J].中国激光,2021,48(17):171-183. 被引量：36
5胡杰,刘汉,徐文才,赵亮.基于三维激光雷达的道路障碍物目标位姿检测算法[J].中国激光,2021,48(24):158-168. 被引量：20
6孙刘杰,赵进,王文举,张煜森.多尺度Transformer激光雷达点云3D物体检测[J].计算机工程与应用,2022,58(8):136-146. 被引量：2
7龚威,史硕,陈博文,宋沙磊,吴德成,刘东,刘正军,廖梅松.机载高光谱激光雷达成像技术发展与应用[J].光学学报,2022,42(12):21-32. 被引量：34

共引文献2

1张莹,蒋亮亮,张东波,段万林,孙月.基于改进SECOND算法的点云三维目标检测[J].激光与光电子学进展,2024,61(8):126-135.
2宋凯瑞,汤慧.基于加权FCM和法向离群因子文物点云去噪[J].信息技术与信息化,2024(9):50-54.

1Haokun Yuan,Ruiqin Fang,Chi Fu,Shuo Wang,Xiaoqin Tong,Deyi Feng,Xiaoqing Wei,Xirong Hu,Yuan Wang.ATIP/ATIP1 regulates prostate cancer metastasis through mitochondrial dynamic-dependent signaling[J].Acta Biochimica et Biophysica Sinica,2024,56(2):304-314.
2刘光辉,王秦蒙,孟月波,陈廷廷,张娅琳.特征引导的多模态聚合低光环境行为识别方法[J].控制与决策,2024,39(7):2305-2314.
3李冬,张智.结合多尺度融合和图匹配的行人重识别[J].计算机工程与设计,2024,45(7):2180-2186.
4王林,刘艺琪,顾启馨,张驰,徐蕾,王蕾,陈翠英,刘学恩,赵鸿,庄辉.血清N-聚糖生物标志物诊断ALT水平正常慢性乙型肝炎患者显著肝纤维化和肝硬化的临床意义[J].Engineering,2023(7):151-158. 被引量：1
5杨燕珍,孙旭,李嘉旸,邹世清,傅永建.利用双磁极平面磁力研磨法对单晶硅表面的抛光实验研究[J].机械科学与技术,2024,43(6):1048-1055.
6黄丹丹,王菲,刘智,高晗,王惠绩.基于Retinex-Net网络模型的渐晕图像校正[J].液晶与显示,2024,39(7):929-938.
7Anni Luo,Jian-Xiang Liu.Rescuing the Golgi from heat damages by ATG8:restoration rather than clean-up[J].Stress Biology,2023,3(1):219-221.
8Puyuan Wen,Chao Ren.Research progress on intranasal treatment forParkinson'sdisease[J].Neuroprotection,2024,2(2):79-99.
9Mariola Olkowicz,Khaled Ramadan,Hernando Rosales-Solano,Miao Yu,Aizhou Wang,Marcelo Cypel,Janusz Pawliszyn.Mapping the metabolic responses to oxaliplatin-based chemotherapy with in vivo spatiotemporal metabolomics[J].Journal of Pharmaceutical Analysis,2024,14(2):196-210.

计算机应用

2024年第7期

浏览历史

内容加载中请稍等...

基于深度与实例分割融合的单目3D目标检测方法

参考文献2

二级参考文献7

共引文献2

相关作者

相关机构

相关主题

浏览历史