期刊文献+

Adaptive multi-modal feature fusion for far and hard object detection

自适应性多模态特征融合的远小困难目标检测
下载PDF
导出
摘要 In order to solve difficult detection of far and hard objects due to the sparseness and insufficient semantic information of LiDAR point cloud,a 3D object detection network with multi-modal data adaptive fusion is proposed,which makes use of multi-neighborhood information of voxel and image information.Firstly,design an improved ResNet that maintains the structure information of far and hard objects in low-resolution feature maps,which is more suitable for detection task.Meanwhile,semantema of each image feature map is enhanced by semantic information from all subsequent feature maps.Secondly,extract multi-neighborhood context information with different receptive field sizes to make up for the defect of sparseness of point cloud which improves the ability of voxel features to represent the spatial structure and semantic information of objects.Finally,propose a multi-modal feature adaptive fusion strategy which uses learnable weights to express the contribution of different modal features to the detection task,and voxel attention further enhances the fused feature expression of effective target objects.The experimental results on the KITTI benchmark show that this method outperforms VoxelNet with remarkable margins,i.e.increasing the AP by 8.78%and 5.49%on medium and hard difficulty levels.Meanwhile,our method achieves greater detection performance compared with many mainstream multi-modal methods,i.e.outperforming the AP by 1%compared with that of MVX-Net on medium and hard difficulty levels.
作者 LI Yang GE Hongwei 李阳;葛洪伟(江南大学江苏省模式识别与计算智能实验室,江苏无锡214122;江南大学人工智能与计算机学院,江苏无锡214122)
出处 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2021年第2期232-241,共10页 测试科学与仪器(英文版)
基金 National Youth Natural Science Foundation of China(No.61806006) Innovation Program for Graduate of Jiangsu Province(No.KYLX160-781) Jiangsu University Superior Discipline Construction Project。
关键词 3D object detection adaptive fusion multi-modal data fusion attention mechanism multi-neighborhood features 3D目标检测 自适应性融合 多模态数据融合 注意力机制 多邻域特征
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部