Multi-Scale Feature Fusion Saliency Object Detection Based on RGB-D Images

Abstract: Salient object detection is a fundamental problem in computer vision. Many current deep-learning-based saliency detection methods combine RGB images and depth maps through input fusion or result fusion, but these strategies do not fuse the feature maps effectively. To improve the performance of salient object detection, this paper proposes a multi-scale feature fusion method for salient object detection in RGB-D images. The model consists of two feature encoders, two feature decoders, and a cross-modal multi-scale feature interleaved fusion module. The two encoders, one for the RGB image and one for the depth map, are ResNet50 networks pre-trained on the ImageNet dataset. The decoders decode the encoders' outputs at five different scales. The cross-modal multi-scale feature interleaved fusion module fuses the feature maps of different scales extracted by the decoders and encoders; the fusion results from the five levels are then concatenated and reduced in dimension to produce the final saliency prediction map. The model is compared with ten representative earlier models on four public saliency datasets. Relative to the second-best model on each dataset, the S-measure increases by 0.391% on average, the MAE decreases by 0.330% on average, and the F-measure is reported as decreasing by 0.405% on average. In summary, the proposed multi-scale feature fusion model abandons the earlier input-fusion and result-fusion schemes in favor of feature-level fusion, interleaving shallow and deep features separately. Experiments show that the proposed method performs better than previous methods.
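Two of the metrics named in the abstract, MAE and F-measure, can be illustrated with a minimal sketch. The abstract does not give the paper's evaluation protocol, so the fixed binarization threshold of 0.5 and the weight beta^2 = 0.3 (a common convention in salient object detection) are assumptions, and the function names `mae` and `f_measure` are hypothetical. S-measure is omitted because it needs additional structural terms that the abstract does not describe.

```python
def mae(pred, gt):
    """Mean absolute error between a saliency map and its ground truth.

    Both arguments are 2-D lists of values in [0, 1]. Lower is better.
    """
    total = 0.0
    count = 0
    for p_row, g_row in zip(pred, gt):
        for p, g in zip(p_row, g_row):
            total += abs(p - g)
            count += 1
    return total / count


def f_measure(pred, gt, threshold=0.5, beta_sq=0.3):
    """F-measure of a thresholded saliency map. Higher is better.

    threshold and beta_sq are assumed values, not taken from the paper.
    """
    tp = fp = fn = 0
    for p_row, g_row in zip(pred, gt):
        for p, g in zip(p_row, g_row):
            predicted = p >= threshold  # pixel predicted as salient
            actual = g >= 0.5           # pixel salient in the ground truth
            if predicted and actual:
                tp += 1
            elif predicted:
                fp += 1
            elif actual:
                fn += 1
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Toy 2x2 example: one salient pixel, one false positive.
pred = [[0.9, 0.2], [0.6, 0.1]]
gt = [[1.0, 0.0], [0.0, 0.0]]
print(round(mae(pred, gt), 4))
print(round(f_measure(pred, gt), 4))
```

In the toy example the prediction has one true positive and one false positive, so precision is 0.5 and recall is 1. Note that MAE is a lower-is-better metric, while S-measure and F-measure are higher-is-better.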

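Authors: WANG Zhen; YU Wanjun; CHEN Ying (School of Computer Science and Information Engineering, Shanghai Institute of Technology, Shanghai 201418, China)

Affiliation: School of Computer Science and Information Engineering, Shanghai Institute of Technology

Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2024, No. 11, pp. 242-250 (9 pages)

Funding: National Natural Science Foundation of China (61976140).

Keywords: saliency object detection (SOD); multimodal image fusion; multi-path collaborative prediction; multiscale features

Classification number: TP391.41 [Automation and Computer Technology: Computer Application Technology]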

