
RGB-D Saliency Detection via Depth Quality Perception and Hierarchical Feature Guidance
Abstract: Existing fusion-based RGB-D salient object detection methods ignore the differences between RGB and depth-map features when fusing cross-modal features. This imbalance in cross-modal feature fusion prevents the model from fully exploiting complementary cross-modal features, and low-quality depth maps further degrade model performance. This paper proposes an RGB-D salient object detection algorithm based on depth quality perception and hierarchical feature guidance. The algorithm consists of two stages: a depth quality perception stage and a hierarchical feature guidance stage. In the first stage, depth quality perception is used to mine high-quality depth maps from existing mainstream RGB-D salient object detection training datasets and thereby enhance the training sets, improving the quality of low-quality depth maps and reducing the damage that noisy data inflicts on model performance. In the second stage, a feature-guidance network performs hierarchical adaptive-weight dynamic fusion of the RGB image and the depth map, which increases fusion efficiency while strengthening the perception ability of cross-modal fusion. Experimental results on five benchmark datasets (NJUD, NLPR, SSD, STEREO, and SIP) show that, compared with methods such as SSF, CDNet, D3Net, and DASNet, the proposed algorithm substantially improves depth-map quality; on the NLPR dataset it achieves an F-Measure of 0.934 with an MAE of only 0.020. Its overall performance surpasses that of other related SOTA methods, demonstrating the effectiveness of first mining high-quality depth maps and then performing cross-modal adaptive dynamic fusion.
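The paper's actual network architecture is not given on this page, but the hierarchical adaptive-weight dynamic fusion the abstract describes can be illustrated with a minimal sketch: at each pyramid level, derive one weight per modality (here from a crude global-pooling score standing in for a learned quality gate, an assumption of this sketch) and blend the RGB and depth feature maps accordingly.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def adaptive_fuse(rgb_feats, depth_feats):
    """Toy hierarchical adaptive-weight fusion: for each pyramid level,
    compute two scalar weights (summing to 1) from globally pooled
    activations and take the weighted blend of the two feature maps."""
    fused = []
    for f_rgb, f_d in zip(rgb_feats, depth_feats):
        # global average pooling as a stand-in "quality" score per modality
        scores = np.array([f_rgb.mean(), f_d.mean()])
        w_rgb, w_d = softmax(scores)  # adaptive weights, w_rgb + w_d = 1
        fused.append(w_rgb * f_rgb + w_d * f_d)
    return fused

# three pyramid levels of toy feature maps (coarser at each level)
rgb = [np.random.rand(8, 8), np.random.rand(4, 4), np.random.rand(2, 2)]
dep = [np.random.rand(8, 8), np.random.rand(4, 4), np.random.rand(2, 2)]
out = adaptive_fuse(rgb, dep)
print([f.shape for f in out])  # [(8, 8), (4, 4), (2, 2)]
```

Because the weights sum to 1, each fused map is an element-wise convex combination of the two modalities, so a level dominated by a noisy depth map is automatically down-weighted; in the paper this gating would be learned rather than pooled.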
Authors: SONG Mengke; ZHENG Yuanchao; CHEN Chenglizhao (College of Computer Science and Technology, Qingdao University, Qingdao 266071, Shandong, China)
Source: Computer Engineering (《计算机工程》; indexed in CAS, CSCD, and the Peking University Core list), 2023, Issue 5, pp. 255-261, 268 (8 pages)
Funding: Doctoral Program of the Natural Science Foundation of Shandong Province (ZR2019BF011).
Keywords: depth quality perception; feature guidance; cross-modal fusion; hierarchical fusion; RGB-D saliency detection