期刊文献+

基于空间结构化推理深度融合网络的RGB-D场景解析 被引量:4

RGB-D Scene Parsing Based on Spatial Structured Inference Deep Fusion Networks
下载PDF
导出
摘要 为了弥补RGB-D场景解析中卷积神经网络空间结构化学习能力的不足,本文基于深度学习提出空间结构化推理深度融合网络,内嵌的结构化推理层有机地结合条件随机场和空间结构化推理模型,该层能够较为全面而准确地学习物体所处三维空间的物体分布以及物体间的三维空间位置关系.在此基础上,网络的特征融合层巧妙地利用深度置信网络和改进的条件随机场,该层可以根据融合生成的物体综合语义信息和物体间语义相关性信息完成深度结构化学习.实验结果表明,在标准RGB-D数据集NYUDv2和SUNRGBD上,空间结构化推理深度融合网络分别实现最优的平均准确率53.8%和54.6%,从而有助于实现机器人任务规划、车辆自动驾驶等智能计算机视觉任务. In order to make up the drawbacks that convolutional neural networks lack the ability of spatial structured learning in RGB-D scene parsing,we propose spatial structured inference deep fusion networks (SSIDFNs) on the basis of deep learning,the embedded structural inference layer organically combines conditional random fields (CRFs) and spatial structured inference model,which is able to learn the three-dimensional spatial distributions of objects and three-dimensional spatial relationships among objects in a more comprehensive and accurate way.Furthermore,the feature fusion layer takes both advantages of deep belief networks and improved CRFs,which is able to achieve deep structured learning according to the comprehensive semantic information of objects and semantic correlation information among objects.The experimental results demonstrate that the proposed SSIDFNs achieve the best mean accuracy 53.8% and 54.6% on the standard RGB-D datasets NYUDv2 and SUNRGBD respectively,which will be helpful to implement intelligent computer vision tasks,such as robot task planning and self-driving cars.
作者 王泽宇 吴艳霞 张国印 布树辉 WANG Ze-yu;WU Yan-xia;ZHANG Guo-yin;BU Shu-hui(College of Computer Science and Technology,Harbin Engineering University,Harbin,Heilongjiang 150001,China;School of Aeronautics,Northwestern Polytechnical University,Xi'an,Shaanxi 710072,China)
出处 《电子学报》 EI CAS CSCD 北大核心 2018年第5期1253-1258,共6页 Acta Electronica Sinica
基金 国家重点研发计划(No.2016YFB1000400) 哈尔滨市杰出青年人才基金(No.2017RAYXJ016) 中央高校自由探索基金(No.HEUCF170605) 国家自然科学基金(No.61573284)
关键词 RGBD场景解析 深度学习 卷积神经网络 条件随机场 空间结构化推理模型 深度置信网络 计算机视觉 机器人任务规划 车辆自动驾驶 RGB-D scene parsing deep learning convolutional neural networks conditional random fields spatial structured inference model deep belief networks computer vision robot task planning self-driving cars
  • 相关文献

参考文献1

二级参考文献3

同被引文献15

引证文献4

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部