期刊文献+

基于双流卷积神经网络的RGB-D图像联合检测 被引量:8

Joint Detection of RGB-D Images Based on Double Flow Convolutional Neural Network
原文传递
导出
摘要 当前卷积神经网络结构未能充分考虑RGB图像和深度图像的独立性和相关性,针对其联合检测效率不高的问题,提出了一种新的双流卷积网络。将RGB图像和深度图像分别输入到两个卷积网络中,两个卷积网络结构相同且权值共享,经过数次卷积提取各自独立的特征后,在卷积层根据最优权值对两个卷积网络进行融合;继续使用卷积核提取融合后的特征,最后通过全连接层得到输出。相比于以往卷积网络对RGB-D图像采用的早期融合和后期融合方法,在检测时间相近的情况下,双流卷积网络检测的准确率和成功率分别提高了4.1%和3.5%。 The convolutional neural network structure fails to consider the independence and correlation between RGB images and depth images fully, so its detection is not high. A new double flow convolution network is proposed for the joint detection of RGB-D images. The RGB image and depth image are inputted to the two convolutional networks and the two networks have the same structure and weight sharing. After several convolutions, the independent features are extracted. According to the optimal weights in the convolution layer, the two convolutional networks are fused. The fused features are extracted continuously using convolution kernels, and the output is obtained by full connection layer finally. When the detection time is similar, the detection accuracy and the success rate are increased by 4.1% and 3.5% respectively, compared with the previous early and late fusion methods.
作者 刘帆 刘鹏远 张峻宁 徐彬彬 Liu fan;Liu Pengyuan;Zhang Junning;Xu Binbin(Mechanical Engineering College, Shifiazhuang, Hebei 050003, Chin)
机构地区 军械工程学院
出处 《激光与光电子学进展》 CSCD 北大核心 2018年第2期380-388,共9页 Laser & Optoelectronics Progress
基金 国家自然科学基金(51205405 51305454)
关键词 机器视觉 RGB-D 卷积神经网络 多模态信息 联合检测 深度学习 machine vision RGB-D convolutional neural network multimodal information joint detection depth learning
  • 相关文献

参考文献7

二级参考文献124

  • 1尹潘龙,徐光柱,雷帮军,曹维华.Kinect下深度信息获取技术及其在三维目标识别中的应用综述[J].集成技术,2013,2(6):94-99. 被引量:5
  • 2任翠池,杨淑莹,洪俊.基于BP神经网络的手写字符识别[J].天津理工大学学报,2006,22(4):80-82. 被引量:4
  • 3张斌,赵玮烨,李积宪.基于BP神经网络的手写字符识别系统[J].兰州交通大学学报,2007,26(1):18-20. 被引量:1
  • 4金连文,徐秉铮.基于多级神经网络结构的手写体汉字识别[J].通信学报,1997,18(5):21-27. 被引量:19
  • 5Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60 (2) 91 110.
  • 6Dalai N, Triggs B. Histograms of oriented gradients for human detection[C]//Computer Vision and Pattern Recognition (CVPR), IEEE Computer Society Conference on. San Diego, USA: IEEE, 2005, 1 886-893.
  • 7Hinton G E, Salakhutdinov R R. Reducing the dimensionality of data with neural networks[J]. Science, 2006, 313(5786) : 504-507.
  • 8Hubel D H, Wiesel T N. Receptive fields, binocular interaction and functional architecture in the catrs visual cortex[J]. The Journal of Physiology, 1962, 160(1): 106-154.
  • 9Fukushima K, Miyake S. Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in posi- tion[J]. Pattern Recognition, 1982, 15(6): 455-469.
  • 10Ruck D W, Rogers S K, Kabrisky M. Feature selection using a multilayer perceptron[J]. Journal of Neural Network Com- puting, 1990, 2(2): 40-48.

共引文献686

同被引文献20

引证文献8

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部