期刊文献+

基于卷积-递归神经网络和费舍尔向量的RGB-D物体识别 被引量:1

Object Recognition for RGB-D Images Based on Convolutional-Recursive Neural Network and Fisher Vector
原文传递
导出
摘要 综合利用彩色和深度信息,采用多数据模式的特征提取策略,提出一种基于卷积-递归神经网络和费舍尔向量的RGB-D物体识别方法.对于彩色图像和深度图像,分别利用卷积-递归神经网络和卷积-费舍尔向量-递归神经网络提取物体的纹理及形状特征.为了更加全面的获取物体信息的特征表述,引入了灰度图像和表面法向量作为原始数据的补充,并利用卷积-递归神经网络提取特征.最后,将4种数据模式下提取到的特征融合起来,输入到softmax分类器中实现RGB-D物体识别.在标准的RGB-D数据库中对算法进行验证,所提算法可以有效提高物体识别率. Combining the color and depth information, a novel RGB-D object recognition method for RGB-D images based on convolutional-recursive neural network and fisher vector with multiple modalities extraction strategy is proposed. For the original color image and depth map, the convolutional-recursive neural network and convolutional-fisher vector-recursive neural network are used to exact the texture and shape features respectively. In order to capture more comprehensive features for object recognition, the gray image and the surface normal are introduced in our model, and the convolutional-recursive neural network is utilized to explore the corresponding features. At last, these four features extracted from different data modalities are integrated into the softmax classifier to achieve RGB-D object recognition. The proposed algorithm is verified in the standard RGB-D database. Experimental results show that the proposed algorithm achieves higher recognition rate.
作者 牛力杰 丛润民 倪敏 郑泽勋 陈越 罗晓维 Niu Lijie;Cong Runmin;Ni Min;Zheng Zexun;Chen Yue;Luo Xiaowei(School of Electronic and Information Engineering,Tianjin University,Tianjin 300072,China)
出处 《南开大学学报(自然科学版)》 CAS CSCD 北大核心 2021年第2期63-68,共6页 Acta Scientiarum Naturalium Universitatis Nankaiensis
基金 国家自然科学基金(61271324) 天津市自然科学基金(12JCYBJC10400)。
关键词 物体识别 RGB-D图像 卷积-递归神经网络 费舍尔向量 object recognition RGB-D images convolutional-recursive neural network Fisher vector
  • 相关文献

参考文献2

二级参考文献14

  • 1Makhzani A, Frey B. k-Sparse Autoencoders[J]. arXiv preprint arXiv, 2013:1312-5663.
  • 2Bo L, Ren X, Fox D. Hierarchical Matching Pursuit for Image Classification: Architecture and Fast Algorithms[J]. NIPS, 2011,1(2): 6-6.
  • 3Vincent P, Larochelle H, Bengio ~, et al. Extracting and composing robust features with denoising autoencoders[C]. Proceedings of the 25th international conference on machine learning, ACM, 2008:1096-1103.
  • 4Lee H, Grosse R, Ranganath R, et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations[C]. Proceedings of the 26th Annual International Conference on Machine Learning, ACM, 2009: 609-616.
  • 5Blum M, Springenberg J T, Wulfing J, et al. A learned feature descriptor for object recognition in rgb-d data[C]. Robotics and Automation(ICRA), 2012 IEEE International Conference on IEEE, 2012: 1298-1303.
  • 6Coates A, Ng A Y. The importance of encoding versus training with sparse coding and vector quantization[C]. Proceedings of the 28th International Conference on Machine Leaming(ICML- 11), 2011: 921-928.
  • 7Yu K, Lin Y, Lafferty J. Learning image representations from the pixel level via hierarchical sparse coding[C]. Computer Vision and Pattern Recognition(CVPR), 2011 IEEE Conference on IEEE, 2011: 1713-1720.
  • 8Bo L, Ren X, Fox D. Unsupervised feature learning for RGB-D based object recognition[C]. Experimental Robotics, Springer International Publishing, 2013: 387-402.
  • 9Socher R, Huval B, Bath B P, et al. Convolutional- Recursive Deep Learning for 3D Object Classifica-tion[C]. NIPS, 2012: 665-673.
  • 10吴证,周越,杜春华,袁泉,戈新良.彩色图像人脸特征点定位算法研究[J].电子学报,2008,36(2):309-313. 被引量:10

共引文献2

同被引文献5

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部