基于卷积-递归神经网络和费舍尔向量的RGB-D物体识别被引量：1

Object Recognition for RGB-D Images Based on Convolutional-Recursive Neural Network and Fisher Vector

导出

摘要综合利用彩色和深度信息,采用多数据模式的特征提取策略,提出一种基于卷积-递归神经网络和费舍尔向量的RGB-D物体识别方法.对于彩色图像和深度图像,分别利用卷积-递归神经网络和卷积-费舍尔向量-递归神经网络提取物体的纹理及形状特征.为了更加全面的获取物体信息的特征表述,引入了灰度图像和表面法向量作为原始数据的补充,并利用卷积-递归神经网络提取特征.最后,将4种数据模式下提取到的特征融合起来,输入到softmax分类器中实现RGB-D物体识别.在标准的RGB-D数据库中对算法进行验证,所提算法可以有效提高物体识别率. Combining the color and depth information, a novel RGB-D object recognition method for RGB-D images based on convolutional-recursive neural network and fisher vector with multiple modalities extraction strategy is proposed. For the original color image and depth map, the convolutional-recursive neural network and convolutional-fisher vector-recursive neural network are used to exact the texture and shape features respectively. In order to capture more comprehensive features for object recognition, the gray image and the surface normal are introduced in our model, and the convolutional-recursive neural network is utilized to explore the corresponding features. At last, these four features extracted from different data modalities are integrated into the softmax classifier to achieve RGB-D object recognition. The proposed algorithm is verified in the standard RGB-D database. Experimental results show that the proposed algorithm achieves higher recognition rate.

作者牛力杰丛润民倪敏郑泽勋陈越罗晓维 Niu Lijie;Cong Runmin;Ni Min;Zheng Zexun;Chen Yue;Luo Xiaowei(School of Electronic and Information Engineering,Tianjin University,Tianjin 300072,China)

机构地区天津大学电气自动化与信息工程学院

出处《南开大学学报（自然科学版）》 CAS CSCD 北大核心 2021年第2期63-68,共6页 Acta Scientiarum Naturalium Universitatis Nankaiensis

基金国家自然科学基金(61271324) 天津市自然科学基金(12JCYBJC10400)。

关键词物体识别 RGB-D图像卷积-递归神经网络费舍尔向量 object recognition RGB-D images convolutional-recursive neural network Fisher vector

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1徐楚,金志刚,李东,李云.基于优化的SIFT特征描述子的人脸特征点定位[J].南开大学学报（自然科学版）,2016,49(5):50-56. 被引量：2
2卢良锋,何加铭,谢志军,孙德超.基于深度学习的RGB-D物体识别算法[J].移动通信,2015,39(10):52-56. 被引量：2

二级参考文献14

1Makhzani A, Frey B. k-Sparse Autoencoders[J]. arXiv preprint arXiv, 2013:1312-5663.
2Bo L, Ren X, Fox D. Hierarchical Matching Pursuit for Image Classification: Architecture and Fast Algorithms[J]. NIPS, 2011,1(2): 6-6.
3Vincent P, Larochelle H, Bengio ~, et al. Extracting and composing robust features with denoising autoencoders[C]. Proceedings of the 25th international conference on machine learning, ACM, 2008:1096-1103.
4Lee H, Grosse R, Ranganath R, et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations[C]. Proceedings of the 26th Annual International Conference on Machine Learning, ACM, 2009: 609-616.
5Blum M, Springenberg J T, Wulfing J, et al. A learned feature descriptor for object recognition in rgb-d data[C]. Robotics and Automation(ICRA), 2012 IEEE International Conference on IEEE, 2012: 1298-1303.
6Coates A, Ng A Y. The importance of encoding versus training with sparse coding and vector quantization[C]. Proceedings of the 28th International Conference on Machine Leaming(ICML- 11), 2011: 921-928.
7Yu K, Lin Y, Lafferty J. Learning image representations from the pixel level via hierarchical sparse coding[C]. Computer Vision and Pattern Recognition(CVPR), 2011 IEEE Conference on IEEE, 2011: 1713-1720.
8Bo L, Ren X, Fox D. Unsupervised feature learning for RGB-D based object recognition[C]. Experimental Robotics, Springer International Publishing, 2013: 387-402.
9Socher R, Huval B, Bath B P, et al. Convolutional- Recursive Deep Learning for 3D Object Classifica-tion[C]. NIPS, 2012: 665-673.
10吴证,周越,杜春华,袁泉,戈新良.彩色图像人脸特征点定位算法研究[J].电子学报,2008,36(2):309-313. 被引量：10

共引文献2

1李凌乐,李瑞华.基于RGB-D弹性可形变物体跟踪识别控制披萨厨师机器人方法研究[J].食品与机械,2020,36(2):100-104.
2王羿,姚克明,姜绍忠.基于口罩佩戴情况下的人脸识别方法[J].计算机科学与应用,2022,12(3):739-745. 被引量：2

同被引文献5

1蔡秋艺,陈昱璨,李建林,赵正凯,王燕.静息态低频振幅异常对精神分裂症患者识别研究[J].磁共振成像,2021,12(10):45-48. 被引量：1
2魏志军,刘国才,顾冬冬.基于多级串联深度卷积神经网络配准大形变图像[J].中国医学影像技术,2022,38(4):588-593. 被引量：2
3苏乾,赵睿,杨帆,刘怀贵.基于距离相关功能连接网络的机器学习模型在精神分裂症诊断中的价值[J].国际医学放射学杂志,2022,45(4):380-384. 被引量：2
4薛康康,陈静丽,魏亚蕊,陈苑,韩少强,王彩鸿,张勇,宋学勤,程敬亮.首发精神分裂症患者大尺度脑网络内及网络间静息态功能连接异常模式[J].中国医学影像技术,2023,39(6):829-833. 被引量：2
5邓豪东,王俊易,葛骏一,林放,李梦凡.基于多输入三维卷积神经网络的脑电解码模型[J].现代电子技术,2023,46(19):149-154. 被引量：1

引证文献1

1刘晨宇,周素妙,易芸,黄园园,李荷花,冯仕轩,黎浚豪,吴逢春.基于功能MRI机器学习用于诊断和治疗精神分裂症研究进展[J].中国医学影像技术,2023,39(12):1898-1901. 被引量：1

二级引证文献1

1祁纳,赵军.基于MRI影像组学和机器学习诊断抑郁症研究进展[J].中国医学影像技术,2024,40(3):455-458.

1球迷问答[J].当代体育（扣篮）,2014,0(23):112-112.
2刘宇斌,叶鸿盛.基于RGB双目相机的学习机系统的设计[J].数码设计,2021,10(11):38-39.
3蒋虹乔,孔欣雅,罗荧.网络直播销售中消费者权益的侵害和保护研究[J].现代商业,2021(11):6-8. 被引量：2
4仝凌云,张彬.基于全局协作信息的个性化推荐算法[J].价值工程,2021,40(11):233-234.
5陈喆,刘贺翊.农村医养结合模式下养老设施功能布局改进对策探究[J].居业,2021(4):12-13. 被引量：3
6夏婷婷.基于矩阵编码和AMBTC的灰度图像可逆信息隐藏[J].莆田学院学报,2021,28(2):56-60. 被引量：1
7本刊编辑部.关键词标引的规范化[J].中国微侵袭神经外科杂志,2021,26(2):89-89.
8冯化纲.基于立体视觉的客流监控预警系统设计与实现[J].计算机与数字工程,2021,49(4):731-735. 被引量：2
9张俊鹏,刘辉,李清荣.基于FCN-LSTM的工业烟尘图像分割[J].计算机工程与科学,2021,43(5):907-916. 被引量：2
10本刊编辑部.关键词标引的规范化[J].中国微侵袭神经外科杂志,2021,26(3):136-136.

南开大学学报（自然科学版）

2021年第2期

浏览历史

内容加载中请稍等...

基于卷积-递归神经网络和费舍尔向量的RGB-D物体识别被引量：1

参考文献2

二级参考文献14

共引文献2

同被引文献5

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于卷积-递归神经网络和费舍尔向量的RGB-D物体识别 被引量：1

参考文献2

二级参考文献14

共引文献2

同被引文献5

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于卷积-递归神经网络和费舍尔向量的RGB-D物体识别被引量：1