期刊文献+

基于RGB-D图像核描述子的物体识别方法 被引量:3

Object recognition method based on RGB-D image kernel descriptor
下载PDF
导出
摘要 针对传统的颜色-深度(RGB-D)图像物体识别的方法所存在的图像特征学习不全面、特征编码鲁棒性不够等问题,提出了基于核描述子局部约束线性编码(KD-LLC)的RGB-D图像物体识别方法。首先,在图像块间匹配核函数基础上,应用核主成分分析法提取RGB-D图像的3D形状、尺寸、边缘、颜色等多个互补性核描述子;然后,分别对它们进行LLC编码及空间池化处理以形成相应的图像编码向量;最后,把这些图像编码向量融合成具有鲁棒性、区分性的图像表示。基于RGB-D数据集的仿真实验结果表明,作为一种基于人工设计特征的RGB-D图像物体识别方法,由于所提算法综合利用深度图像和RGB图像的多方面特征,而且对传统深度核描述子的采样点选取和紧凑基向量的计算这两方面进行了改进,使得物体类别识别率达到86.8%,实体识别率达到92.7%,比其他同类方法具有更高的识别准确率。 The traditional RGB-Depth (RGB-D) image object recognition methods have some drawbacks, such as insufficient feature learning and poor robustness of feature coding. In order to solve these problems, an object recognition method of RGB-D image based on Kernel Descriptor and Locality-constrained Linear Coding (KD-LLC) was proposed. Firstly, based on the kernel function of image block matching, several complementary kernel descriptors from RGB-D images, such as 3D shape, size, edges and color, were extracted using Kernel Principal Component Analysis (KPCA). Then, the extracted feature from different cues, were processed by using LLC and Spatial Pyramid Pooling (SPP) to form the corresponding image coding vectors. Finally, the vectors were combined to obtain robust and distinguishable image representation. As a hand- crafted feature method, the proposed algorithm was compared to other hand-crafted feature methods on a RGB-D image dataset. In the proposed algorithm, multiple cues from depth image and RGB image were used, and the sampling points selection and basis vectors calculation schema for depth kernel descriptor generation were proposed. Due to above-mentioned improvements, the category and instance recognition accuracy of the proposed algorithm for objects can respectively reach 86.8% and 92.7%, which are higher than those of the previously hand-crafted feature methods for object recognition from RGB-D images.
作者 骆健 蒋旻
出处 《计算机应用》 CSCD 北大核心 2017年第1期255-261,共7页 journal of Computer Applications
基金 国家自然科学基金面上项目(41571396) 国家创新训练项目(201410488017)~~
关键词 RGB-D图像 物体识别 局部约束线性编码 核描述子 空间池化 RGB-D image object recognition Locality-constrained Linear Coding (LLC) kernel descriptor SpatialPyramid Pooling (SPP)
  • 相关文献

参考文献3

二级参考文献49

  • 1李长勇,曹其新.基于深度图像的蔬果形状特征提取[J].农业机械学报,2012,43(S1):242-245. 被引量:9
  • 2蔡骋,张明,朱俊平.基于压缩感知理论的杂草种子分类识别[J].中国科学:信息科学,2010,40(S1):160-172. 被引量:16
  • 3MarkoffJ. How many computers to identify a cat?[NJ The New York Times, 2012-06-25.
  • 4MarkoffJ. Scientists see promise in deep-learning programs[NJ. The New York Times, 2012-11-23.
  • 5李彦宏.2012百度年会主题报告:相信技术的力量[R].北京:百度,2013.
  • 610 Breakthrough Technologies 2013[N]. MIT Technology Review, 2013-04-23.
  • 7Rumelhart D, Hinton G, Williams R. Learning representations by back-propagating errors[J]. Nature. 1986, 323(6088): 533-536.
  • 8Hinton G, Salakhutdinov R. Reducing the dimensionality of data with neural networks[J]. Science. 2006, 313(504). Doi: 10. 1l26/science. 1127647.
  • 9Dahl G. Yu Dong, Deng u, et a1. Context-dependent pre?trained deep neural networks for large vocabulary speech recognition[J]. IEEE Trans on Audio, Speech, and Language Processing. 2012, 20 (1): 30-42.
  • 10Jaitly N. Nguyen P, Nguyen A, et a1. Application of pretrained deep neural networks to large vocabulary speech recognition[CJ //Proc of Interspeech , Grenoble, France: International Speech Communication Association, 2012.

共引文献607

同被引文献20

引证文献3

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部