期刊文献+

基于多视图特征投影与合成解析字典学习的图像分类 被引量:1

Multi-view feature projection and synthesis-analysis dictionary learning for image classification
下载PDF
导出
摘要 针对目前存在的合成解析字典学习方法不能有效地消除同类样本之间的差异性和忽略了不同特征对分类的不同影响的问题,提出了一种基于多视图特征投影与合成解析字典学习(MFPSDL)的图像分类方法。首先,在合成解析字典学习过程中为每种特征学习不同的特征投影矩阵,减小了类内样本间的差异对识别带来的影响;其次,对合成解析字典添加鉴别性的约束,使得同类样本具有相似的稀疏表示系数;最后通过为不同类型的特征学习权重,充分地融合多种特征。在公开人脸数据库(LFW)和手写体识别数据库(MNIST)上进行多项对比实验,MFPSDL方法在LFW和MNIST数据库上的训练时间分别为61.236 s和52.281 s,MFPSDL方法相比Fisher鉴别字典学习(FDDL)、类别一致的K奇异值分解(LC-KSVD)、字典对学习(DPL)等字典学习方法,在LFW和MNIST上的识别率提高了至少2.15和2.08个百分点。实验结果表明,所提方法在保证较低的时间复杂度的同时,获得了更好的识别效果,适用于图像分类。 Concerning the problem that the existing synthesis-analysis dictionary learning method can not effectively eliminate the differences between the samples of the same class and ignore the different effects of different features on the classification, an image classification method based on Multi-view Feature Projection and Synthesis-analysis Dictionary Learning (MFPSDL) was put forward. Firstly, different feature projection matrices were learned for different features in the process of synthesis-analysis dictionary learning, so the influence of the within-class differences on recognition was reduced. Secondly, discriminant constraint was added to the synthesis-analysis dictionary, so that similar sparse representation coefficients were obtained for samples of the same class. Finally, by learning different weights for different features, multiple features could be fully integrated. Several experiments were carried out on the Labeled Faces in the Wild (LFW) and Modified National Institute of Standards and Technology (MNIST) database, the training time of MFPSDL method on LFW and MNIST databases were 61. 236 s and 52. 281 ,~. Compared with Fisher Discrimination Dictionary Learning ( FDDL), Lable Consistent K Singular Value Decomposition (LC-KSVD) and Dictionary Pair Learning ( DPL), the recognition rate of MFPSDL method on LFW and MNIST was increased by at least 2.15 and 2.08 percentage points. The experimental results show that MFPSDL method can obtain higher recognition rate while keeping low time complexity, and it is suitable for image classification.
出处 《计算机应用》 CSCD 北大核心 2017年第7期1960-1966,共7页 journal of Computer Applications
基金 国家自然科学基金资助项目(61272273)~~
关键词 图像分类 字典学习 稀疏表示 多视图学习 特征学习 image classification dictionary learning sparse representation multi-view learning feature learning
  • 相关文献

参考文献2

二级参考文献16

  • 1MarkoffJ. How many computers to identify a cat?[NJ The New York Times, 2012-06-25.
  • 2MarkoffJ. Scientists see promise in deep-learning programs[NJ. The New York Times, 2012-11-23.
  • 3李彦宏.2012百度年会主题报告:相信技术的力量[R].北京:百度,2013.
  • 410 Breakthrough Technologies 2013[N]. MIT Technology Review, 2013-04-23.
  • 5Rumelhart D, Hinton G, Williams R. Learning representations by back-propagating errors[J]. Nature. 1986, 323(6088): 533-536.
  • 6Hinton G, Salakhutdinov R. Reducing the dimensionality of data with neural networks[J]. Science. 2006, 313(504). Doi: 10. 1l26/science. 1127647.
  • 7Dahl G. Yu Dong, Deng u, et a1. Context-dependent pre?trained deep neural networks for large vocabulary speech recognition[J]. IEEE Trans on Audio, Speech, and Language Processing. 2012, 20 (1): 30-42.
  • 8Jaitly N. Nguyen P, Nguyen A, et a1. Application of pretrained deep neural networks to large vocabulary speech recognition[CJ //Proc of Interspeech , Grenoble, France: International Speech Communication Association, 2012.
  • 9LeCun y, Boser B, DenkerJ S. et a1. Backpropagation applied to handwritten zip code recognition[J]. Neural Computation, 1989, I: 541-551.
  • 10Large Scale Visual Recognition Challenge 2012 (ILSVRC2012)[OLJ.[2013-08-01J. http://www. image?net.org/challenges/LSVRC/2012/.

共引文献600

同被引文献10

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部