期刊文献+

Human Mouth-State Recognition Based on Image Warping and Sparse Representation Combined with Homotopy

Human Mouth-State Recognition Based on Image Warping and Sparse Representation Combined with Homotopy
下载PDF
导出
摘要 It is often necessary to recognize human mouth-states for detecting the number of audio sources and improving the speech recognition capability of an intelligent robot auditory system. A human mouth-state recognition method based on image warping and sparse representation( SR) combined with homotopy is proposed.Using properly warped training mouth-state images as atoms of the overcomplete dictionary overcomes the impact of the diversity of the mouths' scales,shapes and positions so that further improvement of the robustness can be achieved and the requirement for a large number of training samples can be relieved. The homotopy method is employed to compute the expansion coefficients effectively,i. e.,for sparse coding. The orthogonal matching pursuit( OMP) is also tested and compared with the homototy method. Experimental results and comparisons with the state-of-the-art methods have proved the effectiveness of the proposed approach. It is often necessary to recognize human mouth-states for detecting the number of audio sources and improving the speech recognition capability of an intelligent robot auditory system. A human mouth-state recognition method based on image warping and sparse representation( SR) combined with homotopy is proposed.Using properly warped training mouth-state images as atoms of the overcomplete dictionary overcomes the impact of the diversity of the mouths' scales,shapes and positions so that further improvement of the robustness can be achieved and the requirement for a large number of training samples can be relieved. The homotopy method is employed to compute the expansion coefficients effectively,i. e.,for sparse coding. The orthogonal matching pursuit( OMP) is also tested and compared with the homototy method. Experimental results and comparisons with the state-of-the-art methods have proved the effectiveness of the proposed approach.
出处 《Journal of Donghua University(English Edition)》 EI CAS 2015年第4期658-664,共7页 东华大学学报(英文版)
基金 National Natural Science Foundation of China(No.61210306074) Natural Science Foundation of Jiangxi Province,China(No.2012BAB201025) the Scientific Program of Jiangxi Provincial Education Department,China(Nos.GJJ14583,GJJ13008)
关键词 mouth-state recognition image warping sparse representation(SR) sparse coding HOMOTOPY mouth-state recognition image warping sparse representation(SR) sparse coding homotopy
  • 相关文献

参考文献31

  • 1Rivet B, Wang W, Naqvi S M, et al. Audio-Visual Speech Source Separation [ J]. IEEE Signal Processing Magazine, 2014, 31(3) : 125-134.
  • 2Liu Q, Wang W W, Jackson P. Use of Bimodal Coherence to Resolve Permutation Problem in Convolutive BSS [ J ]. Signal Processing, 2012, 92(8): 1916-1927.
  • 3Missaoui I, Zied L. Cepstral Smoothing of Binary Masks for Convolutive Blind Separation of Speech Mixtures [ J ]. International Journal of Digital Content Technology and Its Applications, 2012, 6 (17) : 532-541.
  • 4Bucak S S, Rong J, Jain A K. Multiple Kernel Learning for Visual Object Recognition: a Review [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36 (7) : 1354- 1369.
  • 5Loog M, Jensen A C. Semi-supervised Nearest Mean Classification through a Constrained Log-likelihood [ J ]. IEEE Transactions on Neural Networks and Learning Systems, 2015, 26(5) : 995-1006.
  • 6Zheng J, Lu B L. A Support Vector Machine Classifier with Automatic Confidence and Its Application to Gender Classification [ J ]. Neurocomputing, 2011, 74 ( 11 ) : 1926-1935.
  • 7Cavalcanti G D C, Ren T I, Vale B A. Data Complexity Measures and Nearest Neighbor Classifiers; a Practical Analysis for Meta-learning [ C ]. IEEE 24th International Conference on Tools with Artificial Intelligence, Athens, Greece, 2012.. 1065- 1069.
  • 8Bag S, Sanyal G. An Efficient Face Recognition Approach Using PCA and Minimum Distance Classifier [ C ]. IEEE International Conference on Image Information Processing, Himachal Pradesh, India, 2011: 3-5.
  • 9Wang C L, I.an L, Zhang Y W, et al. Face Recognition Based on Principle Component Analysis and Support Vector Machine [ C]. IEEE 3rd International Workshop on Intelligent Systems and Applications, Wuhan, China, 2011: 1-4.
  • 10Cootes T F, Edwards G J, Taylor C J. Active Appearance Models [ J]. Computer Vision ECCV'98, 1998, 1407 : 484-498.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部