Human Mouth-State Recognition Based on Image Warping and Sparse Representation Combined with Homotopy

Abstract: It is often necessary to recognize human mouth states in order to detect the number of audio sources and to improve the speech recognition capability of an intelligent robot auditory system. A human mouth-state recognition method based on image warping and sparse representation (SR) combined with homotopy is proposed. Using properly warped training mouth-state images as atoms of the overcomplete dictionary overcomes the impact of the diversity of mouth scales, shapes, and positions, so that robustness is further improved and the requirement for a large number of training samples is relaxed. The homotopy method is employed to compute the expansion coefficients effectively, i.e., for sparse coding. Orthogonal matching pursuit (OMP) is also tested and compared with the homotopy method. Experimental results and comparisons with state-of-the-art methods demonstrate the effectiveness of the proposed approach.
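The classification step described in the abstract can be illustrated with a toy sketch. The code below is not the authors' implementation: it uses a plain NumPy version of OMP (the baseline the abstract mentions, not the homotopy solver), random synthetic vectors in place of warped mouth images, and the usual sparse-representation rule of assigning the class whose atoms give the smallest reconstruction residual. The function names `omp` and `classify`, the label coding (0 = closed, 1 = open), and the dictionary layout are all assumptions made for illustration.

```python
import numpy as np

def omp(D, y, n_nonzero):
    """Greedy OMP: approximate y using at most n_nonzero columns (atoms) of D."""
    residual = y.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break
        support.append(k)
        # Refit all selected atoms jointly by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:
            break
    x = np.zeros(D.shape[1])
    x[support] = coef
    return x

def classify(D, labels, y, n_nonzero=5):
    """Assign y to the class whose atoms alone reconstruct it best."""
    x = omp(D, y, n_nonzero)
    classes = np.unique(labels)
    residuals = [np.linalg.norm(y - D @ np.where(labels == c, x, 0.0))
                 for c in classes]
    return classes[int(np.argmin(residuals))]

# Toy dictionary (hypothetical data): the two classes occupy disjoint
# coordinate blocks, so the example stays deterministic.
rng = np.random.default_rng(0)
dim, per_class = 20, 8
D = np.zeros((dim, 2 * per_class))
D[:10, :per_class] = rng.standard_normal((10, per_class))  # class 0 atoms
D[10:, per_class:] = rng.standard_normal((10, per_class))  # class 1 atoms
D /= np.linalg.norm(D, axis=0)                             # unit-norm atoms
labels = np.repeat([0, 1], per_class)                      # 0 = closed, 1 = open

# Test sample built from two class-1 atoms.
y = 0.7 * D[:, 9] + 0.3 * D[:, 12]
predicted = classify(D, labels, y, n_nonzero=3)
```

In a real pipeline the columns of `D` would be vectorized, warped training images, and the sparse coefficients could be computed by a homotopy solver instead of `omp` without changing the residual-based decision in `classify`.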

Authors: 李翠梅, 曾萍萍, 朱劲强, 吴建华

Affiliations: School of Communication and Electronics; College of Science and Technology; Department of Electronic Information Engineering

Source: Journal of Donghua University (English Edition) (东华大学学报（英文版）), 2015, No. 4, pp. 658-664 (7 pages). Indexed in EI and CAS.

Funding: National Natural Science Foundation of China (No. 61210306074); Natural Science Foundation of Jiangxi Province, China (No. 2012BAB201025); Scientific Program of Jiangxi Provincial Education Department, China (Nos. GJJ14583, GJJ13008)

Keywords: mouth-state recognition; image warping; sparse representation (SR); sparse coding; homotopy

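Classification codes: TN911.73 [Electronics and Telecommunications: Communication and Information Systems]; O235 [Science: Operations Research and Cybernetics]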
