Identification of Noisy Utterance Speech Signal using GA-Based Optimized 2D-MFCC Method and a Bispectrum Analysis

Abstract: One-dimensional Mel-Frequency Cepstrum Coefficients (1D-MFCC), combined with power spectrum analysis, are commonly used for feature extraction in speaker identification systems. However, because this one-dimensional feature extraction subsystem shows a low recognition rate when identifying an utterance speech signal under harsh noise conditions, we have developed a speaker identification system based on two-dimensional bispectrum data, which is theoretically more robust to additive Gaussian noise. Because the processing sequence of the 1D-MFCC method cannot be applied directly to two-dimensional bispectrum data, this paper proposes a 2D-MFCC method as an extension of the 1D-MFCC method, with the 2D filter design optimized using Genetic Algorithms (GA). With the 2D-MFCC method and bispectrum analysis as the feature extraction technique, a Hidden Markov Model is used as the pattern classifier. We experimentally evaluate the developed methods for identifying utterance speech signals buried in various levels of noise. The experimental results show that, for utterance signals without added noise, the 2D-MFCC method without GA optimization achieves a comparably high recognition rate to that of the 1D-MFCC method. When the utterance signal is buried in Gaussian noise, however, the developed 2D-MFCC method shows higher recognition capability, especially when the 2D-MFCC is optimized by Genetic Algorithms.
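To make the "two-dimensional bispectrum data" mentioned in the abstract concrete, the following is a minimal sketch of a direct (FFT-based) bispectrum estimate, B(k1, k2) = E[X(k1) X(k2) X*(k1 + k2)], averaged over frames. The abstract does not specify the authors' estimator, so the function name `bispectrum`, the frame length `nfft`, the Hanning window, and the non-overlapping framing are all assumptions made for illustration. The 2D-MFCC filtering, GA optimization, and HMM stages are not shown.

```python
import numpy as np

def bispectrum(signal, nfft=64):
    """Direct bispectrum estimate averaged over non-overlapping frames.

    Hypothetical helper for illustration only; not the paper's implementation.
    Returns an (nfft, nfft) complex array indexed by frequency bins (k1, k2).
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = len(signal) // nfft
    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")
    # Split into frames and remove each frame's mean.
    frames = signal[: n_frames * nfft].reshape(n_frames, nfft)
    frames = frames - frames.mean(axis=1, keepdims=True)
    # Windowed FFT of every frame.
    spectra = np.fft.fft(frames * np.hanning(nfft), axis=1)
    # Index of bin k1 + k2, wrapped modulo nfft.
    k = np.arange(nfft)
    sum_idx = (k[:, None] + k[None, :]) % nfft
    # Triple product X(k1) X(k2) conj(X(k1 + k2)) for every frame.
    triple = (spectra[:, :, None] * spectra[:, None, :]
              * np.conj(spectra[:, sum_idx]))
    return triple.mean(axis=0)

# Example: three phase-coupled tones at bins 5, 9 and 5 + 9 = 14.
n = np.arange(64 * 32)
x = (np.cos(2 * np.pi * 5 * n / 64)
     + np.cos(2 * np.pi * 9 * n / 64)
     + np.cos(2 * np.pi * 14 * n / 64))
B = bispectrum(x, nfft=64)
print(B.shape, abs(B[5, 9]))
```

The result is a 2D array over frequency pairs, which is why a 1D mel filter bank cannot be applied to it directly. The theoretical noise robustness comes from the fact that the third-order cumulants of a zero-mean Gaussian process vanish, so its bispectrum is zero in expectation; a finite-sample estimate such as this one only approaches zero as more frames are averaged.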

Authors: Benyamin Kusumoputro, Agus Buono, Li Na

Affiliations: Department of Computer Science; Department of Computer Science; Department of Electrical Engineering

Source: Journal of Software Engineering and Applications, 2012, Issue 12, pp. 193-199 (7 pages)

Keywords: 2D Mel-Frequency Cepstrum Coefficients; Bispectrum; Hidden Markov Model; Genetic Algorithms

Classification code: R73 [Medicine and Health: Oncology]
