摘要
通过分析嗓音的发音机理、病态嗓音与正常嗓音在频域的表现差异,利用小波变换对信号进行分解,突出病态嗓音的特点,提出了基于多尺度分析的小波降噪、分解的熵系数(Entropy Coefficient based on De-noise,Decomposition of Multi-scale Analysis,ECDDMA)作为识别的特征矢量集。并对比分析了语音识别中经典特征参数Mel倒谱系数(MFCC),分别运用这两种特征参数对242例正常嗓音和234例病态嗓音运用高斯混合模型(GMM)进行了识别。结果显示:ECDDMA系数较传统的模拟人耳听觉非线性特性的MFCC及其动态特征能更准确地表征正常与病态嗓音之间的差异,有利于同时提高病态和正常嗓音的识别率。
Considering the voice pronunciation mechanism,the different performances of the abnormal voice and the normal voice in the field of frequency,the paper proposes a new method for extracting characteristics that is Entropy Coefficient based on Denoise,Decomposition of Multi-scale Analysis (ECDDMA) using the wavelet decomposition to find the pathological voice's characteristics,and comparative analysis of the effective speech characteristics MFCC.242 normal voices samples and 234 abnormal samples are recognized with MFCC and the new extracted characteristics ECDDMA based on Gaussian Mixture Model (GMM).The result indicates that,the parameters of ECDDMA are more advantageous to the normal and abnormal voice recognition than the traditional MFCC and the dynamic characteristic which mimic the human ears non-linear characteristic with frequency,and improves the abnormal and normal voice's recognition result.
出处
《计算机工程与应用》
CSCD
北大核心
2009年第22期194-196,205,共4页
Computer Engineering and Applications
基金
广西自然科学基金No.0448035~~