摘要
在汉语数码语音识别( M D S R)中,“2”和“8”是最易混淆的一对语音。文章分析了“2”和“8”混淆的原因,发现可用于分辨“2”和“8”的区别特征在于其共振峰轨迹的差异。因此文章提出了基于共振峰轨迹的判决算法( F T B D)来分辨“2”和“8”。实验表明,使用 F T B D 算法,使 M D S R识别率从960% 提高到 977% ,“2”和“8”的识别率从 91% 提高到99% ,消除了这对语音的混淆,提高了 M D S R
In mandarin digit speech recognition (MDSR), “2” and “8” are the most confusable pair of words. The reason why “2” and “8” are often confused is analyzed. It is found that the cue to distinguish “2” and “8” is the difference between the formant trajectory of “2” and “8”. Therefore the formant trajectory based on decision algorithm (FTBD) was proposed to distinguish “2” and “8”. Experiments show that with FTBD the correct recognition rate is improved from 96.0% to 97.7% for MDSR, and from 91% to 99% for “2” and “8”, thus this confusion is removed from MDSR, and the performance of MDSR is improved.
出处
《清华大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
1999年第9期69-71,共3页
Journal of Tsinghua University(Science and Technology)