期刊文献+

用共振峰轨迹提高汉语数码语音识别性能 被引量:2

Use formant trajectory to improve the performance of mandarin digit speech recognition
原文传递
导出
摘要 在汉语数码语音识别( M D S R)中,“2”和“8”是最易混淆的一对语音。文章分析了“2”和“8”混淆的原因,发现可用于分辨“2”和“8”的区别特征在于其共振峰轨迹的差异。因此文章提出了基于共振峰轨迹的判决算法( F T B D)来分辨“2”和“8”。实验表明,使用 F T B D 算法,使 M D S R识别率从960% 提高到 977% ,“2”和“8”的识别率从 91% 提高到99% ,消除了这对语音的混淆,提高了 M D S R In mandarin digit speech recognition (MDSR), “2” and “8” are the most confusable pair of words. The reason why “2” and “8” are often confused is analyzed. It is found that the cue to distinguish “2” and “8” is the difference between the formant trajectory of “2” and “8”. Therefore the formant trajectory based on decision algorithm (FTBD) was proposed to distinguish “2” and “8”. Experiments show that with FTBD the correct recognition rate is improved from 96.0% to 97.7% for MDSR, and from 91% to 99% for “2” and “8”, thus this confusion is removed from MDSR, and the performance of MDSR is improved.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 1999年第9期69-71,共3页 Journal of Tsinghua University(Science and Technology)
关键词 汉语 数码语音识别 共振峰轨迹 MDSR mandarin digit speech recognition formant trajectory
  • 相关文献

参考文献4

二级参考文献3

  • 1Chen C J,Proc 3rd International Conf Signal Proc,1996年,821页
  • 2Chiang T H,IEEE Trans Speech Audio Process,1996年,4卷,3期,167页
  • 3Gu Y H,Proc IEEE ICASSP,1992年,2期,21页

共引文献6

同被引文献3

引证文献2

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部