期刊文献+

结合高斯混合模型和VOT特征的音素发音错误检测 被引量:3

Phonetic Mispronunciation Detection Based on GMM and VOT
下载PDF
导出
摘要 结合高斯混合模型(GMM)和嗓音起始时间(VOT)特征的普通话音素发音错误检测,提出了一种结合语音声道特征信息和音源特征信息的发音错误检测方法。其中GMM用于反映声道特征信息的MFCC参数的建模与评测,并直接对大部分音素的发音质量直接进行错误检测。对于少数通过MFCC参数和GMM难于检测区分的辅音音素,则通过反映VOT信息的音源特征参数进行区分。实验表明,该方法在训练数据有限的情况下取得了较好的性能,非常适合用于聋人语言康复的计算机辅助训练。 Combining Gaussian mixture model (GMM) and voice onset time (VOT) features, a novel Mandarin phonetic mispronunciation detection approach is proposed which combines the spectral features and the prosodic features. GMMs are used to model the Mel-frequency cepstral coefficients (MFCCs) for all the phonemes and eval- uate the pronunciation of most of the phonemes directly. For some consonants which are difficult to distinguish, prosodic features reflecting the VOT are extracted and used to achieve the classification. Mandarin phonetic mispro- nunciation detection experiments on limited data indicate the significant improvement. So, the new framework is quite helpful for the implementation of independent Putonghua training by the adult hearing-impaired people.
出处 《科学技术与工程》 北大核心 2013年第7期1789-1793,共5页 Science Technology and Engineering
基金 国家自然科学基金项目(61005020 60901061)资助
关键词 语音识别 发音错误检测 高斯混合模型 嗓音起始时间 speech recognition mispronunciation detection Gaussian mixture model voice onset time
  • 相关文献

参考文献10

  • 1Maxine Eskenazi. An overview of spoken language technology for edu- cation. Speech Communication ,2009 ;51 (10) :832-844.
  • 2Moustroufas, Digalakis V. Automatic pronunciation evaluation of for- eign speakers using unknown text. Computer Speech and Language, 2007 ; 21 ( 1 ) :219-230.
  • 3Witt S M, Young S J. Phone-level pronunciation scoring and assess- ment for interactive language learning . Speech Communication, 2000 ;30( 1 ) :95 - 108.
  • 4Frederik Stouten, Jean-Pierre Martens. On the Use of Phonological Features for Pronunciation Scoring. Proc of IEEE International Confer- ence on Acoustics, Speech and Signal Processing. ICASSP 2006.
  • 5Strik H, Truong K, Wet F de, et al. Comparing classifiers for pronun- ciation error detection. Proc. Interspeech 2007.
  • 6Strik H, Truong K, Wet F de, et al. Comparing different approaches for automatic pronunciation error detection. Speech Communication, 2009 ;51 (10) : 845-852.
  • 7van Doremalen J, Cucchiarini C, Strik H. Automatic Detection of Vowel Pronunciation Errors Using Multiple Information Sources. Proc of the biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU) ,2009.
  • 8黄中伟,杨磊,徐明,冯杉杉.普通话语音识别中的基本音素分析[J].深圳大学学报(理工版),2006,23(4):356-357. 被引量:9
  • 9Reynolds D A. Speaker identification and verification using Gaussian mixture speaker models. Speech Communication, 1995; 17(1-2) : 91 -108.
  • 10Reynolds D A. Speaker Verification using Adapted Gaussian Mixture Models. Digital Signal Processing, 2000; 10( 1 ) :19--41.

二级参考文献1

  • 1汉语拼音方案(第一届全国人民代表大会第五次会议通过)[G].国家语言文字委员会标准化工作办公室.国家语言文字规范和标准选编.北京:中国标准出版社,1997.

共引文献8

同被引文献29

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部