期刊文献+

普通话发音评估性能改进 被引量:2

Improvements on Mandarin Pronunciation Evaluation
下载PDF
导出
摘要 为减少噪声环境对评估性能的影响,该文将PNCC参数引入普通话发音评估。结果表明,其评分相关性在普通话测试实录音数据库上较传统MFCC参数提高了6.6%。在此基础上,对汉语声学模型拆分方法进行了研究,提出将声母介音+韵母模型拆分方法应用到发音评估中。使用这种拆分方式的评估系统总错误率降低5.6%,专家打分相关性则提高了0.056。该文还对模型最佳状态数的选取进行讨论,并提出模型状态数混合和不同配置综合评分两种混合评分方案,在相关性上较同等条件下3状态模型分别提高了0.021和0.017。 In this paper, PNCC(Power-Normalized Cepstral Coefficients) is introduced into Mandarin pronunciation evaluation system for reducing the impact of background noise. The result shows that the score correlation based on PNCC has been increased by 6.6% compared with classical MFCC. Then, different initial-final acoustic model structures for Chinese syllables are investigated on Mandarin pronunciation evaluation. An initial-medial and final (IMF) modeling is applied, resulting 5.6% reduction of the error rate and an increase of 0. 056 score correlation. Finally, the number of states in HMM model is discussed for pronunciation scoring, and some mixed score compu- ting schemes based on either models or scores are proposed. Test results show the score correlation with the experts has been increased by 0. 021 and 0. 017 respectively.
出处 《中文信息学报》 CSCD 北大核心 2013年第3期48-55,共8页 Journal of Chinese Information Processing
基金 2010年北京师范大学自主科研基金项目资助 2010年北京师范大学教学建设与改革项目资助
关键词 发音评估 PNCC 模型拆分 HMM状态数 mandarin pronunciation evaluation PNCC~ initial-medial and final HMM states
  • 相关文献

参考文献20

  • 1S M Witt. Use of Speech Recognition in Computer as- sisted Language Learning[D]. PhD Thesis, the Uni-.versity of Cambridge, Nov. 1999.
  • 2H Strik, K Truong, et al. Comparing different ap- proaches for automatic pronunciation error detection [J]. Speech Communication, 2009, 51(10): 845-852.
  • 3K Truong, A Neri, C Cucchiarini, et al. Automatic pronunciation error detection: an acoustic-phonetic ap- proach[C]//Proceedings of the InSTIL/ICALL Sym posium 2004. Venice, Italy: 2004: 135-138.
  • 4A Neri, C Cucchiarini, W Strik. Automatic speech recognition for second language learning how and why it actually works[C]//Proceedings of the 15th Interna- tional Congresses of Phonetic Sciences. Barcelona, Spanish: 2003: 1157-1160.
  • 5刘庆升,魏思,胡郁,郭武,王仁华.基于语言学知识的发音质量评价算法改进[J].中文信息学报,2007,21(4):92-96. 被引量:14
  • 6葛凤培,潘复平,董滨,颜永红.汉语发音质量评估的实验研究[J].声学学报,2010,35(2):261-266. 被引量:12
  • 7张峰,黄超,戴礼荣.普通话发音错误自动检测技术[J].中文信息学报,2010,24(2):110-115. 被引量:3
  • 8Kim Chanwoo. Robust Speech Recognition Motivated by Auditory and Binaural Observations[D]. Depart- ment of Language Technologies Institute, Carnegie Mellon University, Ph D. thesis, July 2009.
  • 9H I. Wang, J Q Han, T R Zheng. Quality evaluation and optimization of confusion network for LVCSR[J].中国科学院电子学报合集,1994-2007.
  • 10L Mangu, E Brill, A Stolcke. Finding consensus in speech recognition., word error minimization and oth- er applications of confusion networks[J]. In Comput- er, Speech and Language, 2000,14(4): 373-400.

二级参考文献60

共引文献58

同被引文献32

  • 1艾伦,王陆,张鸽.汉语标准普通话水平训练和自动测试系统[J].中国电化教育,2004(7):78-80. 被引量:3
  • 2孙景涛.介音在音节中的地位[J].语言科学,2006,5(2):44-52. 被引量:13
  • 3何珏,刘加.汉语连续语音中HMM模型状态数优化方法研究[J].中文信息学报,2006,20(6):83-88. 被引量:7
  • 4顾明亮,马勇.基于高斯混合模型的汉语方言辨识系统[J].计算机工程与应用,2007,43(3):204-206. 被引量:9
  • 5李净,徐明星.汉语连续语音识别中声学模型基元比较:音节、音素、声韵母[C].第六届全国人机语音通信会议,20014:267-280.
  • 6Maxine Eskenazi. An overview o{ spoken language technology for education [J]. M Eskenazi Speech Communication, 2009, 51(4):832.
  • 7何怡.错音检测技术在计算机辅助普通话发音教学系统中的应用研究[D].北京:北京师范大学,2013.
  • 8Hinton G, Li Deng, Dong Yu. Deep neural networks for acoustic modeling in speech recognition the shared views of four research groups[J]. Proc IEEE, 2012,29(6).84.
  • 9Chen Mingming,Yang Zhanlei , Liu Wenju. Deep natural network for madarin tone recognition[J]. Proc IJCNN, 2014,6889515 : 1154.
  • 10Lawrence Rabiner, Biing Hwang Juang. Fundamentals ofspeech recognition[M]. America. Prentice Hall, 1993:22.

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部