Refining the triphone model in Mandarin continuous speech recognition (cited by: 4)

Original title: 汉语连续语音识别系统中三音子模型的优化
Abstract: In Mandarin continuous speech recognition, some toned triphones have very few training samples, while their corresponding toneless triphones have comparatively many. To estimate the parameters of toned triphone models more accurately before state clustering, and thereby improve both the accuracy of the tied states after clustering and the recognition performance of the system, this paper initializes each toned triphone with the model parameters of its corresponding toneless triphone and then trains the model under the maximum a posteriori (MAP) criterion. Experiments on Mandarin large-vocabulary continuous speech recognition show that the method improves the pre-clustering model accuracy of triphones that are sparse in the training corpus, and thereby improves the system's recognition performance.
Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 10, pp. 2920-2922 (3 pages)
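Funding: National Natural Science Foundation of China (10925419, 90920302, 61072124, 11074275, 11161140319, 91120001, 61271426); Strategic Priority Research Program of the Chinese Academy of Sciences (XDA06030100, XDA06030500); National "863" Program (2012AA012503); Key Deployment Project of the Chinese Academy of Sciences (KGZD-EW-103-2)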
Keywords: decision tree-based clustering; triphone model; initials and finals; maximum a posteriori (MAP)
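
The abstract names the approach (initialize a sparse toned triphone from its toneless counterpart, then train under the MAP criterion) but does not give the update equations. As a rough illustration only, the sketch below applies the standard MAP update for a single Gaussian mean, with the toneless triphone's mean acting as the prior. The function name `map_mean_update`, the prior weight `tau`, and the toy feature values are assumptions made for this example and are not taken from the paper.

```python
from typing import List, Sequence


def map_mean_update(
    prior_mean: Sequence[float],
    frames: Sequence[Sequence[float]],
    occupancies: Sequence[float],
    tau: float = 10.0,
) -> List[float]:
    """Standard MAP estimate of a Gaussian mean.

    prior_mean  : mean of the toneless triphone state, used as the prior
    frames      : feature vectors aligned to the toned triphone state
    occupancies : state occupation probability of each frame
    tau         : hypothetical prior weight; larger values trust the prior more
    """
    if len(frames) != len(occupancies):
        raise ValueError("frames and occupancies must have the same length")

    dim = len(prior_mean)
    total_occupancy = sum(occupancies)

    # Occupancy-weighted sum of the observed frames, per dimension.
    weighted_sum = [0.0] * dim
    for frame, gamma in zip(frames, occupancies):
        for d in range(dim):
            weighted_sum[d] += gamma * frame[d]

    # Interpolate between the prior mean and the data according to
    # how much data the toned triphone actually has.
    return [
        (tau * prior_mean[d] + weighted_sum[d]) / (tau + total_occupancy)
        for d in range(dim)
    ]


# Toy values: a toned triphone with a single aligned frame.
toneless_mean = [0.0, 0.0]
updated = map_mean_update(toneless_mean, [[2.0, 4.0]], [1.0], tau=1.0)
print(updated)  # [1.0, 2.0]
```

With no aligned frames the estimate falls back to the toneless mean, and as the occupancy grows the estimate moves toward the sample mean of the toned triphone's own data. This is the behaviour that makes a MAP-style update attractive for sparsely observed units.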