期刊文献+

Investigation of Improved Approaches to Bayes Risk Decoding

Investigation of Improved Approaches to Bayes Risk Decoding
原文传递
导出
摘要 Bayes risk (BR) decoding methods have been widely investigated in the speech recognition area due to its flexibility and complexity compared with the maximum a posteriori (MAP) method regarding to minimum word error (MWE) optimization. This paper investigates two improved approaches to the BR decoding, aiming at minimizing word error. The novelty of the proposed methods is shown in the explicit optimization of the objective function, the value of which is calculated by an improved forward algorithm on the lattice. However, the result of the first method is obtained by an expectation maximization (EM) like iteration, while the result of the second one is achieved by traversing the confusion network (CN), both of which lead to an optimized objective function value with distinct approaches. Experimental results indicate that the proposed methods result in an error reduction for lattice rescoring, compared with the traditional CN method for lattice rescoring. Bayes risk (BR) decoding methods have been widely investigated in the speech recognition area due to its flexibility and complexity compared with the maximum a posteriori (MAP) method regarding to minimum word error (MWE) optimization. This paper investigates two improved approaches to the BR decoding, aiming at minimizing word error. The novelty of the proposed methods is shown in the explicit optimization of the objective function, the value of which is calculated by an improved forward algorithm on the lattice. However, the result of the first method is obtained by an expectation maximization (EM) like iteration, while the result of the second one is achieved by traversing the confusion network (CN), both of which lead to an optimized objective function value with distinct approaches. Experimental results indicate that the proposed methods result in an error reduction for lattice rescoring, compared with the traditional CN method for lattice rescoring.
作者 徐海华 朱杰
出处 《Journal of Shanghai Jiaotong university(Science)》 EI 2011年第5期524-529,共6页 上海交通大学学报(英文版)
关键词 Bayes risk (BR) confusion network (CN) speech recognition lattice rescoring Bayes risk (BR), confusion network (CN), speech recognition, lattice rescoring
  • 相关文献

参考文献17

  • 1LEVENSHTE1N V I. Binary codes capable of correcting deletions, insertions and reversals [J]. Soviet Physics Doklady, 1966, 10(8): 707-710.
  • 2STOLCKE A, KONIG Y, WEINTRAUB M. Explicit word error minimization in n-best list rescoring [C]//Proceedings of the 5th European Conference on Speech Communication and Technology. Rhodes, Greece: ISCA, 1997: 163-166.
  • 3MANGU L in speech BRILL E, STOLCKE A. Finding consensus recognition: Word error minimization and other applications of confusion networks [J]. Computer Speech and Language, 2000, 14: 373-400.
  • 4COEL V, KUMAR S, BYRNE W J. Minimum bayes-risk automatic speech recognition [J]. Computer Speech and Language, 2000, 14: 115-135.
  • 5WESSEL F, SCHLUTER R, NEY H. Explicit word error minimization using word hypothesis posterior prob- abilities [C]//Proceeding of International Conference on Acoustics, Speech, and Signal Processing. Salt Lake City, USA: IEEE, 2001: 33-36.
  • 6GOEL V, BYRNE W J. Segmental minimum bayes-risk decoding for automatic speech recognition [J]. IEEE Transactions on Speech and Audio Processing, 2006, 12: 234-249.
  • 7Xu H, POVEY D, ZHu J, et al. Minimum hypothesis phone error as a decoding method for speech recog- nition [C]/ / Proeeedings of INTERSPEECH. Brighton, UK: ISCA, 2009: 76-79.
  • 8POVEY D, WOODLAND P C. Minimum phone error and I-smoothing for improved discriminative training [C]/ / Proceeding of International Conference on Acous- tics, Speech, and Signal Processing. Florida, USA: IEEE, 2002: 105-108.
  • 9HOFFMEISTER B, SCHLUTER R, NEY H. Bayes risk ap- proximations using time overlap with an application to system combination [C]// Proceedings of INTER- SPEECH. Brighton, UK: ISCA, 2009: 1191-1194.
  • 10HEIGOLD G, MACHEREY W, SCHLUTER R, et al. Min- imum exact word error training [C]// Proceedings of Automatic Speech Recognition and Understanding. San Juan, USA: IEEE, 2005: 186-190.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部