Abstract
Bayes risk (BR) decoding methods have been widely investigated in the speech recognition area because of their flexibility, albeit at higher complexity, compared with the maximum a posteriori (MAP) method with respect to minimum word error (MWE) optimization. This paper investigates two improved approaches to BR decoding that aim at minimizing word error. The novelty of the proposed methods lies in the explicit optimization of the objective function, whose value is calculated by an improved forward algorithm on the lattice. The result of the first method is obtained by an expectation-maximization (EM)-like iteration, while that of the second is obtained by traversing the confusion network (CN); the two methods thus optimize the objective function value through distinct approaches. Experimental results indicate that, in lattice rescoring, the proposed methods achieve an error reduction compared with the traditional CN method.
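For context, the general BR decoding rule that the abstract refers to can be stated as follows; the notation here is a standard formulation supplied for the reader, not taken from the paper itself. Given acoustic observations $O$, BR decoding selects the hypothesis with the minimum expected loss under the posterior distribution over competing hypotheses $W'$:

\[
\hat{W} \;=\; \arg\min_{W} \sum_{W'} P(W' \mid O)\, L(W, W'),
\]

where choosing the Levenshtein (word edit) distance for the loss $L(\cdot,\cdot)$ yields the MWE criterion, and MAP decoding is recovered as the special case of a 0/1 sentence-level loss.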