期刊文献+

基于词图重估的语音解码参数优化方法

Speech Decoding Parameters Optimization Method Based on Word Graph Rescoring
下载PDF
导出
摘要 在大词汇连续语音识别系统中,语言模型权值和插入代价等语音解码参数对系统的识别率有较大的影响,而在实际应用中常通过实验手动调整其值寻求最佳识别结果。为此,提出一种利用二元文法进行词图重估的方法,自动优化语音解码参数。在重估的参数空间搜索过程中采用线性搜索与模拟退火搜索相结合的方法,使优化参数具有全局最优和对初值稳定性强的优点。实验结果表明,相比凭经验设置的参数,该方法估计出的参数值能大幅降低识别词错误率,与经典的N-best优化相比,其优化速度有较大提升。 In Large Vocabulary Continuous Speech Recognition(LVCSR) system,speech decoding parameters——Language Model(LM) weight and insertion cost can greatly affects the recognition performance.But in practice,they are usually hand-tuned through experiment to obtain best recognition performance.This paper proposes the rescoring based method that uses bi-gram LM to optimize the parameters automatically,meanwhile the method of combine line search and Simulated Annealing(SA) search in parameters search space of rescoring,which is globally optimal and is insensitive to initial value of parameter.Experimental results show that the method can dramatically reduce the word error compared with empirical parameter setting method,and gains much faster optimization speed than classical N-best optimization.
出处 《计算机工程》 CAS CSCD 北大核心 2011年第16期158-160,163,共4页 Computer Engineering
基金 国家"863"计划基金资助项目(2006AA01z146)
关键词 词图重估 语言模型权值 解码 插入代价 word graph rescoring Language Model(LM) weight decoding insertion cost
  • 相关文献

参考文献6

  • 1杨善茜,黄汉明,蒋正锋,李锐.基于HTK的语音识别网络优化算法[J].计算机工程,2010,36(14):169-171. 被引量:3
  • 2Horbe Y,Minematsu N,Nakagawa S.Theoretical/ExperimentalInvestigation of Balance Between Acoustic Model Likelihood andLanguage Model Likelihood. IPSJ SIG Notes . 2000
  • 3Ito A,Kohda M,Makino S.Fast Optimization of LanguageModel Weight and Insertion Penalty from N-best Candidates. Acoustical Science and Technology . 2005
  • 4Mak B,Ko T.Min-max Discriminative Training of DecodingParameters Using Iterative Linear Programming. Proc.of the9th Annual Conference of the International Speech Communi-cation Association . 2008
  • 5Emori T,Onishi Y,Shinoda K.Automatic Estimation of ScalingFactors Among Probabilistic Models in Speech Recognition. Proc.of the 8th Annual Conference of the International SpeechCommunication Association . 2007
  • 6Hannani A E I,Hain T.Automatic Optimization of SpeechDecoder Parameters. IEEE Signal Processing Letters . 2010

二级参考文献2

  • 1Young S,Evermann G Gales M.The Hidden Markov Model Toolkit[EB/OL].(2005-10-20).http://htk.eng.cam.ac.uk/.
  • 2Jang R.Audio Signal Processing and Recognition[Z].(2009-05-30).http://neural.cs.nthu.edu.tw/jang/books/audioSignalProcessing/.

共引文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部