基于词图重估的语音解码参数优化方法

Speech Decoding Parameters Optimization Method Based on Word Graph Rescoring

下载PDF

导出

摘要在大词汇连续语音识别系统中,语言模型权值和插入代价等语音解码参数对系统的识别率有较大的影响,而在实际应用中常通过实验手动调整其值寻求最佳识别结果。为此,提出一种利用二元文法进行词图重估的方法,自动优化语音解码参数。在重估的参数空间搜索过程中采用线性搜索与模拟退火搜索相结合的方法,使优化参数具有全局最优和对初值稳定性强的优点。实验结果表明,相比凭经验设置的参数,该方法估计出的参数值能大幅降低识别词错误率,与经典的N-best优化相比,其优化速度有较大提升。 In Large Vocabulary Continuous Speech Recognition（LVCSR） system,speech decoding parameters——Language Model（LM） weight and insertion cost can greatly affects the recognition performance.But in practice,they are usually hand-tuned through experiment to obtain best recognition performance.This paper proposes the rescoring based method that uses bi-gram LM to optimize the parameters automatically,meanwhile the method of combine line search and Simulated Annealing（SA） search in parameters search space of rescoring,which is globally optimal and is insensitive to initial value of parameter.Experimental results show that the method can dramatically reduce the word error compared with empirical parameter setting method,and gains much faster optimization speed than classical N-best optimization.

作者尹明明屈丹李弼程黄山奇

机构地区解放军信息工程大学信息工程学院

出处《计算机工程》 CAS CSCD 北大核心 2011年第16期158-160,163,共4页 Computer Engineering

基金国家"863"计划基金资助项目(2006AA01z146)

关键词词图重估语言模型权值解码插入代价 word graph rescoring Language Model（LM） weight decoding insertion cost

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献6

1杨善茜,黄汉明,蒋正锋,李锐.基于HTK的语音识别网络优化算法[J].计算机工程,2010,36(14):169-171. 被引量：3
2Horbe Y,Minematsu N,Nakagawa S.Theoretical/ExperimentalInvestigation of Balance Between Acoustic Model Likelihood andLanguage Model Likelihood. IPSJ SIG Notes . 2000
3Ito A,Kohda M,Makino S.Fast Optimization of LanguageModel Weight and Insertion Penalty from N-best Candidates. Acoustical Science and Technology . 2005
4Mak B,Ko T.Min-max Discriminative Training of DecodingParameters Using Iterative Linear Programming. Proc.of the9th Annual Conference of the International Speech Communi-cation Association . 2008
5Emori T,Onishi Y,Shinoda K.Automatic Estimation of ScalingFactors Among Probabilistic Models in Speech Recognition. Proc.of the 8th Annual Conference of the International SpeechCommunication Association . 2007
6Hannani A E I,Hain T.Automatic Optimization of SpeechDecoder Parameters. IEEE Signal Processing Letters . 2010

二级参考文献2

1Young S,Evermann G Gales M.The Hidden Markov Model Toolkit[EB/OL].(2005-10-20).http://htk.eng.cam.ac.uk/.
2Jang R.Audio Signal Processing and Recognition[Z].(2009-05-30).http://neural.cs.nthu.edu.tw/jang/books/audioSignalProcessing/.

共引文献2

1袁浩,李海洋,郑铁然,韩纪庆.基于相邻帧特征相似性的快速关键词检出方法[J].计算机工程,2012,38(7):287-289.
2刘琼.几种开源英语识别工具包的对比分析[J].计算技术与自动化,2018,37(4):123-127. 被引量：3

1努尔麦麦提.尤鲁瓦斯,吾守尔.斯拉木,热依曼.吐尔逊.基于音节的维吾尔语大词汇连续语音识别系统[J].清华大学学报（自然科学版）,2013,53(6):741-744. 被引量：5
2那斯尔江.吐尔逊,吾守尔.斯拉木.基于隐马尔可夫模型的维吾尔语连续语音识别系统[J].计算机应用,2009,29(7):2009-2011. 被引量：17
3张文杰,李建中,张炜.E3D R-Tree:一种处理移动对象数据库历史查询的索引结构[J].计算机科学,2005,32(9):103-107.
4努尔麦麦提.尤鲁瓦斯,吾守尔.斯拉木,热依曼.吐尔逊.维吾尔语大词汇语音识别系统识别单元研究[J].北京大学学报（自然科学版）,2014,50(1):149-152. 被引量：4
5邓岳贵.启发式搜索在网络爬虫中应用的分析[J].教育技术导刊,2008(2):80-82. 被引量：7
6陆国丽,王小华,王荣波.最大词重降维算法与模拟退火算法相结合的文本聚类方法研究[J].现代图书情报技术,2008(12):43-47. 被引量：2
7桑农,张涛,李斌,吴翔.基于字典学习的背景建模[J].华中科技大学学报（自然科学版）,2013,41(9):28-31. 被引量：2
8张弛,付媛媛,贾丽媛.自适应蚁群算法在TSP问题中的应用[J].湖南城市学院学报（自然科学版）,2011,20(1):54-57.
9张剑,屈丹,李真.基于循环神经网络语言模型的N-best重打分算法[J].数据采集与处理,2016,31(2):347-354. 被引量：3
10肖敏,刘宇红.SD卡硬件加密在工业MP3中的实现[J].通信技术,2012,45(11):34-36. 被引量：1

计算机工程

2011年第16期

浏览历史

内容加载中请稍等...

基于词图重估的语音解码参数优化方法

参考文献6

二级参考文献2

共引文献2

相关作者

相关机构

相关主题

浏览历史