低空间复杂度的加权有限状态转换器合成算法

Low space-complexity composition algorithm for weighted finite-state transducers

下载PDF

导出

摘要利用加权有限状态转换器相关的合成操作,可以将语音识别需要的模型进行组合,便于识别中各种知识的综合利用,从而提升识别性能。传统合成算法在计算的同时存储了无效状态与状态转移。在进行词典与语言模型等合成操作时,算法需要1 GB甚至更多内存保存无效信息,这直接导致了算法的高空间复杂度。为解决这一问题,提出同步裁剪合成算法(synchronized pruning composition algorithm,SPCA)。新算法对传统合成算法进行了改进,在合成的同时对无效信息进行及时的分析和去除。实验表明,与经典的合成算法相比,SPCA平均节约内存14.99%,所用最大内存节约25.72%,有效降低了合成的空间复杂度。 The WFST-related composition algorithm could be used to integrate recognition models together to facilitate the utilization of knowledge during speech recognition and to improve the recognition system＇s performance.The general composition algorithm stores lots of useless states and transitions during it runs.It needs 1 GB or more memory to save the useless info when compose dictionary and language models,which impact the algorithm＇s space complexity.To solve this problem,this paper developed a SPCA.It improved the general composition method.With the new method,the composition and removing useless info were done simultaneously.Experiments shows that the improved method achieves 14.99% and 25.72% in average and maximum memory reduction compared with the general method,and effectively reduces the composition＇s space complexity.

作者李伟吴及吕萍

机构地区清华大学电子工程系

出处《计算机应用研究》 CSCD 北大核心 2011年第8期2931-2934,共4页 Application Research of Computers

关键词加权有限状态转换器合成有向图空间复杂度语音识别 WFST（weighted finite-state transducer） composition digraph space-complexity speech recognition

分类号 TP301.1 [自动化与计算机技术—计算机系统结构] TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献5

1OU Zhi-jian, XIAO Ji. A study of large vocabulary spoeeh recognition decoding using finite-state graphs[ C]//Proc of the 7th International Symposium on Chinese Language Processing. Tainan: IEEE Press, 2010:123-128.
2MEHRYAR M, FERNANDO P, MICHAEL R. Weighted finite-state transducers in speech recognition[ J]. Computer Speech and Lan- guage,2002,16( 1 ) :69-88.
3SHINJI W, TAKAAKI H, ERIK M, et al. A discriminative model for continuous speech recognition based on weighted finite-state transducers[ C]//Proc of International Conference on Acoustics, Speech and Signal Processing. Dallas: IEEE Press,2010:4922-4925.
4MAIDER L, IZHAK S. Learning a discriminative weighted tiniteState transducer for speech recognition [ J]. IEEE Trans on Audio, Speech, and Language Processing,2010,18(8):1-16.
5MEHRYAR M. Weighted .automata algorithms [ M ]. Heidelberg: Springer, 2009:213-254.

1庞薇,徐波.基于多模型融合的人名翻译系统[J].中文信息学报,2009,23(1):44-49. 被引量：2
2陆梨花,张连海,陈琦.基于加权有限状态转换器的语音查询项检索技术[J].数据采集与处理,2015,30(2):390-398. 被引量：2
3王水成.红外遥控器遥控无效故障及其分析[J].中国有线电视,2009(1):98-98.
4业内心声[J].印制电路资讯,2011(4):14-14.
5牛牛.找到依稀记得的网站[J].电脑迷,2011(10):78-78.
6孙浩天,王志鹏,孔德举,都基泽.一种基于二维码的信息隐藏方法[J].电脑知识与技术,2016,0(6):77-80. 被引量：1
7王春梅,范通让.维护Agent状态的定位和检测[J].河北省科学院学报,2013,30(3):12-17.
8牟晓东.当PDF被“禁止打印”时[J].电脑知识与技术（经验技巧）,2017,0(1):21-22.
9陆梨花,张连海.基于音素混淆模型的集外词查询项扩展方法[J].信息工程大学学报,2014,15(4):459-465. 被引量：1
10张霓,陈天天,何熊熊.基于数据场和单次划分的聚类算法[J].浙江工业大学学报,2016,44(1):52-57. 被引量：9

计算机应用研究

2011年第8期

浏览历史

内容加载中请稍等...

低空间复杂度的加权有限状态转换器合成算法

参考文献5

相关作者

相关机构

相关主题

浏览历史