
Decoding algorithm of integrating phonetic string edit distance into stochastic segment models

(Original Chinese title: 融合音素串编辑距离的随机段模型解码算法)
Abstract: During decoding, the path that is optimal in terms of acoustic characteristics carries important information about whether the current hypothesized path is correct. Based on this observation, a decoding optimization method for stochastic segment model systems is proposed. A context-dependent phonetic string edit distance model is trained to measure the similarity between the current hypothesized path and the acoustically optimal path, and during N-best rescoring the phonetic string edit distance is added to the total score of the path. Continuous speech recognition experiments on the "863-test" set show a relative reduction of 8.1% in the Chinese character error rate. The results demonstrate the feasibility of applying phonetic string edit distance to stochastic segment models.
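Author: 晁浩 (Chao Hao)
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list (北大核心), 2015, No. 6, pp. 208-211 (4 pages)
Funding: Henan Province Basic and Frontier Technology Research Program (No. 132300410332)
Keywords: speech recognition; phonetic string edit distance; stochastic segment model; decoding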
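
The rescoring idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the paper trains a context-dependent phonetic string edit distance model, whereas this sketch swaps in a plain Levenshtein distance with uniform costs. The names `phone_edit_distance`, `rescore_nbest`, the `weight` parameter, and the assumption that each N-best entry carries a phone sequence and a base score (higher is better) are all hypothetical, and the acoustically optimal phone sequence is assumed to be supplied by the decoder.

```python
from typing import List, Sequence, Tuple


def phone_edit_distance(
    hyp: Sequence[str],
    ref: Sequence[str],
    sub_cost: float = 1.0,
    ins_cost: float = 1.0,
    del_cost: float = 1.0,
) -> float:
    """Levenshtein distance between two phone sequences.

    Uniform costs are an assumption; the paper learns a
    context-dependent edit distance model instead.
    """
    # prev[j] = cost of turning an empty hypothesis into ref[:j]
    prev = [j * ins_cost for j in range(len(ref) + 1)]
    for i, h in enumerate(hyp, start=1):
        curr = [i * del_cost]  # cost of deleting the first i hypothesis phones
        for j, r in enumerate(ref, start=1):
            curr.append(
                min(
                    prev[j] + del_cost,  # delete h
                    curr[j - 1] + ins_cost,  # insert r
                    prev[j - 1] + (0.0 if h == r else sub_cost),  # match or substitute
                )
            )
        prev = curr
    return prev[-1]


def rescore_nbest(
    nbest: List[Tuple[List[str], float]],
    acoustic_best: Sequence[str],
    weight: float = 1.0,
) -> List[Tuple[List[str], float]]:
    """Subtract a weighted edit-distance penalty from each base score.

    `nbest` holds (phone sequence, base score) pairs; `weight` is a
    hypothetical tuning parameter. Returns the list sorted best first.
    """
    rescored = [
        (phones, score - weight * phone_edit_distance(phones, acoustic_best))
        for phones, score in nbest
    ]
    rescored.sort(key=lambda item: item[1], reverse=True)
    return rescored
```

In this sketch a hypothesis that is closer to the acoustically optimal phone string receives a smaller penalty, so it can overtake a competitor with a slightly higher base score. How the penalty should be scaled against acoustic and language model scores is not stated in the abstract, so `weight` would need to be tuned.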

