Combination of Multiple Predictors for Correct-Incorrect Classification of Output Words in Real-Time Continuous Speech Recognition

Original Chinese title: 多预测子融合实时连续语音识别输出词正误判别


Abstract: This paper addresses real-time decoding in continuous speech recognition, where the final output is produced by a stack decoder that rescores the word trellis. Under this strategy, a decision tree combines multiple predictors to classify each recognized output word as correct or incorrect. Thirteen predictors are constructed, including word posterior probability, word length, the posterior probabilities of neighboring words, and the acoustic and language scores of the word. Different predictor combinations and tree-building parameters are then tested to select the best combination of predictors and to build an optimized decision tree for correct-incorrect classification of output words. The experimental results show that combining local word posterior probabilities (LWPP), computed on a local word graph, with several other real-time predictors, mainly word length and the LWPPs of neighboring words, improves correct-incorrect classification of output words, and that this combination outperforms the corresponding n-best-list results in both time consumption and classification quality. Relative to the baseline system, the classification error rate drops by 41.4%. The results also show that the posterior probabilities of neighboring words, proposed in this paper, are among the relatively important predictors.
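The idea of fusing several predictors with a decision tree can be illustrated with a minimal sketch. It is not the authors' implementation: the three predictors used here (word posterior, word length in frames, mean posterior of the neighboring words) are a subset chosen for illustration, all numeric values are invented toy data, and the entropy-based threshold splitting is a generic tree-building rule assumed for the example.

```python
import math
from collections import Counter

# Hypothetical toy data, invented for illustration only.
# Each row: ((word_posterior, length_in_frames, neighbor_posterior), label)
# label 1 = word recognized correctly, 0 = word recognized incorrectly.
TRAIN = [
    ((0.95, 30, 0.90), 1),
    ((0.90, 12, 0.85), 1),
    ((0.85, 25, 0.40), 1),
    ((0.60, 28, 0.88), 1),
    ((0.55, 35, 0.80), 1),
    ((0.58, 10, 0.30), 0),
    ((0.62, 26, 0.25), 0),
    ((0.25, 27, 0.82), 0),
    ((0.15, 32, 0.20), 0),
    ((0.30, 9, 0.35), 0),
]


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def best_split(rows):
    """Return (gain, feature_index, threshold) with the highest information gain."""
    base = entropy([y for _, y in rows])
    best = None
    for f in range(len(rows[0][0])):
        values = sorted({x[f] for x, _ in rows})
        # Candidate thresholds lie midway between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for x, y in rows if x[f] <= t]
            right = [y for x, y in rows if x[f] > t]
            weighted = (len(left) * entropy(left) + len(right) * entropy(right)) / len(rows)
            gain = base - weighted
            if best is None or gain > best[0]:
                best = (gain, f, t)
    return best


def build_tree(rows, max_depth=3):
    """Build a tree: a leaf is a label, an inner node is (feature, threshold, left, right)."""
    labels = [y for _, y in rows]
    majority = Counter(labels).most_common(1)[0][0]
    if max_depth == 0 or len(set(labels)) == 1:
        return majority
    split = best_split(rows)
    if split is None or split[0] <= 0:
        return majority
    _, f, t = split
    left = [r for r in rows if r[0][f] <= t]
    right = [r for r in rows if r[0][f] > t]
    return (f, t, build_tree(left, max_depth - 1), build_tree(right, max_depth - 1))


def classify(tree, x):
    """Walk the tree until a leaf label (0 or 1) is reached."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if x[f] <= t else right
    return tree


tree = build_tree(TRAIN)
print(classify(tree, (0.92, 20, 0.85)))  # 1: high posterior, confident neighbors
print(classify(tree, (0.90, 20, 0.10)))  # 0: high posterior, but unreliable neighbors
```

On this toy set the root split falls on the neighbor posterior and the second split on the word's own posterior, so a word with a high posterior can still be rejected when its neighbors look unreliable. This only shows how the fusion works mechanically and says nothing about the predictor ranking or error rates reported in the paper.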

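Authors: Fu Yuewen (付跃文), Du Limin (杜利民)

Affiliation: Center for Speech Interactive Information Technology, Institute of Acoustics, Chinese Academy of Sciences

Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2005, No. 6, pp. 84-91 (8 pages)

Funding: National Key Basic Research Development Program of China (973 Program), grant G1998030505

Keywords: computer application; Chinese information processing; continuous speech recognition; predictor; decision tree

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]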



