Speech recognition error correction scheme based on divide-and-conquer

Cited by: 1

Abstract: This paper presents a divide-and-conquer scheme for correcting speech recognition errors. A confusion network (CN) is first used to transform the continuous speech recognition problem into a sequence of independent classification sub-tasks. Each sub-task can be treated as an isolated-word recognition problem, and specialized support vector machines (SVMs) are trained to discriminate among the recognition candidates in the CN. A fast speech vector alignment method based on codebook transformation is proposed to solve the problem that variable-length speech vectors cannot be used directly as SVM input. Experimental results on a Mandarin syllable recognition task show that the scheme effectively improves the system's recognition accuracy.
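The abstract does not spell out the codebook transformation, so the following is only a minimal sketch of the general idea, under stated assumptions: each frame of a variable-length segment is assigned to its nearest codeword, and the per-codeword frame means are concatenated into a vector whose length depends only on the codebook size. The function names, the toy codebook, and the mean-pooling choice are all hypothetical illustrations, not the paper's actual method. A second helper shows the per-confusion-set decision as a simple highest-score pick, standing in for the trained SVMs.

```python
import math


def nearest_codeword(frame, codebook):
    """Return the index of the codeword closest to `frame` (Euclidean distance)."""
    return min(range(len(codebook)), key=lambda k: math.dist(frame, codebook[k]))


def fixed_length_vector(frames, codebook):
    """Map a variable-length list of frames to a vector of length K * D.

    Assumption (not from the paper): each codeword slot holds the mean of the
    frames assigned to it, or zeros when no frame was assigned.
    """
    dim = len(codebook[0])
    sums = [[0.0] * dim for _ in codebook]
    counts = [0] * len(codebook)
    for frame in frames:
        k = nearest_codeword(frame, codebook)
        counts[k] += 1
        for d in range(dim):
            sums[k][d] += frame[d]
    vector = []
    for k, total in enumerate(sums):
        n = counts[k]
        vector.extend(x / n if n else 0.0 for x in total)
    return vector


def pick_candidate(candidates, scores):
    """Choose the confusion-set candidate with the highest classifier score."""
    best = max(range(len(scores)), key=scores.__getitem__)
    return candidates[best]


# Toy data: a 2-codeword codebook over 2-dimensional frames (hypothetical values).
codebook = [[0.0, 0.0], [10.0, 10.0]]
short_seg = [[0.0, 1.0], [9.0, 11.0]]
long_seg = [[1.0, 0.0], [0.0, 1.0], [10.0, 9.0], [11.0, 10.0], [9.0, 11.0]]

v_short = fixed_length_vector(short_seg, codebook)
v_long = fixed_length_vector(long_seg, codebook)
```

Segments of two and five frames both come out as 4-dimensional vectors here, which is the property a fixed-input classifier such as an SVM needs. Whether the paper pools by mean, by nearest frame, or by some other rule is not stated in the abstract.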

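Author: Sun Chengli

Affiliation: School of Information Engineering, Nanchang Hangkong University

Source: Application Research of Computers (《计算机应用研究》), CSCD, Peking University Core Journal, 2010, No. 10, pp. 3841-3843 (3 pages)

Funding: National Natural Science Foundation of China (60705019); Nanchang Hangkong University Talent Fund (2009ZC56)

Keywords: speech recognition; error correction; confidence measure; support vector machine

Classification number: TN911.22 [Electronics and Telecommunications: Communication and Information Systems]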

References (9)

1. BRILL E. Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging[J]. Computational Linguistics, 1995, 21(4): 543-565.
2. JELINEK F. Speech recognition as code-breaking, Technology Report[R]. 1996.
3. VAPNIK V. The nature of statistical learning theory[M]. New York: Springer-Verlag, 1995.
4. MANGU L, BRILL E, STOLCKE A. Finding consensus in speech recognition: word error minimization and other applications of confusion networks[J]. Computer Speech and Language, 2000, 14(4): 373-400.
5. PLATT J C. Probabilities for SV machines[M]. [S.l.]: MIT Press, 2000: 61-74.
6. WAN V, RENALS S. Speaker verification using sequence discriminant support vector machines[J]. IEEE Trans Speech and Audio Processing, 2005, 13(2): 203-210.
7. JAAKKOLA T S, HAUSSLER D. Exploiting generative models in discriminative classifiers[M]//Advances in Neural Information Processing Systems 11. [S.l.]: MIT Press, 1999.
8. VENKATARAMANI V, CHAKRABARTTY S, BYRNE W. GiniSupport vector machines for segmental minimum Bayes risk decoding of continuous speech[J]. Computer Speech & Language, 2007, 21(3): 423-442.
9. VOJTECH F, VACLAV H. Statistical pattern recognition toolbox for MATLAB[EB/OL]. http://cmp.felk.cvut.cz.

Co-cited literature (5)

1. Xu Weiran, Du Gang, Chen Guang, Guo Jun, Yang Jie. Unsupervised Feature Selection for Latent Dirichlet Allocation[J]. China Communications, 2011, 8(5): 54-62. Cited by: 1
2. Liu Gang, Chen Wei, Guo Jun. Research on an evaluation algorithm for Chinese continuous speech recognition results[J]. China Communications, 2010, 7(2): 132-138. Cited by: 3
3. Liu Gang, Chen Wei, Guo Jun. Novel Active Learning Method for Speech Recognition[J]. China Communications, 2010, 7(5): 29-39. Cited by: 1
4. He Tingting, Li Fang. Semantic Knowledge Acquisition from Blogs with Tag-Topic Model[J]. China Communications, 2012, 9(3): 38-48. Cited by: 3
5. Xiao, Sun. Discriminative Latent Model Based Chinese Multiword Expression Extraction[J]. China Communications, 2012, 9(3): 124-133. Cited by: 2

Citing literature (1)

1. Chang Fengxiang, Li Baoxiang, Liu Gang, Guo Jun. Candidate Expansion Algorithm Based on Weighted Syllable Confusion Matrix for Mandarin LVCSR[J]. China Communications, 2013, 10(7): 104-112. Cited by: 2

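Second-level citing literature (2)

1. Xu Biwei, Su Chengli, Yang Wei, Cao Jiangtao. Isolated-word speech recognition based on DTW and EMD[J]. Journal of Liaoning Petrochemical University, 2018, 38(1): 74-78. Cited by: 2
2. Zhao Lina. Design and functional implementation of English pronunciation recognition on an embedded real-time system[J]. China-Arab States Science and Technology Forum (Chinese and English), 2021(1): 85-88.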
