Mandarin Digits Speech Recognition Using Support Vector Machines 被引量：2

Mandarin Digits Speech Recognition Using Support Vector Machines

下载PDF

导出

摘要 A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited. A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.

作者谢湘匡镜明

机构地区 School of Information Science and Technology School

出处《Journal of Beijing Institute of Technology》 EI CAS 2005年第1期9-12,共4页 北京理工大学学报（英文版）

基金 theNationalNaturalScienceFoundation(60372089)

关键词 speech recognition support vector machine (SVM) kernel function speech recognition support vector machine (SVM) kernel function

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献8

1Sch lkopfB,,BurgesCJC,SmolaAJ.Advancesinker nelmethods—Supportvectorlearning[]..1999
2JoachimsT.Textcategorizationwithsupportvectorma chines:Learningwithmanyrelevantfeatures[].thEuropeanConferenceonMachineLearning.1998
3HearstMA,Sch lkopfB,DumaisS,etal.Trendsand controversies—Supportvectormachines[].IEEEIn telligentSystems.1998
4CunLY,JackelLD,BottouL,etal.Comparisonof learningalgorithmsforhandwrittendigitrecognition[].ICANN’.1995
5FriedmanJH.Anotherapproachtopolychotomousclas sification[]..1996
6YoungS,KershawD,OdellJ,etal.TheHTKbook (v3.0)[]..2000
7Chang C C,Lin CJ.LIBSVM:a Libraryfor Support Vector Ma-chines. http://www.csie.ntu.edu.tw/~cjlin/libsvm/ . 2002
8Vapnik,V. The Nature of Statistical Learning Theory . 1995

同被引文献54

1庄东,陈英.基于加权近似支持向量机的文本分类[J].清华大学学报（自然科学版）,2005,45(S1):1787-1790. 被引量：16
2马立权,李维,蔡韩辉,路莹,李歆.手写数字识别中的预处理技术研究[J].仪器仪表学报,2001,22(z2):263-265. 被引量：12
3邬啸,魏延,吴瑕.改进的双隶属度模糊支持向量机[J].重庆师范大学学报（自然科学版）,2011,28(5):49-52. 被引量：5
4赖苏,熊忠阳,江帆,唐蓉君.利用改进的多项式核函数支持向量机进行文本分类[J].重庆大学学报（自然科学版）,2012,35(S1):41-45. 被引量：2
5应伟,王正欧,安金龙.一种基于改进的支持向量机的多类文本分类方法[J].计算机工程,2006,32(16):74-76. 被引量：28
6曾水玲,徐蔚鸿.基于支持向量机的手写体数字识别[J].计算机与数字工程,2006,34(10):104-106. 被引量：9
7李攀,杨玮龙,厉剑.基于DTW/SVM的语音识别系统在DSP中的实现[J].电声技术,2006,30(9):40-44. 被引量：4
8王欢良,韩纪庆,李海峰,郑铁然.基于HMM/SVM两级结构的汉语易混淆语音识别[J].模式识别与人工智能,2006,19(5):578-584. 被引量：4
9朱齐丹,张智,邢卓异.支持向量机改进序列最小优化学习算法[J].哈尔滨工程大学学报,2007,28(2):183-188. 被引量：10
10李向东,王进华.支持向量机分解算法研究[J].计算机与数字工程,2007,35(5):9-12. 被引量：2

引证文献2

1白静,杨利红,张雪英.一种面向语音识别的抗噪SVM参数优化方法[J].中南大学学报（自然科学版）,2013,44(2):604-611. 被引量：9
2汪海燕,黎建辉,杨风雷.支持向量机理论及算法研究综述[J].计算机应用研究,2014,31(5):1281-1286. 被引量：204

二级引证文献213

1李曙光,张新泉.沙钢冷轧原料库行车无人化技术应用[J].冶金自动化,2021,45(S01):12-15. 被引量：1
2王增政,王岩松,郭辉,袁涛,郑立辉,孙裴.基于LS-SVR的高速列车车内声品质主观评价[J].智能计算机与应用,2022,12(2):191-195. 被引量：1
3支余庆.利用串联谐振耐压现场检出和处理GIS缺陷[J].高电压技术,2000,26(2):78-79. 被引量：4
4牟宗萍,郑波,孙宗花,郭富荣,郭安余.新生儿惊厥的临床特点及病因分析(附124例报告)[J].新生儿科杂志,2000,15(1):32-33. 被引量：15
5汪海燕,黎建辉,杨风雷.支持向量机理论及算法研究综述[J].计算机应用研究,2014,31(5):1281-1286. 被引量：204
6李远远,梅红波,任晓杰,胡旭东,李梦迪.基于确定性系数和支持向量机的地质灾害易发性评价[J].地球信息科学学报,2018,20(12):1699-1709. 被引量：56
7熊静玲,朱西存,高华光,于瑞阳,温新.基于MSC与SVM的夯土齐长城土壤含水率高光谱估测[J].土壤学报,2018,55(6):1336-1344. 被引量：7
8马云飞.基于建模仿真的战车分类算法研究[J].电子技术（上海）,2014(8):9-14. 被引量：1
9卢曼丽.基于K-means算法的神经网络文本分类算法研究[J].中国管理信息化,2014,17(21):80-82. 被引量：1
10刘红芬,张雪英,刘晓峰,黄丽霞,王子中.基于特征加权的FSVM在低信噪比语音识别中的应用[J].太原理工大学学报,2014,45(6):764-768.

1FadhilH.T.Al-dulaimy,王作英,田野.Adaptive Compensation Algorithm in Open Vocabulary Mandarin Speaker-Independent Speech Recognition[J].Tsinghua Science and Technology,2002,7(5):521-526.
2WANG Chengyou,TANG Shuqi,LIANG Diannong,CHEN Huihuang and TANG Zhaojing(National University of Defence Technology Changsha 410073)Received.The methods for combining the information of various kinds of features in speech recognition[J].Chinese Journal of Acoustics,1997,16(2):115-120.
3RUAN Xiu-kai,ZHANG Zhi-yong.Signal Blind Recovery in Communication Systems Using Support Vector Machines[J].南京邮电大学学报（自然科学版）,2010,30(1):1-5.
4HOU Limin HUANG Zhenhua XIE Juanmin.Spectrum warping based on sub-glottal resonances in speaker-independent speech recognition[J].Chinese Journal of Acoustics,2011,30(4):427-436.
5South Africa： Learning Mandarin More Than Personal Interest in South Africa[J].海外华文教育动态,2016(9):70-71.
6谢湘,匡镜明.Novel Extended Phonemic Set for Mandarin Continuous Speech Recognition[J].Journal of Beijing Institute of Technology,2003,12(4):399-402.
7Learning Mandarin from Birds[J].海外华文教育动态,2016(2):74-74.
8Liu Gang Chen Wei Guo Jun.Novel Active Learning Method for Speech Recognition[J].China Communications,2010,7(5):29-39. 被引量：1
9手势操控电子设备[J].销售与管理,2012(11):17-17.
10赵军辉,谢湘,匡镜明.Linear Discriminant Analysis and Kernel Vector Quantization for Mandarin Digits Recognition[J].Journal of Beijing Institute of Technology,2004,13(4):385-388.

Journal of Beijing Institute of Technology

2005年第1期

浏览历史

内容加载中请稍等...

Mandarin Digits Speech Recognition Using Support Vector Machines 被引量：2

参考文献8

同被引文献54

引证文献2

二级引证文献213

相关作者

相关机构

相关主题

浏览历史