MLLR特征的SVM语种识别算法

MLLR based SVM language identification algorithm

导出

摘要为了挖掘更多语种间区分性信息进行可靠的自动语种识别,本文提出一种将自适应领域的最大似然线性回归(maximum likelihood linear regression,MLLR)矩阵作为特征的语种识别算法。该算法首先对每个语种训练Gauss混合模型(Gaussian mixture model,GMM),然后对每个语音段在所有语种的GMM上计算MLLR矩阵。将得到的多类MLLR矩阵经归一化后拼接形成超矢量作为特征输入支持向量机(support vector machine,SVM)分类器进行训练和识别。比较了均值方差和排序两种归一化方法,并将多类MLLR-SVM算法与传统GMM语种识别算法进行对比。实验表明:排序归一化算法优于传统的均值方差归一化;建立在GMM模型基础上的MLLR-SVM系统性能有9.7%的提升,并与GMM分类器有很强的互补性。 This paper presents a language identification algorithm based on maximum likelihood linear regression(MLLR).The algorithm first trains the language dependent Gaussian mixture models(GMMs),calculates the MLLR transforms for every speech segment from the GMMs,and then combines the MLLRs to form supervectors for support vector machine(SVM) classifier training and testing after normalization.Tests comparing mean/variance normalization with rank normalization and the current MLLR-SVM system with the GMM classifier show that rank normalization outperforms the traditional mean/variance normalization With the MLLR-SVM system 9.7% better than the GMM classifier,but can complement the GMM classifier results.

作者钟山刘加

机构地区清华大学电子工程系

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2009年第S1期1283-1287,共5页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金资助项目(60776800) 国家"八六三"高技术项目(2006AA010101 2007AA04Z223 2008AA02Z414)

关键词语种识别语音段最大似然线性回归(MLLR) 支持向量机(SVM) language identification speech segment maximum likelihood linear regression (MLLR) support vector machine(SVM)

分类号 TP391.42 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献13

1TONG Rong,MA Bin,ZHU Donglai,et al.Integratingacoustic,prosodic and phonotactic features for spokenlanguage identification. Proc ICASSP . 2006
2Torres-Carrasquillo P A,Siner E,Kohler M A,et al.Approaches to language identification using Gaussian mixutremodels and shifted delta cepstral features. ProcInternational Conference on Spoken Language Processing . 2002
3Campbell W M,Torres-Carrasquillo P A,Reynolds D A.Language recognition with support vector machines. Proc IEEE Odyssey . 2004
4Burget L,Matejka P,Cernocky J.Discriminative trainingtechniques for acoustic language identification. ProcICASSP . 2006
5Castaldo F,Colibro D,Dalmasso E,et al.Acoustic languageidentification using fast discriminative training. ProcInterspeech . 2007
6Stolcke A,Ferrer L,Kajarekar S,et al.MLLR transformsas features in speaker recognition. Proc 9th Eur.Conf.Speech Commun.Technol . 2005
7Karam A N,Campbell W M.A new kernel for SVM MLLRbased speaker recognition. Proc Interspeech . 2007
8Stolcke A,Kajarekar S,Ferrer L,et al.Speaker recognitionwith session variability normalization based on MLLRadaptation transforms. IEEE Trans Audio,Speech andLanguage Processing . 2007
9Stolcke A,Ferrer L,Kajarekar S.Improvements inMLLR-transform-based speaker recognition. Proc IEEEOdyssey-The Speaker and Language Recognition Workshop . 2006
10Stolcke A,Kajarekar S,Ferrer L.Nonparametric featurenormalization for SVM-based speaker verification. ProcICASSP . 2008

1丰洪才,卢正鼎.基于MAP和MLLR的综合渐进自适应方法研究[J].计算机工程,2005,31(5):4-7. 被引量：3
2钟山,何亮,邓妍,刘加.基于最大似然线性回归矩阵的说话人识别算法研究[J].自动化学报,2009,35(5):546-550.
3李荟,赵云敏.特征音方法在说话人识别中的应用[J].计算机系统应用,2013,22(8):176-179.
4周宇,陈熙霖,赵德斌,姚鸿勋,高文.基于数据生成的手语识别自适应方法[J].高技术通讯,2009,19(12):1258-1264.
5余姗姗,张亚琼.语音识别的自适应研究[J].福建电脑,2011,27(6):53-54.
6钱洪伟,贺苏宁.说话人模型参数自适应技术研究[J].电信技术研究,2008(5):16-22.
7蒋泰,张林军.语音识别自适应算法在智能家居中的应用[J].计算机系统应用,2017,26(3):150-155. 被引量：3
8LU Yong WU Zhenyang.Maximum likelihood polynomial regression for robust speech recognition[J].Chinese Journal of Acoustics,2011,30(3):358-370.
9丁国宏,徐波.基于三对角和共享分块对角转换矩阵的快速说话人自适应方法[J].电子学报,2004,32(10):1709-1712.
10晁浩,宋成,彭维平.基于发音特征的声效相关鲁棒语音识别算法[J].计算机应用,2015,35(1):257-261. 被引量：8

清华大学学报（自然科学版）

2009年第S1期

浏览历史

内容加载中请稍等...

MLLR特征的SVM语种识别算法

参考文献13

相关作者

相关机构

相关主题

浏览历史