基于最大似然线性回归矩阵的说话人识别算法研究

Research on MLLR Based Speaker Recognition Algorithm

下载PDF

导出

摘要研究了将自适应领域的最大似然线性回归(Maximum likelihood linear regression,MLLR)变换矩阵作为特征进行文本无关的说话人识别算法.本文引入了基于统一背景模型的MLLRSV-SVM说话人识别算法,并在此基础上进行高层音素聚类以进一步提高识别性能.在采用多种信道补偿技术后,在NISTSRE2006年1训练语段-1测试语段同信道和跨信道数据库上,基于MLLR特征的系统与其他最好的系统性能接近并有很强的互补性,经过简单线性融合可以极大提高识别性能. This paper uses the maximum likelihood linear regression （MLLR） as feature for text-independent speaker recognition algorithm. We introduce a universal background model （UBM） based MLLRSV-SVM algorithm first, and then extend the algorithm to multi-class for improvement. After channel compensation, in terms of the NIST 2006 SRE lconv4w-lconv4w/mic corpus, the MLLR based system is comparable with and complementary of the state of the art systems. The performance is greatly improved by simply linear fusion.

作者钟山何亮邓妍刘加

机构地区清华大学电子工程系清华信息科学与技术国家实验室(筹)

出处《自动化学报》 EI CSCD 北大核心 2009年第5期546-550,共5页 Acta Automatica Sinica

基金国家高技术研究发展计划(863计划)(2006AA010101 2007AA04Z223) 国家自然科学基金委员会与微软亚洲研究院联合资助项目(60776800)资助~~

关键词说话人识别最大似然线性回归支持向量机信道补偿 Speaker recognition, maximum likelihood linear regression （MLLR）, support vector machine （SVM）, channel compensation

分类号 TP391.42 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献13

1Reynolds D A,Quatieri T F,Dunn R B.Speaker verification using adapted Gaussian mixture models.Digital Signal Processing,2000,10(1-3):19-41
2Campbell W M,Sturim D E,Reynolds D A.Support vector machines using GMM supervectors for speaker verification.IEEE Signal Processing Letters,2006,13(5):308-311
3Castaldo F,Colibro D,Dalmasso E,Laface P,Vair C.Compensation of nuisance factors for speaker and language recognition.IEEE Transactions on Audio,Speech,and Language Processing,2007,15(7):1969-1978
4Valr C,Colibro D,Castaldo F,Daimasso F,Laface P.Channel factors compensation in model and feature domain for speaker recognition.In:Proceedings of IEEE Odyssey:The Speaker and Language Recognition Workshop.San Juan,Puerto Rico:IEEE,2006.1-6
5Campbell W M,Sturim D E,Reynolds D A,Solomonoff A.SVM based speaker verification using a GMM supervector kernel and NAP variability compensation.In:Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing.Toulouse,France:IEEE,2006.97-100
6郭武,戴礼荣,王仁华.采用UBM更新量作为支持向量机特征的说话人确认[J].清华大学学报（自然科学版）,2008,48(S1):704-707. 被引量：4
7Stolcke A,Ferret L,Kajarekar S,Shriberg E,Venkataraman A.MLLR transforms as features in speaker recognition.In:Proceedings of the 9th European Conference on Speech Communication and Technology.Lisbon,Portugal:International Speech and Communication Association,2005.2425-2428
8Karam Z N,Campbell W M.A new kernel for SVM MLLR based speaker recognition.In:Proceedings of the 8th Conference in the Annual Series of Interspeech Events and the 10th Biennial Eurospeech Conference.Antwerp,Belgium:International Speech and Communication Association,2007.290-293
9Pavel M, Petr S, Jan C, Pavel C. Phonotactic language identification using high quality phoneme recognition. In: Proceedings of the 9th European Conference on Speech Communication and Technology. Lisbon, Portugal: International Speech and Communication Association, 2005. 2237-2240
10边肇祺,张学工,等.模式识别.北京:清华大学出版社,1999.

二级参考文献1

1陶卿,姚穗,范劲松,方廷健.一种新的机器学习算法:Support Vector Machines[J].模式识别与人工智能,2000,13(3):285-290. 被引量：30

共引文献21

1厉剑,杨玮龙,李攀.基于DSP并行结构的二叉树SVM多分类器[J].舰船电子工程,2007,27(1):110-113. 被引量：1
2周丹,蔡坤宝.基于短时傅立叶变换的脉象信号的模式识别方法[J].重庆科技学院学报（自然科学版）,2007,9(3):49-52. 被引量：6
3赵向军,路梅.垃圾邮件过滤算法研究[J].徐州师范大学学报（自然科学版）,2006,24(4):52-55. 被引量：1
4陆振波,章新华,康春玉.基于支持向量机的水中目标识别[J].信息与控制,2003,32(z1):739-742. 被引量：3
5赵杰,秦毅,李静.基于数据挖掘技术的SCADA系统不良数据状态估计[J].科技资讯,2007,5(30):6-7.
6王党卫,秦江敏.基于后置近邻函数准则的改进型模糊聚类算法[J].空军雷达学院学报,2002,16(2):32-34. 被引量：5
7胡文琳,王首勇.基于双谱对角切片的目标架次识别方法[J].空军雷达学院学报,2002,16(3):20-22. 被引量：3
8李建勋,秦江敏,马晓岩.基于神经网络的雷达抗应答式欺骗干扰方法[J].空军雷达学院学报,2003,17(4):19-21. 被引量：7
9胡红波,邱继进,马爱民.基于Matlab神经网络的水下目标识别[J].情报指挥控制系统与仿真技术,2005,27(5):52-54. 被引量：2
10陈伏兵,陈秀宏,王文胜,杨静宇.人脸识别中PCA方法的推广[J].计算机工程与应用,2005,41(34):34-38. 被引量：9

1钟山,刘加.MLLR特征的SVM语种识别算法[J].清华大学学报（自然科学版）,2009(S1):1283-1287.
2丰洪才,卢正鼎.基于MAP和MLLR的综合渐进自适应方法研究[J].计算机工程,2005,31(5):4-7. 被引量：3
3周宇,陈熙霖,赵德斌,姚鸿勋,高文.基于数据生成的手语识别自适应方法[J].高技术通讯,2009,19(12):1258-1264.
4李荟,赵云敏.特征音方法在说话人识别中的应用[J].计算机系统应用,2013,22(8):176-179.
5龙艳花,戴礼荣.采用M-矢量和支持向量机的说话人确认系统[J].华中科技大学学报（自然科学版）,2014,42(8):63-68. 被引量：2
6余姗姗,张亚琼.语音识别的自适应研究[J].福建电脑,2011,27(6):53-54.
7申铉京,翟玉杰,卢禹彤,王玉,陈海鹏.基于信道补偿的说话人识别算法[J].吉林大学学报（工学版）,2016,46(3):870-875. 被引量：3
8李香萍.MATLAB在说话人识别算法中的应用[J].实验室研究与探索,2008,27(1):70-72.
9钱洪伟,贺苏宁.说话人模型参数自适应技术研究[J].电信技术研究,2008(5):16-22.
10张雨虹,刘玉民,张小松.一种新的基于混沌序列的图像加密方法[J].唐山学院学报,2008,21(2):16-18.

自动化学报

2009年第5期

浏览历史

内容加载中请稍等...

基于最大似然线性回归矩阵的说话人识别算法研究

参考文献13

二级参考文献1

共引文献21

相关作者

相关机构

相关主题

浏览历史