高斯PLDA在说话人确认中的应用及其联合估计被引量：3

Gaussian PLDA for Speaker Verification and Joint Estimation

下载PDF

导出

摘要近年来,基于总变化因子的说话人识别方法成为说话人识别领域的主流方法.其中,概率线性鉴别分析(Probabilistic linear discriminant analysis,PLDA)因其优异的性能而得到学者们的广泛关注.然而,在估计PLDA模型时,传统的因子分析方法只更新模型空间,因此,模型均值不能很好地与更新后的模型空间耦合.提出联合估计法对模型均值和模型空间同时估计,得到更为严格的期望最大化更新公式,在美国国家标准与技术局说话人识别评测2010扩展测试数据库以及2012核心测试数据库上,等错率得到一定提升. Recently the approaches based on i-vector have become very popular in the speaker recognition domain. Among these methods, the probabilistic linear discriminant analysis （PLDA） has attracted much attention due to its promising performance. However, the traditional factor analysis method only updates model space, thus making model mean couple with the model space unsuitably. This paper propose an approach of joint estimation for both model mean and model space, resulting in more strict expectation maximization （EM） formula. The equal error rate has been improved on the NIST SRE 2010 extended test corpus and NIST SRE 2012 core test corpus.

作者许云飞杨海周若华颜永红

机构地区中国科学院语言声学与内容理解重点实验室

出处《自动化学报》 EI CSCD 北大核心 2014年第6期1068-1074,共7页 Acta Automatica Sinica

基金国家高技术研究发展计划(863计划)(2012AA012503) 国家自然科学基金(10925419 90920302 61072124 11074275 11161140319 91120001 61271426) 中国科学院战略性先导科技专项(XDA06030100 XDA06030500) 中科院重点部署项目(KGZDEW-103-2)资助~~

关键词因子分析总变化因子概率线性鉴别分析联合估计期望最大化 Factor analysis, i-vector, probabilistic linear discriminant analysis （PLDA）, joint estimation, expectationmaximization （EM）

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献22

1Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted gaussian mixture models. Digital Signal Processing, 2000, 10(1-3): 19-41.
2郭武,李轶杰,戴礼荣,王仁华.说话人识别中的因子分析以及空间拼接[J].自动化学报,2009,35(9):1193-1198. 被引量：14
3Kenny P, Boulianne G, Dumouchel P. Eigenvoice modeling with sparse training data. IEEE Transactions on Speech Audio Processing, 2005, 13(3): 345-359.
4Kenny P, Boulianne G, Ouellet P, Dumouchel P. Joint factor analysis versus eigenchannels in speaker recognition. IEEE Transactions on Audio, Speech and Language Processing, 2007, 15(4): 1435-1447.
5何亮,史永哲,刘加.联合因子分析中的本征信道空间拼接方法[J].自动化学报,2011,37(7):849-856. 被引量：8
6Dehak N. Discriminative and generative approches for long-and short-term speaker characteristics modeling: Application to speaker verification [Ph.D. dissertation], école de Technologie Supérieure, Montreal, QC, Canada, 2009.
7Dehak N, Kenny P, Dehak R, Dumouchel P, Ouellet P. Front-end factor analysis for speaker verification. IEEE Transactions on Audio, Speech and Language Processing, 2011, 19(4): 788-798.
8McLaren M, Leeuwen D A V. Sourcenormalised and weighted lda for robust speaker recognition using i-vectors. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Prague, Czech Republic: IEEE, 2011. 5456-5459.
9Simon J D P, James H E. Probabilistic linear discriminant analysis for inferences about identity. In: Proceedings of International Conference on Computer Vision. Rio de Janeiro, Brazil: IEEE, 2007. 1-8.
10Dehak N, Karam Z, Reynolds D, Dehak R, Campbell W, Glass J. A channel-blind system for speaker verification. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Prague, Czech Republic: IEEE, 2011. 4536-4539.

二级参考文献22

1Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted Gaussian mixture models. Digital Signal Processing, 2000, 10(1-3): 19-41.
2Campbell W M, Sturim D E, Reynolds D A. Support vector machines using GMM supervectors for speaker verification. IEEE Signal Processing; Letters, 2006, 13(5): 308-311.
3Kenny P, Boulianne G, Ouellet P, Dumouchel P. Speaker and session variability in GMM-based speaker verification. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(4): 1448-1460.
4Vogt R, Sridharan S. Experiments in session variability modeling for speaker verification. In: Proceedings of International Conference on Acoustics, Speech, and Signal Processing. Toulouse, France: IEEE, 2006. 897-900.
5Castaldo F, Colibro D, Dalmasso E, Laface P, Vair C. Compensation of nuisance factors for speaker and language recognition. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(7): 1969-1978.
6Kenny P, Ouellet P, Dehak N, Gupta V, Dumouchel P. A study of inter-speaker variability in speaker verification. IEEE Transactions on Audio, Speech, and Language Processing, 2008, 16(5): 980-988.
7Kenny P, Boulianne G, Dumouchel P. Eigenvoice modeling with sparse training data. IEEE Transactions on Audio, Speech, and Lnnguage Processing, 2005, 13(3): 345-354.
8Kenny P, Boulianne G, Ouellet P, Dumouchel P. Joint factor analysis versus eigenchannels in speaker recognition. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(4): 1435-1447.
9NIST. The NIST Year 2008 Speaker Recognition Evaluation Plan [Online], available: http://www.nist.gov/speech/tests /sre/2008/index.html, March 20, 2008.
10Bishop C M. Pattern Recognition and Machine Learning. Berlin: Springer, 2008. 583-586.

共引文献17

1何亮,史永哲,刘加.联合因子分析中的本征信道空间拼接方法[J].自动化学报,2011,37(7):849-856. 被引量：8
2姜涛,韩纪庆,郑铁然.基于高斯混合模型移动因子补偿的说话人识别方法[J].声学学报,2011,36(6):658-664. 被引量：2
3顾晓江,赵鹤鸣,吕岗.模型与特征混合补偿法及其在耳语说话人识别中的应用[J].声学学报,2012,37(2):198-203. 被引量：4
4杨海,张翔,梁春燕,索宏彬,颜永红.联合因子分析和稀疏表示在稳健性说话人确认中的应用[J].声学学报,2012,37(5):548-552. 被引量：7
5GU Xiaojiang ZHAO Heming Lu Gang.Whispered speaker identification based on feature and model hybrid compensation[J].Chinese Journal of Acoustics,2012,31(4):499-508. 被引量：1
6李晋,郭武,戴礼荣.联合因子分析算法中基于信号子空间的空间变换方法[J].模式识别与人工智能,2013,26(8):705-710. 被引量：2
7酆勇,李宓,李子明.文本无关的说话人识别研究[J].数字通信,2013,40(4):48-52. 被引量：1
8栗志意,张卫强,何亮,刘加.基于总体变化子空间自适应的i-vector说话人识别系统研究[J].自动化学报,2014,40(8):1836-1840. 被引量：17
9梁春燕,杨琳,周若华,颜永红.韵律特征在概率线性判别分析说话人确认中的应用[J].声学学报,2015,40(1):28-33. 被引量：6
10许美玲,韩敏.多元混沌时间序列的因子回声状态网络预测模型[J].自动化学报,2015,41(5):1042-1046. 被引量：19

同被引文献26

1KINNUNEN T, LI H ZH. An overview of text-independent speaker recognition: From features to super-vectors [ J ]. Speech Communication, 2010, 52( 1 ) : 12- 40.
2GONZALEZ-RODRIGUEZ J. Evaluating automatic speaker recognition systems: An overview of the NIST speaker recognition evaluations ( 1996-2014 ) [ J ]. Lo- quens, 2014, 1 ( 1 ) : 1-15.
3KHOURY E, VESNICER B, FRANCO-PEDROSO J, et al. The 2013 speaker recognition evaluation in mobile en- vironment[ C ]. Proceedings of IAPR International Con- ference on Biometrics (ICB), 2013: 1-8.
4KENNY P, BOULIANNE G, OUELLET P, et al. Joint factor analysis versus eigenchannels in speaker recogni- tion[J]. IEEE Transactions on Audio, Speech and Lan- guage Processing, 2007, 15(4) : 1435-1447.
5DEHAK N, KENNY P, DEHAK R, et al. Front-end factor analysis for speaker verification [ J ]. IEEE Trans-actions on Audio, Speech, and Language Processing, 2011, 19(4) : 788-798.
6MCLAREN M, LEEUWEN D V. Source normalised and weighted LDA for robust speaker recognition u- sing i-veetors[ C ]. IEEE International Conferenee on Acoustics Speech and Signal Processing (ICASSP) , 2011:5456 -5459.
7KANAGASUNDARAM A, DEANA D, SRIDHARAN S, et al. I-vector based speaker recognition using advanced channel compensation techniques [ J ]. Computer Speech and Language, 2014, 28( 1 ) : 121-140.
8KENNY P. Bayesian speaker verification with heavy tailed priot~ [ C]. Proceedings of the Speaker and Lan- guage Recognition Workshop, 2010: 1-10.
9HASAN T, HANSEN J H L. Maximum likelihood acous- tic factor analysis models for robust speaker verification in noise[ J ]. 1EEE Transactions on Audio, Speech, and l.anguage Processing, 2014, 22(2): 381-391.
10邱政权,范小春,王俊年.基于维纳滤波和混合模型的说话人识别[J].仪器仪表学报,2009,30(7):1436-1440. 被引量：5

引证文献3

1王明合,唐振民,张二华.基于i-vector局部加权线性判别分析的说话人识别[J].仪器仪表学报,2015,36(12):2842-2848. 被引量：6
2徐利敏,魏翔.Android平台说话人认证系统的并行计算与设计[J].计算机工程与应用,2017,53(3):231-236.
3张二华,王明合,唐振民.加性噪声条件下鲁棒说话人确认[J].电子学报,2019,47(6):1244-1250. 被引量：3

二级引证文献9

1李湾湾,范承志,祁才君.基于改进MFD的I-Vector说话人识别[J].电声技术,2016,40(12):43-48. 被引量：1
2刘恒,吴迪,苏家仪,杨春勇,侯金.运用高斯混合模型识别动物声音情绪[J].国外电子测量技术,2016,35(11):82-87. 被引量：6
3茅正冲,王俊俊,黄舒伟.基于PLDA信道补偿的说话人识别算法[J].计算机与数字工程,2019,47(11):2757-2762. 被引量：2
4孙杰,吾守尔·斯拉木,热依曼·吐尔逊,张晶晶.维吾尔语方言识别及相关声学分析[J].声学学报,2019,44(6):1083-1092. 被引量：3
5SUN Jie,WUSHOUER Silamu,REYIMAN Turson,ZHANG Jingjing.Acoustic analysis of the vowel system in Hotan dialect and its contribution to dialect recognition of Uyghur dialects[J].Chinese Journal of Acoustics,2020,39(1):117-132.
6赵宏,岳鲁鹏,常兆斌,王伟杰.基于多特征I-Vector的说话人识别算法[J].兰州理工大学学报,2021,47(5):93-98. 被引量：1
7肜娅峰,陈晨,陈德运,何勇军.基于贝叶斯主成分分析的i-vector说话人确认方法[J].电子学报,2021,49(11):2186-2194. 被引量：2
8汪兰兰,蔡昌新.基于改进线性预测基音频率的语音情感识别系统[J].科学技术与工程,2022,22(26):11524-11532. 被引量：2
9景维鹏,肖庆欣,罗辉.基于概率球面判别分析的说话人识别信道补偿算法[J].计算机应用,2024,44(2):556-562.

1宋贵宝,张峰伟.基于信息融合技术的多阶段可靠性评估方法研究[J].舰船电子工程,2014,34(2):41-43. 被引量：4
2程起才,王洪元,吴小俊,刘锁兰.一种基于ISOMAP的分类算法[J].控制与决策,2011,26(6):826-830. 被引量：5
3成新民,张迎,蒋云良.基于FVQMM的说话人识别[J].辽宁工程技术大学学报（自然科学版）,2007,26(5):719-722.
4刘春燕,邹承明.基于用户体验度的云服务策略研究[J].小型微型计算机系统,2016,37(6):1203-1206.
5李志华,李超.一种控制继电器可靠性试验平台的研究与设计[J].计算机工程与设计,2005,26(11):3112-3114. 被引量：3
6陈放,潘素珍.计算机启动过程的解析[J].电子制作,2013,21(7X):86-86.
7安荣亮,杨琨.VI技术在扩展测试仪器功能上的应用[J].电光系统,2006(2):43-46. 被引量：1
8陈盛双,徐少堂,姚志鹏.新型媒体用户关注指数模型仿真分析[J].汉口学院学报,2013,6(2):44-48.
9测试与测量[J].中国电子商情,2006(8):87-88.
10焦宾,吕霞付,陈勇,李愿.一种改进的自适应高斯混合模型实时运动目标检测算法[J].计算机应用研究,2013,30(11):3518-3520. 被引量：5

自动化学报

2014年第6期

浏览历史

内容加载中请稍等...

高斯PLDA在说话人确认中的应用及其联合估计被引量：3

参考文献22

二级参考文献22

共引文献17

同被引文献26

引证文献3

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

高斯PLDA在说话人确认中的应用及其联合估计 被引量：3

参考文献22

二级参考文献22

共引文献17

同被引文献26

引证文献3

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

高斯PLDA在说话人确认中的应用及其联合估计被引量：3