期刊文献+

基于通用背景-联合估计(UB-JE)的说话人识别方法 被引量:5

Speaker Recognition Based on Universal Background-Joint Estimation(UB-JE)
下载PDF
导出
摘要 在说话人识别中,有效的识别方法是核心.近年来,基于总变化因子分析(i-vector)方法成为了说话人识别领域的主流,其中总变化因子空间的估计是整个算法的关键.本文结合常规的因子分析方法提出一种新的总变化因子空间估计算法,即通用背景–联合估计(Universal background-joint estimation algorithm, UB-JE)算法.首先,根据高斯混合–通用背景模型(Gaussian mixture model-universal background model, GMM-UBM)思想提出总变化矩阵通用背景(UB)算法;其次,根据因子分析理论结合相关文献提出了一种总变化矩阵联合估计(JE)算法;最后,将两种算法相结合得到通用背景–联合估计(UB-JE)算法.采用TIMIT和MDSVC语音数据库,结合i-vector方法将所提的算法与传统算法进行对比实验.结果显示,等错误率(Equal error rate, EER)和最小检测代价函数(Minimum detection cost function, MinDCF)分别提升了8.3%与6.9%,所提方法能够提升i-vector方法的性能. In the speaker recognition, the effective identification method is the core. In recent years, i-vector method has become the mainstream in the field of speaker recognition, and estimation of the total variation factor space is the key of whole algorithm. In this paper, we propose a new algorithm for total variation factor space estimation named UB-JE,which is combined with conventional factor analysis method. Firstly, the universal background algorithm of total variation matrix is proposed according to Gaussian mixture model-universal background model(GMM-UBM). Secondly, the joint estimation algorithm of total variation matrix is proposed according to the factor analysis theory and related works.Finally, the two algorithms are combined to get the universal background-joint estimation algorithm(UB-JE). TIMIT and MDSVC corpus are adopted in the experiment to compare the proposed algorithm with the traditional algorithm.Experimental results show that the equal error rate(EER) and the minimum detection cost function(MinDCF) are improved by 8.3 % and 6.9 %, respectively. The proposed method can improve the performance of i-vector method.
作者 汪海彬 郭剑毅 毛存礼 余正涛 WANG Hai-Bin;GUO Jian-Yi;MAO Cun-Li;YU Zheng-Tao(School of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500;Intelligent Information Processing Key Laboratory,Kunming University of Science and Technology,Kunming 650500)
出处 《自动化学报》 EI CSCD 北大核心 2018年第10期1888-1895,共8页 Acta Automatica Sinica
基金 国家自然科学基金(61262041 61472168 61562052)资助~~
关键词 总变化因子分析 总变化因子空间 通用背景–联合估计算法 说话人识别 I-vector total variation factor space universal background-joint estimation algorithm(UB-JE) speaker recognition
  • 相关文献

参考文献2

二级参考文献22

  • 1Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted Gaussian mixture models. Digital Signal Processing, 2000, 10(1-3): 19-41.
  • 2Campbell W M, Sturim D E, Reynolds D A. Support vector machines using GMM supervectors for speaker verification. IEEE Signal Processing; Letters, 2006, 13(5): 308-311.
  • 3Kenny P, Boulianne G, Ouellet P, Dumouchel P. Speaker and session variability in GMM-based speaker verification. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(4): 1448-1460.
  • 4Vogt R, Sridharan S. Experiments in session variability modeling for speaker verification. In: Proceedings of International Conference on Acoustics, Speech, and Signal Processing. Toulouse, France: IEEE, 2006. 897-900.
  • 5Castaldo F, Colibro D, Dalmasso E, Laface P, Vair C. Compensation of nuisance factors for speaker and language recognition. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(7): 1969-1978.
  • 6Kenny P, Ouellet P, Dehak N, Gupta V, Dumouchel P. A study of inter-speaker variability in speaker verification. IEEE Transactions on Audio, Speech, and Language Processing, 2008, 16(5): 980-988.
  • 7Kenny P, Boulianne G, Dumouchel P. Eigenvoice modeling with sparse training data. IEEE Transactions on Audio, Speech, and Lnnguage Processing, 2005, 13(3): 345-354.
  • 8Kenny P, Boulianne G, Ouellet P, Dumouchel P. Joint factor analysis versus eigenchannels in speaker recognition. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(4): 1435-1447.
  • 9NIST. The NIST Year 2008 Speaker Recognition Evaluation Plan [Online], available: http://www.nist.gov/speech/tests /sre/2008/index.html, March 20, 2008.
  • 10Bishop C M. Pattern Recognition and Machine Learning. Berlin: Springer, 2008. 583-586.

共引文献17

同被引文献18

引证文献5

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部