期刊文献+

基于因子分析信道失配补偿的SVM话者确认方法 被引量:2

SVM Speaker Verification Method of Mismatch Compensation Based on Factor Analysis
原文传递
导出
摘要 针对信道失配和统计模型区分性不足而导致话者确认性能下降问题,文中提出一种将因子分析信道失配补偿与支持向量机模型相结合的文本无关话者确认方法.在SVM话者模型前端采用高斯混合模型-背景模型(GMM-UBM)方法对语音特征参数进行聚类和升维,并利用因子分析(FA)方法,对聚类获得的超矢量进行信道补偿后作为基于SVM话者确认的输入特征,从而有效解决SVM用于文本无关话者确认的大样本、升维问题,以及信道失配对性能影响问题.在NIST06数据库上实验结果表明,文中方法比未做失配补偿的GMM-UBM系统、GMM-SVM系统在等误识率上有50%以上的改善,比做了FA失配补偿的GMM-UBM系统也有15.8%的改善. The poor performance of speaker verification system results from the channel mismatch and the lack of distinction between statistical models. A text-independent speaker verification method is proposed which combines the channel compensation based on factor analysis and the discriminative support vector machine (SVM) model. Gaussian mixture model (GMM) is used to make the speech parameter clustered and ascended, then the channel information of GMM mean super-vectors is wiped off by using factor analysis. The parameters, which are used as inputting parameters, are employed for the construction of SVM speaker verification system. The proposed method solves the problems of large samples, dimension raising and channel mismatch effectively when SVM is used for the text-independent speaker verification. Experimental results on NIST 06 male speaker corpus show that the proposed method improves system performance. Compared with the baseline system Gaussian mixture model-universal background model (GMM-UBM) , GMM-SVM without channel compensation, the system improves the equal error rate (EER) more than 50%. Compared with the system factor analysis (FA)-GMM-UBM which uses channel compensation based on factor analysis without discriminative models, it also gets the improvement of EER by 15.8%.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2010年第1期59-64,共6页 Pattern Recognition and Artificial Intelligence
关键词 因子分析 高斯混合模型(GMM)超矢量 支持向量机(SVM) 话者确认 Factor Analysis, Gaussian Mixture ( SVM), Speaker Verification Model (GMM) Super Vector, Support Vector Machine
  • 相关文献

参考文献9

  • 1Reynolds D A, Quatieri T F, Dunn R B. Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 2000, 10(1) : 19-41.
  • 2Vapnik V N. The Nature of Statistical Learning Theory. New York, USA: Springer-Verlag, 1995.
  • 3Reynolds D A. Channel Robust Speaker Verification via Feature Mapping//Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Hongknng, China, 2003, Ⅱ : 53 - 56.
  • 4Teunen R, Shahshahani B, Heck L. A Model-Based Transformational Approach to Robust Speaker Recognition // Proc of the 6th International Conference on Spoken Language Processing. Beijing, China, 2000, Ⅱ: 495 -498.
  • 5Yin Shouchun, Rose R, Kenny P. A Joint Factor Analysis Approach to Progressive Model Adaptation in Text Independent Speaker Verification. IEEE Trans on Audio, Speech and Language Processing, 2007, 15(7): 1999-2010.
  • 6Matrouf D, Scheffer N, Fauve B, et al. A Straightforward and Efficient Implementation of the Factor Analysis Model for Speaker Verification [ EB/OL]. [ 2007-08-29 ]. http://lia. univ-avignon. fr/ fileadmin/documents/Users/Intranet/fich-art/987 -Matrouf-IS07. pdf.
  • 7Campbell W M, Sturim D E, Reynolds D A. Support Vector Machines Using GMM Supervectors for Speaker Verification. IEEE Signal Processing Letters, 2006, 13 (5) : 308 - 311.
  • 8The NIST Year 2006 Speaker Recognition Evaluation Plan [ EB/ OL]. [ 2006-03-08 ]. http://www. nist. gov/speech/tests/spk/2006/sre-06_evalplan-v9. pdf.
  • 9Burger L, Matejka P, Schwarz P, et al. Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System. IEEE Trans on Audio, Speech and Language Processing, 2007, 15(7) : 1979 -1986.

同被引文献11

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部