期刊文献+

一种基于说话者话路变化的主成分分析方法

A PCA Method Based on Speaker Session Variability
原文传递
导出
摘要 在文本无关的说话人确认中,训练与测试语音中信道环境的不匹配是一种说话者话路变化问题.这种不匹配会严重降低说话人确认系统的性能.为了有效解决该问题,本文提出一种基于说话者话路变化的主成分分析方法,将其应用在说话者确认中,我们将这种方法称为面向话路变化的主成分分析方法.这种方法能够与类内协方差归一化结合,进一步提高识别效果.在NIST2006年说话者识别数据库上进行实验,证明该方法不仅在系统识别等错误率上比基线系统有了24.2%的降低,而且在计算复杂度上相对于目前传统的方法也有很大的优势. In the text-independent speaker verification systems, the mismatch and variability of the channel and environment between training and testing is a session variability problem. It can greatly degrade the speaker recognition performance. To deal with the problem more efficiently, a modified PCA method is proposed called session variation principal component analysis (SVPCA) which can integrate with within class covariance normalization (WCCN). In the NIST 2006 verification task, the proposed method is compared with our previous baseline general linear discriminative sequence-support vector machine (GLDS-SVM) system. The experimental results show a relative reduction of up to 24.2% in error equal ratio (EER). Moreover, the proposed method has advantages in computational and memory costs, compared with the state-of-art systems.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2009年第2期270-274,共5页 Pattern Recognition and Artificial Intelligence
关键词 面向话路变化的主成分分析(SVPCA) 类内协方差归一化(WCCN) 广义线性序列超向量 说话者确认 Session Variation Principal Component Analysis (SVPCA), Within Class Covariance Normalization (WCCN), General Linear Discriminative Sequence Supervector, Speaker Verification
  • 相关文献

参考文献9

  • 1Sturim D E, Campbell W M, Reynolds D A, et al. Robust Speaker Recognition with Cross-Channel Data: MIT-LL Results on the 2006 NIST SRE Auxiliary Microphone Task//Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Honolulu, USA, 2007,Ⅳ: 49 - 52
  • 2Kenny P, Boulianne G, Ouellet P, et al. Joint Factor Analysis versus Eigenchannels in Speaker Recognition. IEEE Trans on Audio, Speech and Language Processing, 2007, 15(4) : 1435 -1447
  • 3Solomonoff A, Quillen C, Campbell W. Channel Compensation for SVM Speaker Recognition [ EB/OL]. [ 2004- 12- 01 ]. http:// www. 11. mit. edu. mission/communications/ist/publications/ 040531 Solomonoff. pdf
  • 4Vapnik V N. The Nature of Statistical Learning Theory. New York, USA: Springer-Verlag, 1995
  • 5Hatch A, Stolcke A. Generalized Linear Kernels for One-Versus-All Classification: Application to Speaker Recognition // Proe of the IEEE International Conference on Acoustics, Speech and Signal Processing. Toulouse, France, 2006, Ⅴ: 585-588
  • 6Hatch A O, Kajarekar S, Stolcke A. Within-Class Covariance Normahzation for SVM-Based Speaker Recognition [ EB/OL]. [21307- 10- 21 ]. http ://v, ww. icsi. berkeley, edu/puhs/speeeh/HatchlCSLP06, pdf
  • 7Matejka P, Burget L, Schwarz P, et al. STBU System for the NIST 2006 Speaker Recognition Evaluation// Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Honolulu, USA, 2007, Ⅳ: 221 -224
  • 8Campbell W M, Campbell J P, Reynolds D A, et al. Support Vector Machines for Speaker and Language Recognition. Computer Speech and Language, 2006, 20(2/3) : 210 -229
  • 9Nation Institute of Standards and Technology. NIST Speech Group Website [DB/OL]. [2007- 10- 03]. http://www. nist. gov/speech

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部