A PCA Method Based on Speaker Session Variability

Abstract: In text-independent speaker verification, a mismatch in channel and environment between training and test speech is a session variability problem, and it can severely degrade verification performance. To address this problem more efficiently, this paper proposes a modified PCA method based on speaker session variability, called session variation principal component analysis (SVPCA), and applies it to speaker verification. SVPCA can be combined with within-class covariance normalization (WCCN) to further improve recognition. On the NIST 2006 speaker verification task, the proposed method is compared with our previous baseline generalized linear discriminant sequence support vector machine (GLDS-SVM) system. The experimental results show a relative reduction of up to 24.2% in equal error rate (EER). The proposed method also has advantages in computational and memory cost over state-of-the-art systems.
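The abstract names two steps, a PCA driven by session variability and WCCN, without giving their formulas. The following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes that session variability is estimated from the deviation of each session vector from its speaker's mean, that the PCA is taken over those deviations, and that WCCN uses the standard Cholesky factor of the inverse within-class covariance. All function names, the `eps` regularizer, and the choice of `k` are hypothetical.

```python
import numpy as np

def within_class_deviations(X, labels):
    """Subtract each speaker's mean vector from that speaker's session vectors."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    D = np.empty_like(X)
    for spk in np.unique(labels):
        mask = labels == spk
        D[mask] = X[mask] - X[mask].mean(axis=0)
    return D

def session_variation_directions(X, labels, k):
    """Top-k principal directions of the within-speaker deviations.

    Assumption: this is how "PCA based on session variability" is read here.
    Returns a (dim, k) matrix with orthonormal columns.
    """
    D = within_class_deviations(X, labels)
    # Rows of Vt are the principal axes, ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(D, full_matrices=False)
    return Vt[:k].T

def wccn_transform(X, labels, eps=1e-6):
    """Return B such that B @ B.T equals the inverse within-class covariance.

    eps is a hypothetical ridge term that keeps the covariance invertible.
    """
    D = within_class_deviations(X, labels)
    W = D.T @ D / len(D)
    W += eps * np.eye(W.shape[0])
    return np.linalg.cholesky(np.linalg.inv(W))
```

With row-vector data `X`, `X @ B` yields features whose within-speaker covariance is approximately the identity, so a linear kernel on them corresponds to the kernel x^T W^{-1} y. How the paper combines the PCA directions with WCCN (for example, applying WCCN inside the subspace they span) is not stated in the abstract and is left open here.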

Authors: Long Yanhua, Guo Wu, Dai Lirong

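Affiliation: iFlytek Speech Laboratory, Department of Electronic Engineering and Information Science, University of Science and Technology of China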

Source: Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2009, No. 2, pp. 270-274 (5 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

Keywords: Session Variation Principal Component Analysis (SVPCA), Within-Class Covariance Normalization (WCCN), Generalized Linear Discriminant Sequence Supervector, Speaker Verification

Classification code: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]


