
Eigenvoice Factor Analysis in Short Time Speaker Recognition (cited by: 3)
Abstract: A text-independent speaker verification method based on eigenvoice factor analysis is proposed. It addresses the case where both the training and the test utterances are very short, in which the conventional Gaussian mixture model (GMM)-universal background model (UBM) system based on maximum a posteriori (MAP) estimation cannot build a stable speaker model. First, the eigenvoice loading matrix is trained on a development corpus using the expectation-maximization (EM) algorithm. Then, when a speaker model is built, the short-duration speech data are projected onto the low-dimensional eigenvoice space to obtain the speaker factor and, from it, the model parameters. Experimental results on the 10 s training, 10 s test task of the NIST speaker recognition evaluation (SRE) 2006 corpus show that eigenvoice factor analysis reduces the equal error rate (EER) by 18% against the baseline GMM system.
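The enrollment step described in the abstract can be illustrated with a minimal NumPy sketch. It assumes the standard eigenvoice model, in which the speaker mean supervector is s = m + V y with a standard normal prior on the speaker factor y, and it computes the posterior mean of y from Baum-Welch statistics. The function names, argument names, and array layout are illustrative assumptions and are not taken from the paper; training of V by EM is not shown.

```python
import numpy as np

def speaker_factor(N, F, m, sigma, V):
    """Posterior mean of the speaker factor y under s = m + V y.

    Hypothetical layout (an assumption, not from the paper):
      N:     (C,)     zeroth-order statistics (soft counts) per Gaussian
      F:     (C, D)   first-order statistics per Gaussian
      m:     (C, D)   UBM means
      sigma: (C, D)   UBM diagonal covariances
      V:     (C*D, R) eigenvoice loading matrix
    """
    C, D = m.shape
    # Center the first-order statistics around the UBM means.
    F_centered = (F - N[:, None] * m).reshape(C * D)
    # Repeat each Gaussian's count across its D feature dimensions.
    N_sv = np.repeat(N, D)
    inv_sigma = 1.0 / sigma.reshape(C * D)
    Vt_Sinv = V.T * inv_sigma                      # shape (R, C*D)
    # Posterior precision: I + V^T Sigma^{-1} N V
    precision = np.eye(V.shape[1]) + (Vt_Sinv * N_sv) @ V
    # Posterior mean: precision^{-1} V^T Sigma^{-1} F_centered
    return np.linalg.solve(precision, Vt_Sinv @ F_centered)

def adapted_means(m, V, y):
    """Speaker-adapted GMM means, reshaped back to (C, D)."""
    return m + (V @ y).reshape(m.shape)
```

Because only the R-dimensional vector y is estimated rather than every Gaussian mean independently, a few seconds of speech can still constrain the whole model, which is the property the abstract relies on for the 10 s conditions. With no data (all counts zero) the estimate falls back to y = 0, that is, to the UBM means.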
Source: Journal of Data Acquisition and Processing (《数据采集与处理》), CSCD, Peking University Core Journal, 2009, No. 4, pp. 449-452 (4 pages)
Keywords: eigenvoice; eigenchannel; speaker verification








