期刊文献+

利用i-vectors构建区分性话者模型的话者确认 被引量:3

Discriminative Speaker Models Based on i-vectors for Speaker Verification
下载PDF
导出
摘要 对于电话手机语音的文本无关话者确认,运用联合因子分析构建话者信息子空间与信道信息子空间来进行失配信道补偿取得了较好的效果.然而研究表明,信道信息子空间仍然包含了可以用来区分话者的信息.因此,本文运用一种既包含话者信息又包含信道信息的全变量信息子空间来提取i-vectors低维特征矢量,再运用类内协方差规整进行失配信道补偿,最后用补偿后的i-vectors特征矢量构建支持向量机话者模型.在NIST08数据库上实验表明,本文所构建系统的性能在等误识率和最小检测代价函数上有相对近70%的提高. Joint Factor Analysis provides an effective means for text independent speaker verification system. It is a powerful technique for compensating the variability caused by different channels and speakers. However, studies show that, the channel information sub- space also contains information that can be used to distinguish between speakers. In this study, we propose a new speaker representa- tion called i-vectors which is a low-dimensional vector. Firstly , it is extracted from a total variability space which models both the speaker and channel variability. Then, within this total variability space, Within-Class Covariance Normalization, a common used channel compensation method, is performed to reduce the channel variability. Finally, the compensative i-vectors are used to train discriminative models based on Support Vector Machines. Experiments on NIST08 SRE database show that the proposed strategy can improve the system performance as much as 70% both in EER and MinDCF over the baseline system.
出处 《小型微型计算机系统》 CSCD 北大核心 2014年第3期685-688,共4页 Journal of Chinese Computer Systems
关键词 话者确认 全变量信息子空间 类内协方差规整 支持向量机 i—vectors speaker verification total-variability subspace within-class covariance normalization support vector machine i-vectors
  • 相关文献

参考文献11

  • 1Kanagasundaram A, Vogt R, Dean D, et al. i-vector based speaker recognition on short utterances [ C ]. Interspeech 2011, Florence, 2011 : 2341-2344.
  • 2Dehak N. Discriminative and generative approaches for long- and short-term speaker characteristics modeling: application to speaker verification: application to speaker verification [ D ]. Montreal: Ecole de Technologie Suprrieure, 2009.
  • 3Dehak N, Kenny P, Dehak R, et al. Front end factor analysis for speaker verification[ J]. IEEE Transactions on Audio, Speech and Language Processing, 2010, 19(4) :788-798.
  • 4Hatch A O, Kajarekar S, Stolcke A. Within-class covariance nor- malization for SVM-based speaker recognition [ C ]. Interspeech 2006, Pittsburgh, 2006: 53-56.
  • 5Dehak N, Dehak R, Kenny P, et al. Support vector machines ver- sus fast scoring in the low-dimensional total variability space for speaker verification [ C ]. lnterspeech 2009, Brighton UK, 2009 : 53 -56.
  • 6Glembek O, Burget L, Matejka P, et al. Simplification and opti- mization of i-vector extraction [ C ]. Proceedings of IEEE Interna- tional Conference on Acoustics, Speech and Signal Processing, Prague, 2011: 4516-4519.
  • 7Matthias Seeger. Low rank updates for the cholesky decomposition [ EB/OL]. http://upseeger, epfl. ch/papers/cholupdate, pdf, 2008.
  • 8Dehak N, Dehak R, Glass J, et al. Cosine similarity scoring with- out score normalization techniques[ C ]. Odyssey Speaker and Lan- guage Recognition Workshop, Bmo, 2010.
  • 9The NIST year 2006 speaker recognition evaluation plan[ EB/OL]. http://www, nist. gov/speech/tests/spk/2006/sre436- evalplan- v9. pdf, 2006.
  • 10Kinnunen T, Li H. An overview of text-independent speaker rec- ognition: from features to supervectors [ J ]. Speech Communica- tion, 2010,52 : 12-40.

同被引文献14

  • 1吴礼福,姚志强,戴蓓蒨,李辉.音源特征用于提高话者确认系统的鲁棒性[J].中国科学技术大学学报,2006,36(5):476-480. 被引量:2
  • 2REYNOLDS D A,QUATIERI T F,DUNN R. Speaker verifica.tion using adapted gaussian mixture model [J]. Digital signalprocessing,2000,10(1/2/3):19-41.
  • 3CAMPBELL W M,STURIM D E,REYNOLDS D A. Supportvector machines using GMM supervectors for speaker verifica.tion [J]. IEEE signal processing letters,2006,13(5):308-311.
  • 4KENNY P,OUELLET P,DEHAK N,et al. A study of inter.speaker variability in speaker verification [J]. IEEE transac.tions on audio, speech and language processing, 2008, 16(5):980-988.
  • 5DEHAK N,KENNY P,OUELLET P,et al. Front.end factoranalysis for speaker verification [J]. IEEE Transactions on au.dio,speech and language processing,2011,19(4):788-798.
  • 6GHAHRAMANI Z,HINTON G. The EM algorithm for mix.tures of factor analyzers:CRG.TR.96.1 [R]. Toronto:Depart.ment of Computer Science,University of Toronto,1966.
  • 7GAUVAIN J L,LEE C H. Maximum a posterior estimationfor multivariate Gaussian mixture observations of Markovchains [J]. IEEE transactions on speech and audio processing,1994,2(2):291-298.
  • 8GLEMBEK O,BURGET L,MAěJKA P,et al. Simplifica.tion and optimization of I.vector extraction [C].Proceedings ofIEEE International Conference on Acoustics,Speech and SignalProcessing. Prague:IEEE,2011:4516-4519.
  • 9SEEGER Matthias.Low rank updates for the cholesky decompo.sition [EB/OL].[2010.12.04].http://upseeger.epfl.ch/papers/cholupdate.pdf.
  • 10POVEY D,GHOSHAL A,BOULIANNE G,et al. The Kaldispeech recognition toolkit[EB/OL].[2013.02.03].http://blog.csdn.net/jiangyangbo/article/.

引证文献3

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部