Discriminative Speaker Models Based on i-vectors for Speaker Verification (cited by: 3)

Abstract: For text-independent speaker verification on telephone and mobile-phone speech, Joint Factor Analysis (JFA) has achieved good results by building a speaker subspace and a channel subspace to compensate for channel mismatch. However, studies show that the channel subspace still contains information that can be used to distinguish between speakers. This paper therefore extracts low-dimensional feature vectors, called i-vectors, from a single total variability subspace that models both speaker and channel variability. Within-Class Covariance Normalization (WCCN), a commonly used channel compensation method, is then applied within this subspace to reduce channel variability. Finally, the compensated i-vectors are used to train discriminative speaker models based on Support Vector Machines (SVMs). Experiments on the NIST08 SRE database show that the proposed system achieves a relative improvement of nearly 70% over the baseline system in both equal error rate (EER) and minimum detection cost function (MinDCF).
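The WCCN step named in the abstract can be illustrated with a minimal sketch. It assumes the standard formulation in which the average within-speaker covariance W is estimated from labelled i-vectors and a projection B is taken from the Cholesky factorisation W^{-1} = B B^T; the abstract does not give the authors' exact procedure. The function names, the toy dimensions, and the synthetic data are all hypothetical, and the i-vector extraction and SVM training stages are not shown.

```python
import numpy as np


def within_class_covariance(ivectors, speaker_ids):
    """Average, over speakers, of each speaker's covariance around its own mean."""
    ivectors = np.asarray(ivectors, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    dim = ivectors.shape[1]
    speakers = np.unique(speaker_ids)
    W = np.zeros((dim, dim))
    for spk in speakers:
        x = ivectors[speaker_ids == spk]
        centered = x - x.mean(axis=0)
        W += centered.T @ centered / len(x)
    return W / len(speakers)


def wccn_projection(ivectors, speaker_ids):
    """Return lower-triangular B with W^{-1} = B B^T (assumes W is positive definite)."""
    W = within_class_covariance(ivectors, speaker_ids)
    return np.linalg.cholesky(np.linalg.inv(W))


def apply_wccn(ivectors, B):
    """Map each row vector w to (B^T w)^T, i.e. multiply the data matrix by B."""
    return np.asarray(ivectors, dtype=float) @ B


# Synthetic stand-in for i-vectors (assumption: real ones come from a
# total variability model, which is outside the scope of this sketch).
rng = np.random.default_rng(0)
n_speakers, per_speaker, dim = 20, 30, 5
speaker_ids = np.repeat(np.arange(n_speakers), per_speaker)
speaker_means = rng.normal(size=(n_speakers, dim))
channel_mix = rng.normal(size=(dim, dim))  # correlated "channel" noise
noise = rng.normal(size=(n_speakers * per_speaker, dim)) @ channel_mix
ivectors = speaker_means[speaker_ids] + noise

B = wccn_projection(ivectors, speaker_ids)
compensated = apply_wccn(ivectors, B)
W_after = within_class_covariance(compensated, speaker_ids)
```

After the projection, the within-speaker covariance of the training data becomes the identity matrix, so directions dominated by channel variation no longer outweigh the others. The compensated vectors could then be passed to a linear or cosine-kernel SVM, as the abstract describes.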

Authors: Fang Xin (方昕), Li Hui (李辉), Liu Qingsong (刘青松)

Affiliation: Department of Electronic Science and Technology, University of Science and Technology of China

Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), indexed in CSCD and the Peking University Core Journals list; 2014, No. 3, pp. 685-688 (4 pages)

Keywords: speaker verification; total-variability subspace; within-class covariance normalization; support vector machine; i-vectors

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
