一种用于鲁棒性说话人确认的分段概率分布参数规整方法

Robust speaker verification based on the method of piecewise normalization of the cumulative distribution function of parameters

下载PDF

导出

摘要目前与文本无关的话者确认系统大都是基于GMM-UBM模型结构的,为了精确的描述说话人语音特征空间的分布,模型混合度M通常都选的很大,因而模型训练需要大量的语音数据。本文提出了一种基于分段估计概率分布函数的规整方法,在概率分布的意义上降低特征参数偏离高斯分布的程度,从而可以用较低混合度的高斯混合模型对其建模。同时,这种映射也是一种无监督规整,因此可以提高系统的鲁棒性及其确认性能。在NIST'03数据库上的实验表明,在使用相同混合度模型的情况下,概率分布规整后的参数相对于变换前的参数系统性能可以提高11%左右。 Current text-independent speaker verification systems are mostly based on GMM-UBM （Gaussian Mixture Model - Universal Background Model） structures. In order to model the distribution of speech signal exactly, the number of mixtures usually becomes very large. So that the speech needed to train the models will increase greatly too. The technique of parameters normalization based on piecewise estimating the cumulative distribution function is performed in this paper. In this way the non-Gaussianity in the means of the cumulative distribution of Mel-cepstral parameters is decreased. Thus GMMs with fewer mixtures could model them precisely. The projection is also unsupervised normalization technique, so as to improve the robustness and performance of the system. Experiments on the database of NIST＇03 show that the verification performance of normalized parameters could relatively improve about 11% in contrast to original parameters when modeled with the same mixtures.

作者解焱陆刘青松戴蓓蒨李辉

机构地区中国科学技术大学多媒体计算与通信教育部-微软重点实验室

出处《电路与系统学报》 CSCD 北大核心 2008年第6期91-95,90,共6页 Journal of Circuits and Systems

基金国家自然科学基金资助项目(60272039) 教育部-微软重点实验室开放基金资助项目(05071810)

关键词概率分布规整高斯分布 MFCC GMM-UBM 说话人确认 distribution normalization the gaussian distribution MFCC GMM-UBM speaker verification

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献7

1Reynolds D A, Rose R C. Robust text-independent speaker identification using Gaussian mixture speaker models [J]. IEEE Transactions on Speech and Audio Processing, 1995, 3(1): 72-83.
2Reynolds D A. Channel robust speaker verification via feature mapping [A]. Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on [C]. 2003-04, 2: 53-6.
3Chang-wen Hsu, Lin-shan Lee. Higher order cepstral moment normalization (HOCMN) for robust speech recognition [A]. ICASSP'04 [C]. 2004-05, 1: 197-200.
4Ramesh Gopinath. Gaussianization IMA Workshop: Mathematical Foundations of Speech Processing and Recognition[OL]. http://www.ima.umn.edu/talks/workshops/9-18-22.2000/gopinath/talk.pdf 2000-09.
5Bing Xiang, Chaudhari U V, Navratil J, Ramaswamy G N, Gopinath R A. Short-time Gaussianization for robust speaker verification [A]. Acoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP '02). IEEE International Conference on [C]. 2002-05, 1:681-684.
6D A Reynolds, T Quatieri R Dunn. Speaker verification using adapted Gaussian mixture models [J]. Digital Signal Processing, 2000-10, 10: 19-41.
7NIST. The NIST year 2004 speaker recognition evaluation plan [OL]. http://www.nist.gov/speech/tests/spk/2004/SRE-04_evalplan-vla.pdf.

1张保轩,王连军,田岚.基于PC机的汉语话者确认系统[J].山东电子,1995(3):16-17.
2汪扬.模拟电路元件故障诊断的研究[J].广播电视信息,1998(6):13-15.
3汪扬.模拟电路故障的求值诊断方法[J].世界科技研究与发展,1998,20(2):121-123.
4李勃,杨腾祥,胡建华,赵琳.智能卡话者确认系统的研究[J].昆明理工大学学报（理工版）,1999,24(2):12-17.
5刘维亭,朱志宇.基于小波网络和HMM的语音识别方法[J].电声技术,2004,28(11):56-59. 被引量：2
6陈继旭,刘明辉,戴蓓蒨,李辉.文本无关说话人确认中的一种新的评分规整方法[J].信号处理,2006,22(4):545-549. 被引量：1
7张申如.单模光纤剖面参数偏离对色散系数影响的简化计算[J].通信学报,1989,10(3):71-73.
8张硕,钟子发,史英春,崔再华.DS-CDMA系统最优匹配MMSE时延估计[J].微计算机信息,2008,24(9):201-203.
9李鹏,屈丹.语音查询项检索中的两阶段得分规整方法[J].模式识别与人工智能,2016,29(3):216-222.
10王欣媛,程磊,魏巍.区间分段优化最大似然估计算法[J].制导与引信,2008,29(1):55-60.

电路与系统学报

2008年第6期

浏览历史

内容加载中请稍等...

一种用于鲁棒性说话人确认的分段概率分布参数规整方法

参考文献7

相关作者

相关机构

相关主题

浏览历史