采用高斯概率分布和支持向量机的说话人确认被引量：2

Speaker Verification Based on Gaussian Probability Distribution and SVM

导出

摘要在采用支持向量机的说话人确认中,将语音特征参数相对于通用背景模型各高斯分量的概率分布作为支持向量机输入,在线性核函数的情况下,系统能取得与广义线性判别式序列核函数(GLDS)几乎相同的识别率,同时该高斯概率分布算法能够与混合高斯背景模型、广义线性判别式序列核函数的得分进行融合,进一步提高识别性能.在2006年 NIST SRE 1conv4w-1conv4w 数据库上,融合后的系统相对于基线的混合高斯模型最多有25%的等错误率下降. In the text-independent speaker verification research, the probability distribution against the universal background model （PD-UBM） is calculated. And the score of each UBM Gaussian mixture is adopted as the input feature of the support vector machine （SVM） during the training and testing process. The proposed PD-UBM algorithm with linear kernel function can obtain the same or better performance as the generalized linear discriminant sequence （GLDS） kernel system. Furthermore, if the scores of the Gaussian mixture models （GMM-UBM） , the GLDS and the PD-UBM are combined, the significant improvement of the system can be achieved. In 2006, on NIST 1conv4w-1conv4w speaker recognition evaluation （SRE） corpus, the fusion system obtained 25% relative improvement equal error rate （ERR） of over the GMM-UBM system.

作者郭武戴礼荣王仁华

机构地区中国科学技术大学电子工程与信息科学系科大讯飞语音实验室

出处《模式识别与人工智能》 EI CSCD 北大核心 2008年第6期794-798,共5页 Pattern Recognition and Artificial Intelligence

基金国家863计划资助项目(No.2006AA010104)

关键词广义线性判别式序列(GLDS) 梅尔刻度式倒谱参数(MFCC) 线性预测倒谱参数(LPCC) Generalized Linear Discriminant Sequence （GLDS）, Mel Frequency Cepstrum Coefficient （ MFCC）, Linear Prediction Cepstrum Coefficient （LPCC）

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献12

1Campbell W M, Campbell J P, Reynolds D A. Support Vector Machines for Speaker and Language Recognition. Computer Speech and Language, 2006, 20(2/3): 210-229
2Campbell W M, Sturim D E, Reynolds D A. Support Vector Machines Using GMM Supervectors for Speaker Verification. IEEE Signal Processing Letters, 2006, 13(5) : 308 -311
3Campbell W M, Sturim D E, Reynolds D A, et al. SVM Based Speaker Verification Using a GMM Supervector Kernel and Nap Variability Compensation//Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Toulouse, USA, 2006,Ⅰ: 97 -100
4Reynolds D A, Quatieri T F, Dunn R B. Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 2000, 10(1/2/3) : 19 -41
5Nello C, Jhon S T. Support Vector Machines. Cambridge, UK: Cambridge University Press, 2000
6Lamel L F, Rabiner L R, Rosenberg A, et al. An Improved Endpoint Detector for Isolated Word Recognition. IEEE Trans on Acoustics, Speech and Signal Processing, 1981, 29(4) : 777 -785
7Xiang Bing, Chaudhari U V, Navratil J, et al. Short-Time Gaussianization for Robust Speaker Verification/! Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Orlando, USA, 2002, Ⅰ: 681 -684
8Collobert R. SVMTorch: Support Vector Machines for Large-Scale Regression Problems. Journal of Machine Learning Research, 2001, 1:143-160
9Matejka P, Burget L, Schwarz P, et al. STBU System for the NIST 2006 Speaker Recognition Evaluation// Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Honolulu, USA, 2007, Ⅳ: 221 -224
10Niko B, Johan D P. Application-Independent Evaluation of Speaker Detection. Computer Speech and Language, 2006, 20 ( 2/3 ) : 230 - 275

同被引文献6

1鲍焕军,郑方.GMM-UBM和SVM说话人辨认系统及融合的分析[J].清华大学学报（自然科学版）,2008,48(S1):693-698. 被引量：9
2叶寒生,陶进绪,张东文,余斌.噪声环境下基于特征信息融合的说话人识别[J].计算机仿真,2009,26(3):325-328. 被引量：2
3郑建炜,王万良,郑泽萍.GMM与RVM融合的话者辨识方法[J].计算机工程,2010,36(15):168-170. 被引量：2
4郭春霞.说话人识别算法的研究[J].西安邮电学院学报,2010,15(5):104-106. 被引量：5
5黄肖忠,李辉,许东星,郭伟.基于韵律特征的SVM说话人确认[J].计算机工程与应用,2011,47(15):148-151. 被引量：2
6DU Jun,ZOU Xin,HAO Jie,LIU Ju.The Efficiency of ICA-based Representation Analysis： Application to Speech Feature Extraction[J].Chinese Journal of Electronics,2011,20(2):287-292. 被引量：2

引证文献2

1杨迪,戚银城,刘明军,张华芳子,武军娜.说话人识别综述[J].电子科技,2012,25(6):162-165. 被引量：5
2卓著,李辉.PCA变换下的GMM-SVM话者确认研究[J].小型微型计算机系统,2015,36(3):637-640. 被引量：1

二级引证文献6

1成培.移动式智能化广播影视视听节目监管平台解决方案[J].科技创新与应用,2013,3(17):23-23. 被引量：2
2宋乐,白静.说话人识别中改进特征提取算法的研究[J].计算机工程与设计,2014,35(5):1772-1775. 被引量：3
3鲁晓倩,关胜晓.基于VQ和GMM的实时声纹识别研究[J].计算机系统应用,2014,23(9):6-12. 被引量：3
4王煜.说话人识别研究现状[J].数字技术与应用,2017,35(6):59-61. 被引量：2
5胡志隆,文畅,谢凯,贺建飚.联合HMM-UBM与RVM的声纹密码识别算法[J].计算机工程,2018,44(11):129-134. 被引量：5
6刘培培,杨祥来.基于图像信息的话者识别[J].中国科技论文,2018,13(20):2388-2393. 被引量：2

1姚红,梁栋,郭武.基于模型距离和支持向量机的说话人确认[J].计算机仿真,2009,26(3):343-346. 被引量：2
2鲁建华,王博,安玮,程洪玮.基于联合高斯概率分布的传感器截获概率计算分析[J].电子对抗,2008(4):22-25. 被引量：1
3宋园方.说话人识别技术的研究[J].网络财富,2010(20):185-185.
4贾克明,陶洪久.基于DSP的嵌入式语音识别系统的研究与实现[J].武汉理工大学学报（信息与管理工程版）,2006,28(7):156-159. 被引量：4
5杨于村,蒋燕.基于广义线性区分核支持向量机的说话人确认[J].电声技术,2009,33(8):64-67.
6Ni-Ni Rao Yu-Chuan Huang Bin Liu.A Class of Chaotic Sequences with Gauss Probability Distribution for Radar Mask Jamming[J].Journal of Electronic Science and Technology of China,2007,5(2):180-182.
7张婷,王彬,刘世刚.基于Hammerstein模型的非线性信道广义线性盲均衡算法[J].电子学报,2015,43(9):1723-1731. 被引量：2
8何建超,章坚武,吴震东.一种基于筛选高斯分量的说话人确认方法[J].杭州电子科技大学学报（自然科学版）,2015,35(6):50-54.
9徐潇潇,谢林柏,彭力.一种改进的基于贝叶斯的位置指纹算法[J].江南大学学报（自然科学版）,2015,14(5):527-531. 被引量：3
10邵朝,赵妮,石超雄.一类最小方差无失真响应波束的形成方法[J].西安邮电大学学报,2014,19(3):22-25. 被引量：1

模式识别与人工智能

2008年第6期

浏览历史

内容加载中请稍等...

采用高斯概率分布和支持向量机的说话人确认被引量：2

参考文献12

同被引文献6

引证文献2

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

采用高斯概率分布和支持向量机的说话人确认 被引量：2

参考文献12

同被引文献6

引证文献2

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

采用高斯概率分布和支持向量机的说话人确认被引量：2