Speaker verification system based on M-vector and support vector machine (cited by: 2)

Abstract: This paper proposes a new speaker verification system that feeds M-vectors into a support vector machine (SVM). The M-vectors are derived from multiple maximum likelihood linear regression (MLLR) speaker transformations, one estimated for each subspace of the universal background model (UBM) from the given speech data. Nuisance attribute projection (NAP) is integrated into the SVM kernel function so that channel compensation is applied to the M-vectors directly in the kernel space, which improves robustness to channel and session variability between training and testing. Compared with the traditional phone-class-based MLLR-SVM and I-vector-SVM baseline systems, the proposed system performs almost as well as the best baseline, yet it requires neither the estimation of factor-space statistics nor an automatic speech recognition system, which is complex, computationally expensive, and dependent on large amounts of transcribed speech. On the NIST SRE2008 evaluation data, the proposed system performs close to the state-of-the-art I-vector-based system and is strongly complementary to it: after system fusion, the equal error rate (EER) drops by a relative 13.3%.
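The nuisance attribute projection step mentioned in the abstract can be illustrated with a minimal sketch. This record does not give the paper's exact formulation, so the code below assumes the commonly used form P = I - UUᵀ, where the columns of U span the dominant within-speaker (channel and session) directions. The function names `nap_projection` and `nap_linear_kernel` are hypothetical, and ordinary NumPy arrays stand in for real M-vectors.

```python
import numpy as np


def nap_projection(vectors, speaker_ids, rank):
    """Estimate a NAP matrix P = I - U U^T (assumed standard form).

    vectors:     (n_utterances, dim) array, one supervector per utterance
    speaker_ids: length-n array of speaker labels
    rank:        number of nuisance directions to remove
    """
    vectors = np.asarray(vectors, dtype=float)
    speaker_ids = np.asarray(speaker_ids)

    # Subtract each speaker's mean so that only within-speaker
    # (session/channel) variation is left in the data.
    centered = np.empty_like(vectors)
    for spk in np.unique(speaker_ids):
        mask = speaker_ids == spk
        centered[mask] = vectors[mask] - vectors[mask].mean(axis=0)

    # The top right-singular vectors of the centered data are the
    # directions carrying the most within-speaker variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    u = vt[:rank].T  # (dim, rank), orthonormal columns

    return np.eye(vectors.shape[1]) - u @ u.T


def nap_linear_kernel(x, y, projection):
    """Linear kernel on NAP-compensated vectors.

    Because P is symmetric and idempotent, (Px)·(Py) equals x^T P y.
    """
    return float(x @ projection @ y)
```

A Gram matrix built with `nap_linear_kernel` could then be handed to any SVM implementation that accepts precomputed kernels. Folding the projection into the kernel, rather than transforming the vectors beforehand, matches the abstract's description of compensating "directly in the kernel space".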

Authors: Long Yanhua (龙艳花), Dai Lirong (戴礼荣)

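Affiliations: Department of Electrical Information, Shanghai Normal University; Department of Electronic Engineering and Information Science, University of Science and Technology of China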

Source: Journal of Huazhong University of Science and Technology (Natural Science Edition), 2014, No. 8, pp. 63-68 (6 pages). Indexed in: EI, CAS, CSCD, Peking University Core Journals.

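Funding: National High Technology Research and Development Program of China, special fund (2012CB326405); National Natural Science Foundation of China (61273264); Shanghai Sailing Program for Young Science and Technology Talents (14YF1409300)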

Keywords: speech recognition; speaker verification; maximum likelihood linear regression; nuisance attribute projection; support vector machines; M-vector

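Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]; TN912.34 [Electronics and Telecommunications: Communication and Information Systems]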
