WCCN聚类序列核函数在话者识别中的应用被引量：1

A novel WCCN clustering kernel applied in speaker recognition

导出

摘要针对说话人确认系统中GMM超向量建模计算复杂度高以及易受信道干扰的问题,提出一种新型的基于Bhattacharyya距离聚类的WCCN序列核函数算法.首先计算话者GMM模型之间的Bhattacharyya距离,根据该Bhattacharyya距离对话者模型进行聚类,得到聚类中心模型;紧接着对聚类中心模型的均值向量进行MAP自适应,进而生成超向量序列核函数;最后采用WCCN平滑归一化技术对序列核函数进行信道补偿,抑制噪音和信道畸变对核函数的影响.将该Bhattacharyya聚类WCCN核函数应用到SVM说话人确认系统,仿真实验结果表明该核函数可以有效地提高系统的识别准确率和识别速度. A novel WCCN kernel based on Bhattacharyya distance clustering algorithm was proposed in this paper in order to reduce the computation complexity of GMM super-vector,meanwhile the channel interference was removed from speaker verification system.Firstly,the GMM models of speakers were clustered based on Bhattacharyya distance,and clustering center models were obtained.Then super-vector sequence kernel was generated by adapting only mean vectors of these clustering center models.Finally,WCCN was used to restrain the noise and channel distortion effection of this kernel.Our experiment results showed that our new kernel can improve the recognition accuracy and speed.

作者邢玉娟李恒杰胡建军王万军

机构地区甘肃联合大学电子信息工程学院

出处《云南大学学报（自然科学版）》 CAS CSCD 北大核心 2013年第2期167-172,共6页 Journal of Yunnan University(Natural Sciences Edition)

基金甘肃省教育厅基金项目(1113-01)

关键词语音识别 GMM超向量 BHATTACHARYYA距离类内协方差归一化支持向量机 speech recognition GMM super-vector Bhattacharyya distance within class covariance normalization support vector machine

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1侯风雷,王炳锡.基于说话人聚类和支持向量机的说话人确认研究[J].计算机应用,2002,22(10):33-35. 被引量：11
2LONGWORTH C, GALES M. Combining derivative and para- metric kernels for speaker verification [ J ]. IEEE Trans Audio Speech Language Process ,2007,6 ( 1 ) :1-10.
3CAMPBELL W, STURIM D, REYNOLDS D. Support vector machines using GMM supervectors for speaker verification [J]. IEEE Signal Process Itt,2006,13 (5):308-311.
4陆亮.多信道条件下的说话人认证[D].北京:北京邮电大学.2009.
5Vijendra Raj Apsingekar, PHILLIP L, LEON D E. Speaker model clustering for efficient speaker identification in largepopulation applications [ J]. IEEE Transactions on Audio, Speech, and Language Processing,2009,17 (4) :848-853.
6LEE Kong-aik, YOU Chang-huai, LI Hai-zhou, et al. Using discrete probabilities with bhattacharyya measure for SVM - based speaker verification [ J ]. IEEE Transactions on Audio, Speech, and Language Processing, 2011,19 (4) : 861-869.
7CHANG Huai-you, LEE Kong-aik , LI Hai-zhou. GMM - SVM Kernel with a Bhattacharyya - Based Distance for Speaker Rec- ognition [ J ]. IEEE Transactions on Audio, Speech, and Language Processing,2010,18 (6) : 1 300-1 312.
8ANDREW O Hatch, Andreas Stolcke. Generalized linear kernels for one - versus - all classification: application to speaker recognition[ C ]. IEEE International Conference on Acoustics, Speech and Signal Processing, Toulouse, France, 2006, Volume 12:5 443-5 446.
9MATEJKA P, BURGET L, SCHWARZ P, et al. STBU system for the NIST 2006 speaker recognition evaluation [ C ]. IEEE In- ternational Conference on Acoustics, Speech and Signal Processing, Honolulu, USA ,2007:221-224.

二级参考文献1

1Vapnik V . N.The Nature of Statistical Learning Theory ( Second Edition)[]..1999

共引文献10

1王炜,王波,王炳锡.一个新的基于融合的说话人确认系统及DSP的实时实现[J].信号处理,2004,20(6):586-589.
2李琳,张晓龙.支持向量机学习方法的选择与应用[J].武汉科技大学学报,2006,29(1):75-78. 被引量：11
3王睿.关于支持向量机参数选择方法分析[J].重庆师范大学学报（自然科学版）,2007,24(2):36-38. 被引量：39
4刘雪燕,李明,张亚芬.基于PCA和多约简SVM的多级说话人辨识[J].计算机应用,2008,28(1):127-130. 被引量：4
5邢玉娟,李明.基于Fisher分值的特征提取在语音确认中的应用[J].科学技术与工程,2008,8(21):5854-5857.
6邢玉娟,谭萍,李明.一种新的说话人识别序列特征提取方法[J].兰州理工大学学报,2009,35(4):98-102. 被引量：4
7凌萍,王喆,周春光,黄岚.简约支持向量聚类[J].计算机研究与发展,2010,47(8):1372-1381.
8乔春霞.基于DSP的语音信号实时采集与处理的研究[J].科技信息,2010(20). 被引量：1
9李恒杰,邢玉娟.一种新的α-GMM聚类说话人确认算法[J].计算机应用与软件,2012,29(10):191-193. 被引量：1
10张健沛,徐华.支持向量机(SVM)主动学习方法研究与应用[J].计算机应用,2004,24(1):1-3. 被引量：51

同被引文献9

1Campbell W, Sturim D,Reynolds D. Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process Lett, 2006; 13 (5):308-311.
2Longworth C, Gales M. Combining derivative and parametric kernels for speaker verification. IEEE Trans Audio, Speech Language Process, 2007; 6 (1) :1-10.
3Hu Hao, Xu Mingxing, Wu Wei. GMM super-vector based SVM with spectral features for speech emotion recognition. USA: IEEE Interna- tional Conference on Acoustics, Speech and Signal Processing, 2007 ; 4:IV-413-1V-416.
4Solomonoff A, Campbell W M, Boardman I. Advances in channel compensation for swn speaker recognition. International Conference on Acoustics, Speech, and Signal Processing. Pennsylvania, USA: IEEE, 2005 : 1-629-1-632.
5Dehak N. Front-end factor analysis for speaker verification. Audio, Speech and Language Processing, 2011;19(4) : 788-798.
6Gang L V, Zhao Heming. Joint factor analysis of channel mismateh in whispering speaker verification. Archives of Acoustics, 2012; 37 (4) :555-559.
7McLaren M, Van Leeuwen D. Improved speaker reeoguition when using i-vectors from multiple speech sources. IEEE International Con- ference on Acoustics, Speech and Signal Processing, Prague, Czech Republic: IEEE, 2011 : 5460-5463.
8栗志意,何亮,张卫强,刘加.基于鉴别性i-vector局部距离保持映射的说话人识别[J].清华大学学报（自然科学版）,2012,52(5):598-601. 被引量：11
9范冠杰,陈万培,陈才扣,王旻毅.一种融合WPCA与WLDA的人脸识别方法[J].无线电通信技术,2013,39(5):89-92. 被引量：3

引证文献1

1邢玉娟,潘颖,曹晓丽.改进i-向量说话人识别算法研究[J].科学技术与工程,2014,22(34):224-228. 被引量：2

二级引证文献2

1李湾湾,范承志,祁才君.基于改进MFD的I-Vector说话人识别[J].电声技术,2016,40(12):43-48. 被引量：1
2罗家诚.基于改进信道补偿的I-vector说话人识别[J].电子设计工程,2021,29(20):96-100. 被引量：1

1吴文昭.基于i向量的SVM说话人确认[J].兰州文理学院学报（自然科学版）,2016,30(3):53-55.
2邢玉娟,李明.NAP序列核函数在话者识别中的应用[J].计算机工程,2010,36(8):194-196. 被引量：2
3舒毅,邢玉娟.基于i-向量和PCA字典学习稀疏表示的说话人确认[J].计算机工程与应用,2016,52(18):144-147. 被引量：1
4赵桂儒,刘典婷,崔满丰,李丽.基于GMM超向量的行为识别研究[J].现代计算机（中旬刊）,2014(3):20-22.
5周燕,刘韬.基于小波神经网络的话者识别系统研究[J].烟台职业学院学报,2008,14(2):57-61.
6杨成福,章毅.相关向量机及在说话人识别应用中的研究[J].电子科技大学学报,2010,39(2):311-315. 被引量：13
7王聪,周激流,李晓华,郎方年,付翔飞.基于最大类可分离性新颜色空间的肤色检测[J].计算机应用,2008,28(12):3095-3097. 被引量：1
8袁晓琴,黄凤岗,张健沛.基于DCT的人脸特征提取[J].应用科技,2003,30(4):32-33. 被引量：2
9赵红.一种支持向量回归机的音频水印算法[J].漳州师范学院学报（自然科学版）,2012,25(2):34-39. 被引量：1
10赵桂儒,李卫东,刘典婷,吴敏,崔满丰.EM算法的改进及其在行为识别中的应用[J].电视技术,2014,38(13):196-199. 被引量：3

云南大学学报（自然科学版）

2013年第2期

浏览历史

内容加载中请稍等...

WCCN聚类序列核函数在话者识别中的应用被引量：1

参考文献9

二级参考文献1

共引文献10

同被引文献9

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

WCCN聚类序列核函数在话者识别中的应用 被引量：1

参考文献9

二级参考文献1

共引文献10

同被引文献9

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

WCCN聚类序列核函数在话者识别中的应用被引量：1