
Gaussian sequence kernel support vector machine for speaker recognition (cited by: 5)
Abstract: Speaker recognition has both theoretical value and far-reaching practical significance. Building on the theory of support vector machine (SVM) kernel methods, this paper combines the SVM with the conventional Gaussian mixture model (GMM) to construct an SVM based on a Gaussian sequence kernel. Much of the flexibility and classification power of an SVM lies in the ability to choose a kernel function suited to the problem at hand. During recognition, a feature-space normalization technique, Nuisance Attribute Projection (NAP), is introduced to compensate for the feature differences that different channels and environments introduce for the same speaker. Experiments on the National Institute of Standards and Technology (NIST) 2004 evaluation data set show that the method substantially improves the recognition rate.
Authors: Li Jie (李杰), Liu Heping (刘贺平)
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2010, No. 18, pp. 183-185 (3 pages)
Keywords: support vector machine; Gaussian linear kernel; Gaussian non-linear kernel; Nuisance Attribute Projection (NAP); speaker recognition
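Illustration: the NAP compensation step mentioned in the abstract can be sketched as below. This is a minimal sketch under stated assumptions, not the authors' implementation. The function names, the use of fixed-length GMM supervectors as input, and the estimation of nuisance directions by an SVD of within-speaker-centered data are all assumptions made for illustration; the record itself gives no formulas.

```python
import numpy as np


def nap_projection(supervectors, speaker_ids, n_nuisance):
    """Estimate a projection P = I - U U^T that removes nuisance directions.

    Assumption: each row of `supervectors` is a fixed-length vector for one
    utterance (for example, stacked GMM means), and `speaker_ids` labels
    which speaker produced it.
    """
    X = np.asarray(supervectors, dtype=float)
    ids = np.asarray(speaker_ids)

    # Subtract each speaker's mean so that only within-speaker
    # (channel/session) variation remains.
    centered = np.empty_like(X)
    for spk in np.unique(ids):
        mask = ids == spk
        centered[mask] = X[mask] - X[mask].mean(axis=0)

    # Leading right singular vectors span the dominant nuisance subspace.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    U = vt[:n_nuisance].T  # shape (dim, n_nuisance), orthonormal columns

    return np.eye(X.shape[1]) - U @ U.T


def compensated_linear_kernel(a, b, P):
    """Linear kernel between two supervectors after NAP compensation."""
    return float((P @ np.asarray(a, dtype=float)) @ (P @ np.asarray(b, dtype=float)))


# Toy data: axis 0 carries speaker identity, axis 1 carries channel effects.
X = np.array([[1.0, 2.0, 0.0],
              [1.0, -2.0, 0.0],
              [-1.0, 2.0, 0.0],
              [-1.0, -2.0, 0.0]])
ids = np.array([0, 0, 1, 1])
P = nap_projection(X, ids, n_nuisance=1)
```

In this toy case the projection removes the channel axis, so two utterances of the same speaker recorded over different channels map to the same compensated vector. A kernel matrix built with `compensated_linear_kernel` could then be passed to any SVM trainer that accepts precomputed kernels.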


