期刊文献+

基于CFCC-PCA的说话人辨识方法

Speaker Identification Based on CFCC-PCA
下载PDF
导出
摘要 针对说话人训练和识别时间长、噪音环境下识别率低的问题,提出一种CFCC-PCA特征参数的说话人辨识方法。首先提取具有听觉特性的CFCC特征参数,然后对其进行PCA变换,找出具有分辨能力的参数,最后再用这些参数在云服务器中训练和识别说话人。实验表明:该方法可以提高说话人辨识的鲁棒性和识别率,云服务可提高系统实时性。 Training speaker system and speaker identification need a long time, and in the noise environment, the recognition rate is very low, A CFCC-PCA characteristic parameter method is proposed.Firstly, the acoustic characteristics of CFCC characteristic parameters are extracted.Then, CFCC-PCA parameters are extracted by PCA transformation of CFCC characteristic parameters.Finally the speaker models are trained and recognized in cloud.Experiments show that the CFCC-PCA characteristic parameters can improve the robustness and recognition rate of the speaker, the cloud services with efficient processing ability to improve system real-time performance.
出处 《成都工业学院学报》 2015年第2期32-34,共3页 Journal of Chengdu Technological University
基金 中山市科技发展专项基金项目"基于云计算的生物身份认证技术研究及应用"(2013A3FC0350) 中山市科技发展专项基金项目"基于中山地貌的最优化无线网络模型研究"(2013A3FC0318)
关键词 CFCC-PCA 说话人辨识 支持向量机 云服务器 CFCC-PCA speaker identification Support Vector Machine( SVM) cloud server
  • 相关文献

参考文献10

  • 1JAIN A K,HONG L,KULKARNI Y A. Muhimodal biometric sys-tem using fingerprints, face and speech [ C ]//2nd Int'l Confcreneeon Audio-and Video-based Biometric Person Authentication, Washington I). C', !999 -182 -187:.
  • 2曹洁,余丽珍.改进的说话人聚类初始化和GMM的多说话人识别[J].计算机应用研究,2012,29(2):590-593. 被引量:6
  • 3GARAU G, DIELMANN A, BOURLARD H. Audio-visual synchroni- sation for speaker diarisation [ C ]// Proc of International Conference on Speech and language Processing. Makuhari, Chiba: [ s n. ] ,2010: 2654 - 2657.
  • 4LI Q, HUANG Y. An Auditory-based robust speaker identification under feature extraction algorithm for mismatched conditions [ J ]. Audio, Speech, and Language Processing, IEEE Transactions on, 2010,19(6) : 1791 -1801.
  • 5TSAIW H, CHHEN S S, WANG H M. Automatic speaker clutering using a voice characteristic reference space and maximum purity estination[ J ]. IEEE Transactions on Audio Speech and Languager Processing,2013,15 (4) : 1461 - 1471.
  • 6LIUM H,XIEY L,YAO Z Q,et al. A new hybrid GMM /SVM for speaker verification [ C ]// The 18th International Conference on Pattern Recognition, Hang Kong: IEEE Press,2006:314 - 317.
  • 7ZHANG W F,YANG Y C,WU Z H,Exploition PCA classifiers to speaker recognition [C ]//Proceddings of the International Joint Conference on the Neural Networks Portland IEEE Press,2003 (1):820- 823.
  • 8BURGES C L C. A tutorial on support vector machines for pattern recognition [ J ]. Data Mining and Knowledge Discovery, 1998,2 ( 2 ) : 121 - 167.
  • 9GAO Y,JIN L W,HE C ,et al. Handwriting character recognition as a service: a new handwriting recognition system based on cloud Computing[ C ]//Document Analysis and Recognition ( ICDAR ), 2011 International Conference on ,2011:885 - 889.
  • 10罗希,刘锦高.基于NIOS的ANN语音识别系统[J].计算机系统应用,2009,18(12):144-146. 被引量:3

二级参考文献13

  • 1邓菁.电话信道下多说话人识别研究[D].北京:清华大学,2007.
  • 2Lee L, Rose RC. Speaker normalization using efficient frequency warping procedures. IEEE Int. Conf. on Acoustics, Speech and Signal Processing. Atlanta. 1996.353 - 356.
  • 3Rabiner L, Juang BH. Fundamentals of Speech Recognition. Prientice Hall PTR, 1993.11 - 54.
  • 4WOOTERS C, HUIJBREGTS M. The ICSI RT07s speaker diarization system[ J]. Multimodal Technologies for Perception of Humans, 2008,4625:509-519.
  • 5GARAU G,BOURLARD H. Using audio and visual cues for speaker diarisation initialization [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. [ S. 1. ] :IEEE Signal Pro- cessin~ Society,2010:4942-4945.
  • 6HUNG H,HUANG Yan, FRIEDLAND G, et al. Estimating the dom- inant person in multi-party conversations using speaker diarization strategies [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. [ S. 1. ] : IEEE Press,2008:2197-2200.
  • 7FRIEDLAND G, HUNG H, YEO C. Multi-modal speaker diarization of real-world meetings using compressed-domain video features[ C ]/! Proc of International Conference on Audio, Speech and Signal Proces- sing. [ S. 1. ] :IEEE Press,2009:4069-4072.
  • 8HUNG H, FRIEDLAND G. Towards audio-visual on-line diarization of participants in group meetings[ C ]//Proc of Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications. Mar- seille : European Conference on Computer Vision,2008 : 1-12.
  • 9HUNG H, HUANG Yan, FRIEDLAND G, et al. Estimating domi- nance in multi-party meetings using speaker diarization [ J ]. IEEE Yrans on Audio, Speech and Language Processing, 2010, 19 (4) :84?-860.
  • 10NOULAS A, ENGLEBIENNE G, KROSE B. Multi-modal speaker di- arisation[ J]. IEEE Trans on Pattern Analysis and Machine In- telligence,2011,34( 1 ) :79-93.

共引文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部