期刊文献+

基于一种新特征参数的说话人识别 被引量:4

Speaker recognition based on a new feature parameter
下载PDF
导出
摘要 针对传统的特征参数Mel频域倒谱系数MFCC难以满足语音信号的非平稳性问题,提出一种基于小波分析的新特征参数FPBW的提取方法.为了提高训练速度,采用正交高斯混和模型,将正交变换改到最大期望EM算法之前进行,从而减少训练时间.实验结果表明,新的特征参数FPBW优于特征参数MFCC,并且采用正交高斯混合模型进一步提高了识别性能和训练速度. Aimed at the problem that the traditional feature parameters MFCC (reel-frequency ceptrum coefficients) was hard to satisfy the non-stationary characteristic of speech signal, a method was proposed for extraction of a new feature parameter FPBW based on wavelet analysis. In order to improve training speed, an orthogonal Guass mixture model (OGMM) was employed in order that the orthogonal transform was to be performed before the use of expectation maximization algorithm, so that the training time was reduced. The experiment results showed that a new feature vector FPBW was better than MFCC, and the OGMM could further improve the recognition performance and training speed.
出处 《兰州理工大学学报》 CAS 北大核心 2008年第1期68-71,共4页 Journal of Lanzhou University of Technology
基金 甘肃省信息化专项基金
关键词 说话人识别 MFCC FPBW正交高斯混合模型 speaker recognition MFCC FPBW orthogonal Gauss mixture model
  • 相关文献

参考文献10

二级参考文献28

  • 1楼红伟,胡光锐.基于Teager能量算子和小波变换的语音识别特征参数[J].上海交通大学学报,2003,37(z1):83-85. 被引量:2
  • 2李苇营,易克初,胡征.神经网络与HMM构成的混合网络在语音识别中应用的研究[J].电子学报,1994,22(10):73-80. 被引量:8
  • 3林遂芳,潘永湘,孙旭霞.基于HMM和小波网络模型的抗噪语音识别方法[J].系统仿真学报,2005,17(7):1720-1723. 被引量:13
  • 4边肇祺.模式识别[M].清华大学出版社,1999..
  • 5Gowdy J N, Tufekci Z. Mel-Scaled discrete wavelet coefficients for speech recognition [EB/OL]. http:∥ieeexplore.ieee.org/ie15/6939/18687/00861829.pdf, 2000-06-01/2004-02-06.
  • 6Torres H M, Rufiner H L. Automatic speaker identification by means of Mel cepstrum, wavelets and wavelet packets [EB/OL]. http:∥ieeexplore.ieee.org/ie15/7218/19434/00897886.pdf, 2000-07-01/2004-02-08.
  • 7Farooq O, Datta S. Mel filter-Like admissible wavelet packet structure for speech recognition [J]. IEEE Signal Processing Letters, 2001, 8(7): 196-198.
  • 8Reynodls D, Rose R. Robust text-independent speaker identification using Gaussian mixture speaker models [J]. IEEE Trans on Speech and Audio processing, 1995, 3(1): 72-83.
  • 9HaykinS.神经网络的综合基础[M].北京:清华大学出版社,培生教育集团,2001..
  • 10刘贵忠,邸双亮.小波分析及应用[M].第一版,西安:西安电子科技大学出版社,1992

共引文献33

同被引文献35

引证文献4

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部