期刊文献+

A non-linear frequency transform and its application to speaker recognition 被引量:1

A non-linear frequency transform and its application to speaker recognition
原文传递
导出
摘要 Based on analyzing contribution of short-time spectrum in different frequency subbands to speaker recognition and using of polynomial curve matching techniques, a non-linear frequency transform and feature detection algorithm are proposed to highlight the speaker's individuality in short-time spectrum of speech. The experimental results show that the performance of speaker recognition system is improved effectively, the average error rate of recognition relatively falls about 70.5%, 60.8% and 70.5% in comparison with classical frequency transform of Mel, Bark and ERB (Equivalent Rectangular Bandwidth) respectively. Based on analyzing contribution of short-time spectrum in different frequency subbands to speaker recognition and using of polynomial curve matching techniques, a non-linear frequency transform and feature detection algorithm are proposed to highlight the speaker's individuality in short-time spectrum of speech. The experimental results show that the performance of speaker recognition system is improved effectively, the average error rate of recognition relatively falls about 70.5%, 60.8% and 70.5% in comparison with classical frequency transform of Mel, Bark and ERB (Equivalent Rectangular Bandwidth) respectively.
出处 《Chinese Journal of Acoustics》 2009年第3期280-288,共9页 声学学报(英文版)
  • 相关文献

参考文献6

二级参考文献50

  • 1俞一彪,王朔中.基于互信息匹配模型的说话人识别[J].声学学报,2004,29(5):462-466. 被引量:8
  • 2Pandey P C, Bhandorkar S M. Enhancement of alaryngeal speech using spectral subtraction. Digital Signal Processing, 2002; 12(2): 591-594
  • 3Zhong Lin, Rafik Goubran. Musical noise reduction in speech using two-dimensional spectrogram enhancement.Proceedings of HAVE, 2003; 20(5): 61-64
  • 4Tadj C, Gabrea M. Towards robustness in speaker verification: Enhancement and adaptation. Midwest Symposium on Circuits and Systems, 2002; 3(3): 320-323
  • 5Soon I Y, Koh S N. Speech enhancement using 2-D Fourier transform. IEEE Transactions on Speech and Audio Processing, 2003; 11(6): 717-724
  • 6Douglas Reynolds A. Speaker identification and verification using Gaussian mixture speaker models. Speech Communication, 1995; 17(1): 91-108
  • 7Matsui T, Furui S. Concatenated phoneme models for text variable speaker recognition. ICASSP. 1993; 2(2): 391-394
  • 8Markov K, Nakagawa S.Text-independent speaker recognition system using frame level likelihood processing. Technical Report of IEICE, 1996; 96(17): 37-44
  • 9Ke Chen. Towards better making a decision in speaker verification. Pattern Recognition, 2003; 36(2) : 329-346
  • 10Reynolds D A, Rose R C. Robust text-independent speaker identification using Gaussian mixture speaker models.IEEE Trans. On Speech and Audio Processing, 1995; 3(1):72-83

共引文献63

同被引文献6

引证文献1

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部