期刊文献+

基于FMFCC和HMM的说话人识别 被引量:8

Speaker Recognition Based on FMFCC and HMM
下载PDF
导出
摘要 美尔频率倒谱系数(MFCC)是说话人识别中常用的特征参数,而语音信号是非平稳信号,MFCC并不能很好的反映语音的时频特性。针对这一缺陷,为了提高说话人的识别率,结合新的时频分析工具分数傅立叶变换(FRFT)。将MFCC推广到分数形式,得到分数美尔频率倒谱系数(FMFCC),用以表征语音信号的特征;并利用可分性测度验证了特征参数的有效性;通过建立20个不同说话人的FMFCC特征库,采用隐马尔可夫模型(HMM)对说话人进行仿真识别。仿真结果表明,在合适的变换阶次下,说话人的平均识别率可达93%以上。 Mel frequency cepstral coefficient (MFCC) is a frequently - used characteristic in speaker recogniton. In evidence, speech are non - stationary signals, the time - frequency characteristic of speech is not clearly expressed through MFCC. Thus, in the calculation of MFCC parameter, fractional Fourier transform (FRFT) is adopted to replace discrete Fourier transform. Then fractional Mel frequency cepstral coefficient (FMFCC) is acquired, and the effectivity of the parameter is verified. Finally, the Hidden Markov Model (HMM) of 20 different speakers is estab- lished, and speaker identification is performed. The simulation shows that in different transform orders, the average of right speaker recognition rate is up to 93%.
出处 《计算机仿真》 CSCD 北大核心 2010年第5期352-354,358,共4页 Computer Simulation
基金 南昌航空大学校基金(EC200604057)
关键词 分数傅立叶变换 频率倒谱系数 隐马尔可夫模型 Fractional Fourier transform Frequency cepstral coefficient Hidden Markov model
  • 相关文献

参考文献9

  • 1赵力.语音信号处理[M].北京:机械工业出版社,2004,236-253.
  • 2J W Picone. Signal Modeling Techniques in Speech Recognition [C]. Proc. IEEE, 1993, 81(9) :1215 -1247.
  • 3张永亮,曾以成.一种分数余弦变换及应用[J].通信学报,2005,26(9):111-115. 被引量:1
  • 4R G Dorsch, A W Lohmann, Bitran D Mendlovic. Chirp Filtering in the Fractional Fourier Domain [ J ]. Appl Opt. 1994,33 : 7599 - 7602.
  • 5H M Ozaktas, D Mendlovic. Fractional Fourier Optics[J]. J. Opt. Soc. Am. A. 1995, 12:743-751.
  • 6D Mendlovic, H M Ozaktas. Fractional Fourier Transformations and Their Optical Implementation: I[J]. J. Opt. Soc. Amer. A. 1993,10:1875 - 1881.
  • 7V Namias. The Fractional Order Fourier Transform and Its Application to Quantum Mechanics [ J ]. J. Inst. Math, 1980,25 : 241 - 265.
  • 8P Somervuo, A Harma, S Fagerlund. Parametric Representations of Bird Sounds for Automatic Species Recognition [ J ]. IEEE Transactions on audio, speech, and language processing, 2006,14 (6) :2252 - 2263.
  • 9Lu Yu, Lenan Wu. Comments on A Separable Low Complexity 2D HMM with Application to Face Recognition[J]. Pattern Analysis and Machine Intelligence. IEEE Transactions on audio, speech, and language processing. 2007, 29(2) :368 -368.

二级参考文献9

  • 1OZAKTAS H M, MENDLOVIC D. Fractional Fourier optics[J].Optical Society of American, 1995, 12(4): 743-751.
  • 2SHIH C C. Fractionalization of Fourier transforms[J]. Optics Communications, 1995, 118(5): 495-498.
  • 3LIU S T, ZHANG J D, ZHANG Y. Properties of fractionalization of a Fourier transform[J]. Optics Communications, 1997, 113(1): 50-54.
  • 4NAMIAS V. The fractional order Fourier transform and its application to quantum mechanics[J]. J Inst Maths Applics, 1980, 25: 241-265.
  • 5DORSCH R G, LOHMANN A W, BITRAN Y. Chirp filtering in the fractional Fourier domain[J]. Applied Optics, 1994, 33(32): 7599-7602.
  • 6LOHMANN A W, MENDLOVIC D, ZALEVSKY Z. Some important fractional transforms for signal processing[J]. Optics Communications,1996, 125(4): 18-20.
  • 7PEI S C, DING J J. Fractional cosine sine and Hartley transforms[J].IEEE Transactions on Signal Processing, 2002, 50(7): 1661-1680.
  • 8PEI S C, YEH M H. The discrete fractional cosine and sine transforms[J]. IEEE Transactions on Signal Processing, 2001, 49(6):1198-1207.
  • 9王秋生,孙圣和.一种在数字音频信号中嵌入水印的新算法[J].声学学报,2001,26(5):464-467. 被引量:58

共引文献2

同被引文献121

引证文献8

二级引证文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部