期刊文献+

变帧长和变帧率在说话人确认中的应用

Application of variable frame length and frame rate in speaker verification system
下载PDF
导出
摘要 从变帧长、变帧率角度考虑提出一种新的提取MFCC的方法。该方法先将帧长和帧率都限制为基音周期的整数倍,即基音同步算法;然后基于变帧率算法的原理在语音特征变化缓慢的地方去除一些帧来降低帧率。在NIST 99说话人评测上进行的说话人确认实验表明,该方法不但提升了系统性能,而且降低了帧率,节省了特征文件的存储空间。 A new method for extracting Mel-Frequency Ceptral Coefficients (MFCC) was proposed from the perspective of variable frame length and frame rate. The proposed method restricted the frame length and frame shift to multiples of pitch period, called pitch synchronous algorithm; then removed some frames where the acoustic feature changed slowly to decrease the frame rate according to the principle of variable frame rate algorithm. With speaker verification experiments on NIST 99 speaker recognition evaluation, the new approach not only improves the system performance but also decreases frame rate, which means saving the storage space of feature files.
作者 王明 肖熙
出处 《计算机应用》 CSCD 北大核心 2007年第8期2051-2052,2076,共3页 journal of Computer Applications
关键词 说话人确认 基音同步 变帧率算法 speaker verification pitch synchronization variable frame rate algorithm
  • 相关文献

参考文献10

  • 1REYNOLDS D,QUATIERI T,DUNN R.Speaker verification using adapted mixture models[J].Digital Signal Processing,2002,10(1-3):181-202.
  • 2QUATIERI T,DUNN B,REYNOLDS D.On the influence of rate,pitch,and spectrum on automatic speaker recognition performance[C]// ICSLP.Beijing:[s.n.],2002:491-494.
  • 3ZILCA R,NAVRATIL J,RAMASWAMY G.Depitch and the role of fundamental frequency in speaker recognition[C]// ICASSP'03,Hong Kong.[S.l.]:IEEE Press,2003,2:81-84.
  • 4ZILCA R,NAVRATIL J,RAMASWAMY G.Syncpitch:A pseudo pitch synchronous algorithm for speaker recognition[C/OL]// EUROSPEECH,Geneva,Switzerland,2003[2007-01-15].http://www.research.ibm.com/CBG/papers/eurospeech03_syncpitch.pdf.
  • 5KIM S,ERIKSSON T,KANG H-G,et al.A pitch synchronous feature extraction method for speaker recognition[C]// ICASSP'04,Montreal,Canada.[S.l.]:IEEE Press,2004,1:405-408.
  • 6SECREST B,DODDINGTON G.An integrated pitch tracking algorithm for speech systems[C]// ICASSP'83,Boston,Massachusetts.[S.l.]:IEEE Press,1983,8:1352-1355.
  • 7ZHU Q F,ALWAN A.On the use of variable frame rate analysis in speech recognition[C]// ICASSP'00,Istanbul,Turkey.[S.l.]:IEEE Press,2000:1783-1786.
  • 8YOU H,ZHU Q F,ALWAN A.Entropy-based variable frame rate analysis of speech signals and its application to ASR[C]// ICASSP'04,Montreal,Canada.[S.l.]:IEEE Press,2004:549-552.
  • 9MARTIN A,PRZYBOCKI M.The NIST 1999 speaker recognition evaluation an overview[J].Digital Signal Processing,2000,10(1):1-18.
  • 10MARTIN A,DODDINGTON G,KAMM G,et al.The DET curve in assessment of detection task performance[C]// Proceedings of 5th European Conference on Speech Communication and Technology (Eurospeech'97).Rhodes,Greece:[s.n.],1997:1895-1898.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部