A formant extraction algorithm of speech signal based on weighted Mel-cepstrum (cited by: 6)

Abstract: This paper proposes an algorithm that extracts the formants of a speech signal using the weighted Mel-cepstrum. First, weighted Mel-cepstrum analysis is applied to the short-time speech signal to obtain weighted Mel-cepstrum coefficients (WMCCs), which contain the main components of the spectrum. Next, a discrete cosine transform (DCT) based smoothing algorithm derives the spectral envelope from the WMCCs, and the peak positions of the envelope give the candidate formants. Finally, the formant estimates are selected from the candidates according to the formant continuity constraint and the formant frequency range. Experiments show that the algorithm yields smaller formant errors than the cepstrum-based method and is fairly robust on noisy speech.
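Source: Journal of Northwest Normal University (Natural Science) (《西北师范大学学报(自然科学版)》), indexed in CAS and the Peking University Core Journals list, 2014, No. 1, pp. 53-57 (5 pages)
Funding: National Natural Science Foundation of China (61263036); Gansu Province Outstanding Youth Fund (1210RJDA007)
Keywords: weighted Mel-cepstrum; formant; DCT; robustness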
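
The three stages named in the abstract (envelope smoothing, peak picking, constrained selection) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify the perceptual weighting or the Mel warping, so a plain log-magnitude spectrum stands in for the weighted Mel-cepstrum analysis. The function names, the frequency ranges in `FORMANT_RANGES`, the `max_jump` continuity threshold, and the default `n_fft` and `n_coef` values are all assumptions made for illustration.

```python
import numpy as np

# Illustrative search ranges in Hz for F1..F3; assumed values, not taken from the paper.
FORMANT_RANGES = [(200.0, 1000.0), (600.0, 2800.0), (1500.0, 3500.0)]

def dct_smooth(log_spec, n_coef):
    """Smooth a log spectrum by keeping only its first n_coef DCT-II coefficients."""
    n_bins = len(log_spec)
    k = np.arange(n_coef)[:, None]
    n = np.arange(n_bins)[None, :]
    basis = np.cos(np.pi * k * (2 * n + 1) / (2 * n_bins))
    coef = (2.0 / n_bins) * (basis @ log_spec)
    coef[0] *= 0.5  # the DC term carries half the weight in the inverse transform
    return coef @ basis  # truncated inverse DCT gives the smooth envelope

def frame_envelope(frame, n_fft=1024, n_coef=30):
    """Stand-in for the WMCC step: envelope of the plain log-magnitude spectrum."""
    windowed = frame * np.hamming(len(frame))
    magnitude = np.abs(np.fft.rfft(windowed, n_fft))[:-1]  # drop the Nyquist bin
    return dct_smooth(np.log(magnitude + 1e-10), n_coef)

def candidate_formants(envelope, fs, n_fft):
    """Return the frequencies (Hz) of the local maxima of the envelope."""
    mid = envelope[1:-1]
    is_peak = (mid > envelope[:-2]) & (mid >= envelope[2:])
    bins = np.nonzero(is_peak)[0] + 1
    return bins * fs / n_fft

def select_formants(candidates, prev=None, max_jump=300.0):
    """Pick one candidate per formant using range and continuity constraints."""
    chosen, floor = [], 0.0
    for i, (lo, hi) in enumerate(FORMANT_RANGES):
        # Range constraint, plus ordering: each formant lies above the previous one.
        pool = [float(f) for f in candidates if lo <= f <= hi and f > floor]
        if prev is not None and not np.isnan(prev[i]):
            # Continuity constraint: prefer candidates close to the previous frame.
            near = [f for f in pool if abs(f - prev[i]) <= max_jump]
            pool = near or pool
            pick = min(pool, key=lambda f: abs(f - prev[i])) if pool else float("nan")
        else:
            pick = min(pool) if pool else float("nan")
        chosen.append(pick)
        if not np.isnan(pick):
            floor = pick
    return chosen
```

A frame would be processed as `select_formants(candidate_formants(frame_envelope(frame), fs, 1024), prev)`, with `prev` holding the estimates from the preceding frame. Truncating the DCT to a few coefficients trades frequency resolution for smoothness, so `n_coef` would need tuning against the sampling rate in any real use.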
References (9)

  • 1 LU G, ZHAO H. Developments of the research of the formant tracking algorithm [J]. Computer and Information Science, 2010, 3(1): 68-71.
  • 2 CODELLO I, KUNISZYK-JOZKOWIAK W. Formant paths tracking using linear prediction based methods [J]. Annales UMCS Informatica AI, 2010, 10(2): 7-12.
  • 3 ZHAO Yi, YIN Xuefei, CHEN Kean. A new cepstrum-based formant frequency detection algorithm [J]. Applied Acoustics (应用声学), 2010, 29(6): 416-424. Cited by: 9
  • 4 KOISHIDA K, TOKUDA K, KOBAYASHI T, et al. CELP speech coding based on mel-generalized cepstral analyses [J]. Electronics and Communications in Japan, 2000, 83(5): 32-41.
  • 5 YANG H, HUANG D, CAI L. Perceptually weighted mel-cepstrum analysis of speech based on psychoacoustic model [J]. IEICE Transactions on Information and Systems, 2006, 89(12): 2998-3001.
  • 6 HUANG Dezhi, YANG Hongwu, CAI Lianhong. Weighted mel-cepstrum analysis of speech signals [J]. Signal Processing (信号处理), 2006, 22(6): 840-843. Cited by: 4
  • 7 ZHAO Ming, CUI Huijuan, TANG Kun, DU Wen. A smoothing algorithm for spectral envelope parameters [J]. Journal of Tsinghua University (Science and Technology) (清华大学学报(自然科学版)), 2005, 45(4): 448-451. Cited by: 5
  • 8 CHEN Ning, WAN Maowen. A piecewise linear prediction algorithm for formant frequency estimation of speech signals [J]. Computer Engineering and Applications (计算机工程与应用), 2009, 45(28): 156-159. Cited by: 1
  • 9 DUCKWORTH M, MCDOUGALL K, DE JONG G, et al. Improving the consistency of formant measurement [J]. International Journal of Speech, Language and the Law, 2011, 18(1): 35-51.
