期刊文献+

能量参数解码端HMM估计算法

HMM estimation of energy contours in speech decoders
原文传递
导出
摘要 在低速率语音编码算法中,如何对特征参数进行有效的量化表示是影响声码器合成语音质量的关键因素。该文提出一种能量参数解码端恢复算法,它利用线谱频率(linespectral frequency,LSF)和清浊音判决参数(unvoiced/voiced decision,U/V)估计能量参数的变化轨迹。该算法利用特征参数之间的相关性,采用隐Markov模型(hiddenMarkov model,HMM)描述LSF、U/V和能量参数之间的统计特性,通过对能量进行解码端恢复,省去量化所需的比特数,从而提高特征参数的整体量化性能。测试结果表明:能量参数解码端恢复算法能够将150b/s混合激励线性预测编码算法(mixed excitation linear prediction,MELP)的合成语音平均意见得分(mean opinion score,MOS)提高0.042。该算法应用于超低速率声码器参数量化是可行的。 Low bit rate speech coding must effectively quantize the parameters.This article presents an energy contour estimation algorithm to predict changes of speech energy from the line spectral frequency(LSF) and the unvoiced/voiced(U/V) decision parameters.The statistical properties of the energy,the LSF and the U/V decision parameters are characterized based on the hidden Markov model(HMM) which uses the correlations between different parameters.The algorithm properly estimates the energy contour,which contributes to quantization of the decoder parameters.Tests show that the energy contour estimation algorithm improves the mean opinion score(MOS) of the synthesized speech for the mixed excitation linear prediction(MELP) vocoder at a 150 b/s bit rate by 0.042,which shows that this algorithm improves parameter quantization in ultra low bit rate vocoders.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2013年第6期869-872,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家"八六三"高技术项目(2011AA010601)
关键词 语音信号处理 能量参数 隐MARKOV模型 线谱频率参数 参数编码 speech signal processing energy parameter hidden Markov model(HMM) line spectral frequency(LSF) parametric coding
  • 相关文献

参考文献12

  • 1Kondoz A M. Digital Speech: Coding for Low Bit Rate Communication Systems [M]. Chichester, UK: John Wiley Sons, 2004.
  • 2Engan K, Aase S O, Husoy J H. Multi-frame compression: Theory and design [J]. EURASIP Signal Process, 2000, 80(10) : 2121 - 2140.
  • 3Eriksson T, Linden J, Skoglund J. Interframe LSF quantization for noisy channels [J]. IEEE Trans Speech and Audio Processing, 1999, 7(5) : 495 - 509.
  • 4Hagen R, Hedelin P. Low bit-rate spectral coding in CELP, a new LSP method [C]// Proc Int Conf Acoustic, Speech, Signal Processing (ICASSP). Albuquerque, NW, USA, 1990: 189-192.
  • 5Soong F, Juang B. Optimal quantization of LSP parameters using delayed decisions [C]// Proc lnt Conf Acoustic, Speech, Signal Processing (ICASSP). Albuquerque, NM, USA, 1990: 185-188.
  • 6ZHAO Ming, TANG Kun, CUI Huijuan. Mode-based quantization of LP parameters for very low bit rate vocoder [C]// International Conf on Communications, Circuits and Systems and West Sino Expositions. Chengdu, China, 2002: 28 - 31.
  • 7Tokuda K, Masuko T, Hiroi J, et al. A very low bit rate speech coder using HMM-based speech recognition synthesis techniques [C]// Proc Int Conf Acoustic, Speech, Signal Processing (ICASSP). Seattle, WA, USA, 1998: 609-612.
  • 8Hoshiya T, Sako S, Zen H, et al. Improving the performance of HMM-based very low bit rate speech coding [C]// Proc Int Conf Acoustic, Speech, Signal Processing (ICASSP). Hong Kong, China, 2003: 800- 803.
  • 9魏旋.参数相关超低速率语音编码算法[D].北京:清华大学,2010.
  • 10Rabiner L R, Juang B H. Fundamentals of Speech Recognition [M]. Upper Saddler River, NJ, USA: Prentice Hall, 1993.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部