期刊文献+

基于隐马尔可夫模型的能量参数预测量化算法 被引量:2

HMM-Based Prediction and Quantization Algorithms for Energy Parameters
下载PDF
导出
摘要 为了充分利用能量与线性预测编码(Linear prediction coding,LPC)系数之间的相关性,提高能量参数量化效率,提出了一种基于隐马尔可夫模型(Hidden Markov model,HMM)的能量参数预测量化算法。通过适当假设,使用HMM模拟能量参数和LPC系数之间的相关性,其中离散化后的能量参数组成隐状态序列,量化后的LPC系数组成可观测序列。然后利用HMM预测每一超帧中的能量参数的变化轨迹,并根据预测出的能量轨迹对预测残差进行分模式矢量量化(Mode-based vector quantization,MBQ)。仿真实验中能量参数量化后的平均失真为2.668 dB,与线性预测量化算法相比下降了14.0%,表明本文算法通过利用能量参数与LPC系数的相关性,能够有效地提高能量参数量化效率。 To use the correlation between energy parameters and linear prediction coding(LPC) coefficients,and to quantize the energy parameters more efficiently,hidden Markov model(HMM) based prediction and quantization algorithms are proposed.HMM is used to model the correlation between the energy and the LPC coefficients under appropriate assumptions.In HMM,the discretized energy parameters constitute hidden state sequences and the quantized LPC coefficients constitute observation sequences.HMM is used to predict the energy contour of each super frame,and then mode-based vector quantization(MBQ) is applied to quantize the energy prediction errors according to the predicted energy contour.Experimental result shows that the average quantization distortion is 2.668 dB,which is reduced by 14.0% comparing with linear prediction and quantization algorithms.It implies that the proposed algorithms can improve the energy quantization efficiency by using the correlation between energy parameters and LPC coefficients.
出处 《数据采集与处理》 CSCD 北大核心 2011年第2期123-127,共5页 Journal of Data Acquisition and Processing
基金 国家自然科学基金(60572081)资助项目
关键词 语音编码 低速率 隐马尔可夫模型 分模式量化 speech coding low bit-rate hidden Markov model mode-based quantization
  • 相关文献

参考文献10

  • 1Ovens M J, Ponting K M, Turner M E. Ultra low bit rate voice coding[C]//IEE Seminar on Speech Coding for Algorithms for Radio Channels. London: IEE Press, 2000:1-15.
  • 2闵刚,张雄伟,杨吉斌,安云峰.一种采用混合激励的超低速率分段声码器[J].数据采集与处理,2009,24(5):680-685. 被引量:3
  • 3邹霞,何俊,张雄伟.基于线谱对高效矢量量化的0.6kb/s语音编码算法[J].解放军理工大学学报(自然科学版),2008,9(2):114-118. 被引量:2
  • 4Samuelsson J, Hedelin P. Recursive coding of spectrum parameters[J]. IEEE Trans on Speech and Au- dio Processing, 2001,9 (5) : 492-503.
  • 5邹霞,张雄伟.线谱对参数预测多级矢量量化联合优化算法[J].数据采集与处理,2008,23(2):186-190. 被引量:3
  • 6Eriksson T, Linden J, Skoglund J. Interframe LSF quantization for noisy channels[J]. IEEE Trans on Speech and Audio Processing, 1999,7 (5) : 495-509.
  • 7Zhao M, Tang K, Cui H. Mode-based quantization of LP parameters for very low bit rate vocoder[C]// International Conference on Communications, Circuits and Systems and West Sino Expositions. Chengdu : IEEE Press, 2002 : 28-31.
  • 8Wei X, Dang X, Cui H, et al. Voiced/unvoiced classification recovery in the speech decoder based on GMM[C]//ICSP. Beijing: IEEE Press, 2008,546- 548.
  • 9Rabiner L, Juang B H. Fundamentals of speech recognition[M]. New Jersey: Prentice-Hall, 1993: 321-386.
  • 10LeBlanc W P, Bhattacharya B, Mahmoud S A, et al. Effieient search and design procedures for robust multistage VQ of LPC parameters for 4 kb/s speech coding[J]. IEEE Trans on Speech and Audio Pro- cessing, 1993,1(4):373-385.

二级参考文献45

  • 1邹霞,陈亮,张雄伟.高质量鲁棒600BPS甚低速率语音编码算法[J].信号处理,2003,19(z1):109-112. 被引量:4
  • 2丛键,张知易.一种600bps极低速率语音编码算法[J].电子与信息学报,2007,29(2):429-433. 被引量:7
  • 3Chamberlain M W. A 600bps MELP vocoder for use on HF channels [C]//IEEE MILCOM. Mclean, VA, USA: IEEE Press, 2001:447-450.
  • 4Wang T, Koishida K,Cuperman V, et al. A 1200/ 2400bps coding suite based on MELP[C]//IEEE Workshop on Speech Coding. Tsukuba,Japan:IEEE Press,2002:90-92.
  • 5Roucos S,Wilgus A,Russell W. A segment vocoder algorithm for real-time implementation[C]//IEEEICASSP. Dallas, TX, USA: IEEE Press, 1987: 1949-1952.
  • 6Roucos S, Schwartz R, Makhoul J. A segment vocoder at 150B/S [C]//IEEE ICASSP. Boston, Mass, USA: IEEE Press, 1983:61-64.
  • 7Shiraki Y, Honda M. LPC speech coding based on variable-length segment quantization [ J]. IEEE Trans on ASSP, 1988,36 (9) :1437-1444.
  • 8闵刚 蒋永生 杨吉斌 等.分段声码器中的语音分段算法研究.信号处理,2007,23(4):119-122.
  • 9Chetan J V. Very low bit rate speech coding using segmentation [D]. Bombay:Indian Institute of Technology, 2005 : 22-23.
  • 10Paliwal K K,Atal B S. Efficient vector quantization of LPC parameters at 24bits/frame[J]. IEEE Trans on Speech and Audio Processing, 1993,1 (1) : 3-14.

共引文献5

同被引文献17

  • 1赵铭,崔慧娟,唐昆,杜文.谱包络参数的平滑算法[J].清华大学学报(自然科学版),2005,45(4):448-451. 被引量:5
  • 2李哗.低速率语音编码技术与算法研究[D].北京:清华大学,2009.
  • 3Tsao C, Gray R M. Matrix quantizer design for LPC speech using the generalized Lloyd algorithm [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985, 33(3): 537-545.
  • 4Zhao M, Tang K, Cui H. Mode-based quantization of LP parameters for very low bit rate vocoder [C]// International conference on Communications, Circuits and Systems and West Sino Expositions. Chengdu, China: IEEE Press, 2002 : 28 - 31.
  • 5Eriksson T, Linden J, Skoglund J. Intcrframe LSF quantization for noisy channels [J]. IEEE Transactions on Speech and Audio Processing, 1999, 7(5) : 495 - 509.
  • 6JIANG Hao, CUI Huijuan, TANG Kun. Sinusoidal excitation LPC vocoder [J]. Chinese Journal of Electronics, 1998, 7(3), 296-300.
  • 7Theodoridis S, Koutroumbas K. Pattern Recognition [M]. 3rd ED. Beiiing: China Machine Press, 2006.
  • 8何洪华.超低速率语音编码算法研究[D].北京:清华大学,2011.
  • 9刘张宇,鲍长春,邱建伟,徐昊.3GPP AMR-NB与ITU-T G.729A语音编码标准技术的对比研究[J].电声技术,2009,33(4):56-61. 被引量:2
  • 10李晔,彭坦,许明,计哲,崔慧娟,唐昆.带有帧间级间预测的线谱频率参数多级矢量量化[J].清华大学学报(自然科学版),2009(7):981-983. 被引量:9

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部