期刊文献+

基于Gauss混合模型的清浊音恢复改进算法 被引量:1

Improved recovery algorithm for unvoiced/voiced parameters based on GMM
原文传递
导出
摘要 为提高子带清浊音(unvoiced/voiced,U/V)解码端恢复算法在不同能量电平下的鲁棒性,提出了一种改进型能量自适应U/V参数解码端恢复算法。通过跟踪长时能量的变化轨迹,在Gauss混合模型(Gaussian mixed model,GMM)下,用归一化的能量参数和线谱频率参数(line spec-tral frequency,LSF)对U/V参数的分布特性进行估计。测试结果表明:在较低的能量电平下,与用绝对能量对U/V参数进行恢复的算法相比,该能量自适应U/V参数恢复算法能够将清浊音误判率降低10%~25%,并将合成语音的平均意见得分(mean opinion score,MOS)提高0.03~0.09,改善了算法的性能。 The robustness of an unvoiced/voiced (U/V) speech classification recovery algorithm is improved by an energy self-adaption algorithm for the recovery of the U/V parameter. The algorithm traces the long-time changes of the energy level to estimate the statistical distribution of the U/V parameter from the normalized energy and the line spectral frequency (LSF) parameters based on the Gaussian mixed model (GMM). Tests show that for relatively low energy levels, this energy self-adaption algorithm reduces the U/V classification error rate by 10% - 25% and improves the mean opinion score (MOS) of the synthesized speech signal by about 0.03 - 0.09 compared to the original method which uses the absolute energy value.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2011年第11期1751-1755,共5页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目(60572081)
关键词 语音编码 Gauss混合模型 特征参数 线谱频率 清浊音参数 speech coding Gaussian mixed model (GMM) characteristic parameter line spectral frequency (LSF)unvoiced/voiced (U/V) parameter
  • 相关文献

参考文献11

  • 1Kondoz A M. Digital Speech: Coding for Low Bit Rate Communication Systems [M]. Chichester, UK: John Wiley & Sons, 2004.
  • 2Paliwal K K, Kleijn W B. Quantization of LPC parameters[C]//Speech Coding and Synthesis. Amsterdam, the Netherlands: Elsevier Science, 1995: 433- 466.
  • 3Paliwal K K, Atal B S. Efficient vector quantization of LPC parameters at 24 bits/frame [J]. IEEE Trans Speech Audio Processing, 1993, 1(1): 3-14.
  • 4魏旋,党晓妍,崔慧娟,唐昆.基于Gauss混合模型的清浊音解码端恢复算法[J].清华大学学报(自然科学版),2010,50(1):79-82. 被引量:4
  • 5洪侃,李晔,崔慧娟,唐昆.基于子带清浊音模式的声码器增益参数抗误码算法[J].清华大学学报(自然科学版),2008,48(10):1621-1624. 被引量:2
  • 6Ovens M J, Ponting K M, Turner M E, et al. Ultra low bit rate voice coding [C]// Speech Coding for Algorithms for Radio Channels. London, UK: IEE, 2000: 97- 111.
  • 7李晔.低速率语音编码技术与算法研究[D].北京:清华大学,2009.
  • 8Theodoridis S, Koutroumbas K. Pattern Recognition [M]. 4th Ed. London, UK: Academic Press, 2008.
  • 9李军林.低速率语音编码算法研究[D].北京:清华大学,2004.
  • 10Plante F, Meyer G F. A pitch extraction reference database [C]// European Conf on Speech Communication and Technology. Madrid, Spain, 1995:837-840.

二级参考文献13

  • 1Farvardin N. A study of vector quantization for noisy channels [J]. IEEE Trans Inform Theory, 1993, 39(3)I 799 - 809,
  • 2Farvardin N. On the performance and complexity of channel-optimized vector quantizers[J].IEEE Trans Inform Theory, 1991, 37(1) : 155 - 160.
  • 3De Marca J R B, Jayant N S. An algorithm for assigning binary indices to the code vectors of multi-dimensional quantizer[C]//IEEE Int Comm Conf Seattle. WA: IEEE, 1987: 1128- 1132.
  • 4Ovens M J, Ponting K M, Turner, M E. Ultra low bit rate voice coding [C] // Speech Coding for Algorithms :for Radio Channels, IEE Seminar, London, UK, 2000: 97- 111.
  • 5Wei X, Dang X, Cui H, et al. Voiced/unvoiced classification recovery in the speech decoder based on GMM [C]//ICSP, IEEE, 2008: 546-548.
  • 6McCree V, Barnwell T. A mixed excitation LPC vocoder model for low bit rate speech coding [J]. IEEE Trans on Speech Audio Processing, 1995, 3(4) : 242 - 250.
  • 7Deng H, O'Shaughnessy D. Voiced-unvoiced-silence speech sound classification based on unsupervised learning [C] // International Conf on Multimedia Expo. Beijing: IEEE, 2007: 176-179.
  • 8Theodoridis S, Koutroumbas K. Pattern Recognition (Third Edition) [M]. Beijing: China Machine Press, 2006.
  • 9Plante F, Meyer G F. A pitch extraction reference database [C] // European Conf on Speech Communication and Technology. Madrid, 1995 : 837 - 840.
  • 10李晔,洪侃,王童,崔慧娟,唐昆.声码器基音周期参数抗差错算法[J].清华大学学报(自然科学版),2008,48(1):82-84. 被引量:2

共引文献10

同被引文献12

  • 1赵铭,崔慧娟,唐昆,杜文.谱包络参数的平滑算法[J].清华大学学报(自然科学版),2005,45(4):448-451. 被引量:5
  • 2李哗.低速率语音编码技术与算法研究[D].北京:清华大学,2009.
  • 3Tsao C, Gray R M. Matrix quantizer design for LPC speech using the generalized Lloyd algorithm [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985, 33(3): 537-545.
  • 4Zhao M, Tang K, Cui H. Mode-based quantization of LP parameters for very low bit rate vocoder [C]// International conference on Communications, Circuits and Systems and West Sino Expositions. Chengdu, China: IEEE Press, 2002 : 28 - 31.
  • 5Eriksson T, Linden J, Skoglund J. Intcrframe LSF quantization for noisy channels [J]. IEEE Transactions on Speech and Audio Processing, 1999, 7(5) : 495 - 509.
  • 6JIANG Hao, CUI Huijuan, TANG Kun. Sinusoidal excitation LPC vocoder [J]. Chinese Journal of Electronics, 1998, 7(3), 296-300.
  • 7Theodoridis S, Koutroumbas K. Pattern Recognition [M]. 3rd ED. Beiiing: China Machine Press, 2006.
  • 8何洪华.超低速率语音编码算法研究[D].北京:清华大学,2011.
  • 9李晔,彭坦,许明,计哲,崔慧娟,唐昆.带有帧间级间预测的线谱频率参数多级矢量量化[J].清华大学学报(自然科学版),2009(7):981-983. 被引量:9
  • 10魏旋,党晓妍,崔慧娟,唐昆.基于Gauss混合模型的清浊音解码端恢复算法[J].清华大学学报(自然科学版),2010,50(1):79-82. 被引量:4

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部