期刊文献+

高质量的0.6kb/s声码器算法 被引量:23

High quality 0.6 kb/s speech coding algorithm
原文传递
导出
摘要 为满足语音信息存贮和交流对极低速率下语音压缩编码的需求 ,提出了一种 0 .6 kb/ s声码器算法。此算法基于线性预测正弦激励模型 ,在极低码率下获得高质量的合成语音 ,提出清浊音定位和量化方法 ,应用了多帧参数联合矢量量化技术 ,以及多带正弦混合激励、谱增强等技术。主观听觉测试显示 ,在 0 .6 kb/ s的速率下 ,此声码器合成语音不仅具有高可懂度而且具有一定的自然度 ,诊断押韵测试 (DRT)的分数为 89.5 % ,而且在 10 - 2的随机误码的信道条件下仍然具有很好的可懂度。实验表明 A 0.6 kb/s high quality vocoder was developed to encode phonetic information at very low bit rates. The algorithm is based on a sinusoidally excited linear prediction model and uses multi frame joint vector quantification, multi band mixing excitation, sub band voicing strength parameter prediction, and adaptive spectral enhancement to obtain high quality synthetic speech with a low bit rate. Simulation results show that the synthesized speech is intelligible with reasonable naturalness. The diagnostic rhyme test score was 89.5% in the formal test. The vocoder is robust in a noisy environment and is still intelligible with a bit error rate of 10 -2 . The results suggest that the use of relative frame parameters and vector qualitification can greatly reduce the bit rate while maintaining clarity.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2003年第4期449-452,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目 ( 69972 0 2 0 )
关键词 声码器 线性预测 矢量量化 混合正弦激励 谱增强 语音压缩编码 语音合成 vocoder linear prediction vector quantification mixed sinusoidal excitation
  • 相关文献

参考文献6

  • 1Kleijin W. A frame interpretation of sinusoidal coding and waveform interpolation [A]. IEEE Inter Conf Acoustics,Speech and Signal Processing ICASSP-2000 [C]. Istambul,Turkey: IEEE Press, 2000. 1475 - 1478.
  • 2Jamrozik M, Gowdy J. Modified multiband excitation model at 2 400 bps [A]. IEEE ICASSP 1997 [C]. Munich,Germany: IEEE Press, 1997. 1603 - 1606.
  • 3McCree A V, Barnwell Ⅲ T P. A mixed excitation LPC vocoder model for low bit rate speech coding [J]. IEEE Trans on Speech and Audio Processing, 1995,3(4):242 - 250.
  • 4LeBlanc W P, Bhattacharya B, Mahmoud S A, et al.Efficient search and design procedures for robust multi-stage VQ of LPC parameters for 4 kb/s speech coding [J]. IEEE Trans on Speech Audio Processing, 1993, 1(4): 373- 385.
  • 5WANG Tian, Koishida K, Cuperman V, et al. A 1 200 bps speech coder based on MELP [A]. IEEE Inter Conf Acoustics, Speech and Signal Processing ICASSP-2000 [C].Istambul, Turkey, IEEE Press, 2000. 1375 - 1378.
  • 6Farvardin N. A study of vector quantization for noise channels [J]. IEEE Trans Inform Theory, 1993, 39(3):799 - 809.

同被引文献120

引证文献23

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部