期刊文献+

基于小波变换的2.4kbit/s波形内插语音编码算法 被引量:3

Waveform interpolation speech coding algorithm at 2.4kbit/s based on wavelet transform
下载PDF
导出
摘要 基于双正交小波滤波器组对波形内插编码中提取的特征波进行多级分解与重构,提出了一种基于小波变换(WT)的2.4kbit/s特征波形内插(CWI)语音编码算法。编码端去除了特征波对齐运算,并对幅度谱进行多级分解,相位谱不传输,鉴于小波变换对信号的压缩特性,仅传输对人耳感知起主要贡献的最后一级特征波幅度谱;解码端对各尺度空间采用单独重建的方法,相位信息在重构的末级与幅度谱结合,并由浊音度标志选择固定或随机相位。此外,根据语音信号的时变特性,由基于子帧的浊音度标志选择需要传输的幅度谱及量化模式。主观R-A/B测试表明,这种基于小波变换的2.4kbit/s编码算法的合成语音质量明显优于标准的2.4kbit/s的MELP编码器及FS1016的4.8kbit/sCELP编码器,亦优于3.8kbit/s的传统CWI编码框架下的合成语音效果。 A waveform interpolation speech coding algorithm at 2.4kbit/s based on wavelet transform was proposed, in which characteristic waveform decomposition and reconstruction is processed using biorthogonal wavelet filters. At the encoder, the alignment operation was removed and the amplitude spectrum is decomposed without phase spectrum. Because of the wavelet compression, only the amplitude spectrum of last layer is transmitted. At the decoder, different scale space is reconstructed separately and random or fixed phase was combined to the spectrum based on voice degree at the last. Additionally, based on the time-variable feature of speech, voiced degree flag of each subframe was introduced to indicate the spectrum selection and quantization. Subjective R-A/B listening tests indicated that the synthesis speech quality of the 2.4kbit/s WT-CWI coding algorithm is better than that of standard 2.4kbit/s MELP coder and FS1016 4.8kbit/s CELP coder and is better than traditional 3.8kbit/s CWI coder.
出处 《通信学报》 EI CSCD 北大核心 2007年第5期43-48,共6页 Journal on Communications
关键词 语音编码 小波变换 波形内插 特征波形分解 speech coding wavelet transform waveform interpolation characteristic waveform decomposition
  • 相关文献

参考文献7

  • 1KLEIJIN W B,HAAGEN J.Waveform Interpolation for Coding and Synthesis.Speech Coding and Synthesis[M].Elsevier Science,1995.175-207.
  • 2EDDIE L,CHOY T.Waveform Interpolation Speech Coder at 4kbit/s[D].McGill University,Montreal,Canada,1998.
  • 3CHONG N R,BURNET I S,CHICHARO J F,et al.Use of the pitch synchronous wavelet transform as a new decomposition method for WI[A].IEEE ICASSP'98[C].Seattle,1998.512-516.
  • 4CHONG N R,BURNETT I S,CHICHARO J E Low-delay multi-level decomposition and quantization techniques for WI coding[A].IEEE ICASSP'99[C].Phoenix,1999.241-244.
  • 5CHONG N R,BUMETT I S,CHICHARO J E A new waveform interpolation coding scheme based on pitch synchronous wavelet transform decomposition[J].IEEE Trans Speech and Audio Proc,2000,8(3):345-348.
  • 6BASU S,CHIANG C H,CHOI H M.Wavelets and perfect reconstruction subband coding with causal stable ⅡR filters[J].IEEE Trans Circuits and Systems Ⅱ,1995,42(1):24-38.
  • 7NAYEBI K,BARNWELL T P,SMITH M J T.Low delay fIR filter banks:design and evaluation[J].IEEE Trans Sig Proc,1994,42(1):24-31.

同被引文献33

  • 1李靓,鲍长春,白燕宁.一种高效、低存储的线谱频率参数矢量量化器[J].北京工业大学学报,2005,31(2):130-135. 被引量:5
  • 2王贵平,鲍长春,张鹏.基于奇异值分解的低速率波形内插语音编码算法[J].电子学报,2006,34(1):135-140. 被引量:13
  • 3齐峰岩,鲍长春.波形内插语音编码中特征波形表达和对齐快速算法[J].北京工业大学学报,2006,32(6):514-519. 被引量:3
  • 4Kleijn W B.A speech coder based on decomposition of characteristic waveforms[C].IEEE ICASSP'95,Detroit,1995:508-511.
  • 5鲍长春.数字语音编码原理[M].西安:西安电子科技大学出版社,2007,第9章.
  • 6Kleijn W B,Shoham Y,and Sen D,et al..A low-complexity waveform interpolation coder[C].IEEE ICASSP'96,Atlanta,1996:212-215.
  • 7Supplee L M,Cohn R P,and Collura J S,et al..MELP:thenew Federal Standard at 2400 bps[C].IEEE ICASSP'97,Munich,1997:1591-1594.
  • 8McCree A V and Barnwell T P.A mixed excitation LPC vocoder model for low bit rate speech coding[J].IEEE Transactions on Speech and Audio Processing,1995,3(4):242-250.
  • 9Gottesman O and Gersho A.Enhanced waveform interpolative coding at 4 kbps[C].IEEE Workshop on Speech Coding Proceedings,Haikko Manor Porvoo,1999:90-92.
  • 10Federal Information Processing Standards Publication.Specifications for the analog to digital conversion of voice by 2400 bit/second mixed excitation linear prediction[S],1998.

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部