
Waveform Interpolation Speech Coding Algorithm at Low Bit Rates Based on Lifting Wavelet Decomposition
Abstract: A low-bit-rate characteristic waveform interpolation (CWI) speech coding method based on the bi-orthogonal lifting wavelet transform (BLWT) is proposed. The BLWT-based characteristic waveform (CW) decomposition requires neither the complicated CW alignment operation nor filter convolution. Its inherent in-place computation reduces the memory required by the traditional wavelet-based CW decomposition, and replacing samples of adjacent frames with boundary samples of the current frame reduces the algorithmic delay of the traditional method. Each decomposed component of the CW is reconstructed independently, and the quantization parameters are selected according to the perceptual characteristics of the human ear. Based on this decomposition, two BLWT-CWI speech codecs are built, one at 1.84 kb/s and one at 2.32 kb/s. Mean opinion score (MOS) results show that the 2.32 kb/s BLWT-CWI codec is comparable in quality to the 2.4 kb/s MELP vocoder, while the 1.84 kb/s BLWT-CWI codec is slightly inferior to the 2.4 kb/s MELP vocoder. Subjective A/B listening tests show that the 1.84 kb/s BLWT-CWI codec performs better than the 2 kb/s LIWI (low-complex improved waveform interpolation) vocoder.
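The in-place computation and the boundary substitution mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which bi-orthogonal wavelet the paper uses, so the sketch assumes the CDF 5/3 (LeGall) lifting steps; the function names `lifting_forward` and `lifting_inverse` are hypothetical, and mirroring at the frame edges is only one plausible reading of "boundary samples of the current frame replace samples of adjacent frames".

```python
def lifting_forward(x):
    """One level of in-place CDF 5/3 lifting on an even-length list.

    Assumption: CDF 5/3 is used here for illustration only; the paper's
    actual bi-orthogonal wavelet is not stated in the abstract.
    After the call, even indices hold low-pass (approximation) samples
    and odd indices hold high-pass (detail) samples.
    """
    n = len(x)
    if n < 2 or n % 2:
        raise ValueError("expected an even number of samples (at least 2)")
    # Predict: subtract from each odd sample the mean of its even neighbours.
    for i in range(1, n, 2):
        # At the right edge, reuse a current-frame sample instead of
        # waiting for the first sample of the next frame.
        right = x[i + 1] if i + 1 < n else x[i - 1]
        x[i] -= 0.5 * (x[i - 1] + right)
    # Update: add to each even sample a quarter of its neighbouring details.
    for i in range(0, n, 2):
        # At the left edge, reuse a current-frame detail instead of
        # one from the previous frame.
        left = x[i - 1] if i > 0 else x[i + 1]
        x[i] += 0.25 * (left + x[i + 1])
    return x


def lifting_inverse(x):
    """Undo lifting_forward in place by reversing the steps and the signs."""
    n = len(x)
    if n < 2 or n % 2:
        raise ValueError("expected an even number of samples (at least 2)")
    for i in range(0, n, 2):
        left = x[i - 1] if i > 0 else x[i + 1]
        x[i] -= 0.25 * (left + x[i + 1])
    for i in range(1, n, 2):
        right = x[i + 1] if i + 1 < n else x[i - 1]
        x[i] += 0.5 * (x[i - 1] + right)
    return x


frame = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 7.0]
coeffs = lifting_forward(list(frame))
restored = lifting_inverse(list(coeffs))
```

Each step overwrites the input buffer using only additions and scalings, so no filter convolution and no second buffer are needed, and the inverse simply runs the same steps backwards with opposite signs. Because the edge samples are taken from the current frame, the transform of a frame does not have to wait for its neighbours.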
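Source: Journal of Beijing University of Technology (《北京工业大学学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2011, No. 12, pp. 1779-1785 (7 pages)
Funding: National Natural Science Foundation of China (60372063); Beijing Natural Science Foundation (4042009); Science and Technology Development Program of the Beijing Municipal Education Commission (KM200710005001); Beijing Natural Science Foundation (KZ201110005005); Doctoral Start-up Fund of Beijing University of Technology (X0002012201103, X0002012201102)
Keywords: speech coding; wavelet transform; lifting wavelet; characteristic waveform decomposition; characteristic waveform interpolation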