The Layered Coding of Speech and Audio Signals Based on Signal Warp and Sparse Transform

Cited by: 1
Abstract: Based on the inherent periodicity of speech and audio signals, this paper constructs a unified analysis/synthesis model suited to both signal types and uses it to realize high-quality layered coding of wideband speech and audio at bit rates of 24 kbps and 32 kbps. First, the input signal, whose period varies over time, is warped into a signal with a constant period, and a warped matrix is built from the warped periodic signal. Second, the modulated lapped transform (MLT) and the discrete cosine transform (DCT) are applied to the rows and columns of the warped matrix, respectively, to obtain a sparse representation. Finally, sub-band quantization and vector Huffman coding are used to quantize and encode the elements of the sparse matrix. Objective PESQ/PEAQ results and subjective A/B listening tests show that, for speech, audio, and mixed signals, the proposed coder achieves better coding quality than the ITU-T G.722.1 and AMR-WB codecs at the same bit rate.
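The first two steps of the abstract (warping to a constant period, then a two-dimensional transform of the warped matrix) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes the cycle boundaries (pitch marks) are already known, uses linear interpolation for the warping, and substitutes an orthonormal DCT for the MLT along the rows. The function names `dct_matrix`, `warp_to_matrix`, `sparsify`, and `energy_in_top`, the synthetic test signal, and the fixed period of 64 samples are all hypothetical choices made for illustration.

```python
import numpy as np

def dct_matrix(n):
    """Return the orthonormal DCT-II matrix of size n x n."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (m + 0.5) * k / n)
    c[0, :] /= np.sqrt(2.0)
    return c

def warp_to_matrix(signal, marks, period):
    """Resample each cycle [marks[i], marks[i+1]) to `period` samples.

    Assumption: `marks` holds known (possibly fractional) cycle boundaries.
    Each output row is one cycle, so all rows share the same length.
    """
    t = np.arange(len(signal))
    rows = []
    for start, end in zip(marks[:-1], marks[1:]):
        pos = start + (end - start) * np.arange(period) / period
        rows.append(np.interp(pos, t, signal))
    return np.array(rows)

def sparsify(matrix):
    """Apply a DCT within each cycle (rows) and across cycles (columns).

    Assumption: the row transform here is a DCT, standing in for the MLT.
    """
    n_rows, n_cols = matrix.shape
    return dct_matrix(n_rows) @ matrix @ dct_matrix(n_cols).T

def energy_in_top(coeffs, fraction):
    """Fraction of total energy held by the largest `fraction` of coefficients."""
    energy = np.sort(coeffs.ravel() ** 2)[::-1]
    k = max(1, int(fraction * energy.size))
    return energy[:k].sum() / energy.sum()

# Synthetic harmonic signal whose period drifts from 80 to 120 samples.
n = 4000
t = np.arange(n)
period_track = np.linspace(80.0, 120.0, n)
phase = np.cumsum(1.0 / period_track)
phase -= phase[0]
signal = sum(np.sin(2 * np.pi * h * phase) / h for h in (1, 2, 3))

# Cycle boundaries are the positions where the phase crosses an integer.
n_cycles = int(np.floor(phase[-1]))
marks = np.interp(np.arange(n_cycles + 1), phase, t)

period = 64
matrix = warp_to_matrix(signal, marks, period)
coeffs = sparsify(matrix)
compaction = energy_in_top(coeffs, 0.05)
print("energy in largest 5% of coefficients:", compaction)
```

Because every row of the warped matrix holds one cycle sampled at the same phases, the rows are nearly identical, and the transform across cycles concentrates most of the energy in a few coefficients. A real coder would still need a pitch tracker to find the cycle boundaries, the MLT in place of the row DCT, and the quantization and entropy-coding stages described in the abstract.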
Source: Acta Electronica Sinica (《电子学报》; indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2015, No. 7, pp. 1286-1293 (8 pages)
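Funding: National Natural Science Foundation of China (No. 61072089, No. 61201197); Beijing Municipal Education Commission Science and Technology Plan General Project (No. KM201310005008); Specialized Research Fund for the Doctoral Program of Higher Education, New Teacher Fund, Ministry of Education (No. 20121103120017); 12th Graduate Science and Technology Fund of Beijing University of Technology (No. ykj-2013-9563)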
Keywords: speech coding; audio coding; signal warping; sparse transform