期刊文献+

AMR-WB到AMR转码中合成滤波器转换算法

Transformation algorithms of synthesis filter in transcoding from AMR-WB to AMR
下载PDF
导出
摘要 提出AMR-WB到AMR转码中的2种合成滤波器转换算法.第1种是基于采样率转换和Prony算法的转换,首先将AMR-WB合成滤波器的单位采样响应进行采样率转换,然后根据最小二乘法,使得新的滤波器的单位采样响应和采样率转换后的响应的误差最小化.第2种是基于自相关值内插的转换算法,首先由AMR-WB语音的LPC参数倒推出自相关,然后采用三次样条内插出AMR语音的自相关,最后利用Levinson-Durbin算法计算LPC参数,即得到解码端的合成滤波器.算法复杂度分析表明,2种算法的计算复杂度都低于Tandem转码.实验结果表明,2种算法都可以得到比较小的谱失真.第2种算法的谱失真在浊音帧比第1种算法略大,在清音帧谱失真有时较大,但是由于清音激励的随机性,对合成清音质量影响不大. Two translation algorithms of synthesis filter are presented in the transcoding from adaptive multi-rate wideband(AMR-WB) to adaptive multi-rate (AMR).The first one is based on the conversion of sampling rate and Prony algorithm: the sampling rate of the unit sampling response of AMR-WB synthesis filter is converted,and the error between the unit sampling response of the translated synthesis filter and the synthesis filter whose sampling rate has been converted is minimized according to the least square method.The second one is based on the interpolation of autocorrelations: the autocorrelations are deduced from the LPC parameters of AMR-WB;the autocorrelations of AMR speech are obtained through cubic spline interpolation;finally,the LPC parameters of AMR are computed through Levinson-Durbin algorithm.Complexity analysis indicates that,compared to tandem transcoding,the computational complexity of these two algorithms is lower.Experimental results show that,the spectral distortion(SD) of these two algorithms is small,but for voiced frames,it is a little larger in the second algorithm than that in the first algorithm.For unvoiced frames,the SD in the second algorithm is sometimes high,but it has little effect on the synthesized speech due to the randomicity of excitation in unvoiced speech.
出处 《东南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2010年第4期676-681,共6页 Journal of Southeast University:Natural Science Edition
基金 国家自然科学基金资助项目(60971098)
关键词 转码 合成滤波器 PRONY算法 自相关 三次样条内插 transcoding synthesis filter Prony algorithm autocorrelation cubic spline interpolation
  • 相关文献

参考文献10

  • 13rd Generation Partnership Project (3GPP).TS 26.190,V.8.0.0.adaptive multi-rate-wideband (AMR-WB) speech codec[S].Valbonne,France:3GPP,2008.
  • 23rd Generation Partnership Project (3GPP).3G TS 26.093 AMR speech codec:source controlled rate operation[S].Valbonne,France:3GPP,2000.
  • 3Choi Jin-Kyu,Lee Chang-Heon,Kang Hong-Goo,et al.Improvement issues on transcoding algrithms:for the flexible usage to the various pairs of speech codec[C]//IEEE International Conference on Acoustic,Speech,and Signal Processing.Quebec,Canada,2004:I-269-I-272.
  • 4Kwon Goo-Rak,Chung Ji-Min,Nam Sang-Jae,et al.A novel transcoding technique between EVRC and G.729A for mobile multimedia devices[J].IEEE Transactions on Consumer Electronics,2007,53(3):885-890.
  • 5Christophe Beaugeant.Smart transcoding between CELP speech codecs through voiced oriented pitch mapping[C]//IEEE 9th Workshop on Multimedia Signal Processing.Crete,Greece,2007:155-158.
  • 6尤红岩,王仕奎,周琳,吴镇扬.AMR与G.729之间的转码算法[J].东南大学学报(自然科学版),2009,39(5):894-899. 被引量:1
  • 7Erkelens J S,Broersen P M T.On the statistical properties of line spectrum pairs[C]//IEEE International Conference on Acoustics,Speech,and Signal Processing.Detroit,MI,USA,1995,1:768-771.
  • 8Soong Frank K,Juang Biing-Huang.Line spectrum pair and speech data compression[C]//IEEE International Conference on Acoustics,Speech,and Signal Processing.San Diego,CA,USA,1984:37-40.
  • 9Paliwal K K,Atal B S.Efficient vector quantization of LPC parameters at 24 bits-frame[J].IEEE Transactions on Speech and Audio Processing,1993,1(1):3-14.
  • 103rd Generation Partnership Project 2 (3GPP2).C.S0052-A v1.0 Source-controlled variable-rate multimode wideband speech codec(vmr-wb),service options 62 and 63 for spread spectrum systems[S].Helsinki,Finland:Nokia Inc.,2005.

二级参考文献10

  • 1Kang H G, Kim H K, Cox R V. Improving the transcoding capability of speech coders [J]. IEEE Transactions on Multimedia, 2003,5( 1 ) : 24- 33.
  • 2ITU-T. Recommendation G. 729 coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear prediction (CS-ACELP) [ S]. 1996.
  • 3Kwon G R, Chung J M, Nam S J, et al. A novel transcoding technique between EVRC and G. 729A for mobile multimedia devices [J].IEEE Transactions on Consumer Electronics, 2007,53 ( 3 ) : 885 - 890.
  • 4Beaugeant C. Smart transcoding between CELP speech codecs through voiced oriented pitch mapping [ C ]// IEEE 9th International Workshop on Multimedia Signal Processing. Alexandropolis, Greece, 2007 : 155 - 158.
  • 53GPP. 3G TS 26. 093 AMR speech codec : source controlled rate operation [S ]. 2000.
  • 63GPP. 3G TS 26. 104 ANSI-C code for the floatingpoint AMR speech codec [S].2004.
  • 7ITU-T. Recommendation P. 800 Methods for subjective determination of transmission quality [S]. 1996.
  • 8ITU-T. Recommendation P. 862 Perceptual evaluation of speech quality (PESQ): an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs [ S]. 2001.
  • 9Holub J, Blaskova L. Transcoded speech contemporary objective quality measurements reliability [C ]//The 7th Annual Wireless Telecommunications Symposium. Pomona, California,USA, 2008 : 106 - 109.
  • 10鲍长春.数字语音编码原理[M].西安:西安电子科技大学出版社,2007.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部