Transformation algorithms of synthesis filter in transcoding from AMR-WB to AMR

Abstract: Two algorithms are proposed for converting the synthesis filter when transcoding from adaptive multi-rate wideband (AMR-WB) to adaptive multi-rate (AMR). The first is based on sampling-rate conversion and the Prony algorithm: the unit sample response of the AMR-WB synthesis filter is converted to the new sampling rate, and a least-squares fit then minimizes the error between the unit sample response of the new filter and the rate-converted response. The second is based on interpolation of autocorrelation values: the autocorrelation values are derived backward from the LPC parameters of the AMR-WB speech, the autocorrelation values of the AMR speech are obtained by cubic spline interpolation, and the Levinson-Durbin algorithm then computes the LPC parameters, which give the synthesis filter at the decoder. Complexity analysis shows that both algorithms have lower computational complexity than tandem transcoding. Experimental results show that both algorithms achieve small spectral distortion (SD). For voiced frames, the SD of the second algorithm is slightly larger than that of the first. For unvoiced frames, the SD of the second algorithm is sometimes large, but because the excitation of unvoiced speech is random, this has little effect on the quality of the synthesized unvoiced speech.
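The second algorithm starts by recovering autocorrelation values from LPC coefficients and ends with a Levinson-Durbin recursion. The sketch below illustrates only those two steps in plain Python and is not the authors' implementation. It assumes the sign convention A(z) = 1 + a1 z^-1 + ... + ap z^-p for the inverse filter and a stable synthesis filter 1/A(z). The function names `lpc_to_autocorr` and `levinson_durbin` are hypothetical, and the cubic spline interpolation that sits between the two steps in the abstract is omitted.

```python
def lpc_to_autocorr(a):
    """Recover normalized autocorrelation r[0..p] (r[0] = 1) from LPC
    coefficients a = [1, a1, ..., ap] of A(z) = 1 + sum a_k z^-k."""
    p = len(a) - 1
    polys = [None] * (p + 1)
    polys[p] = list(a)
    # Step-down recursion: the last coefficient of each order is its
    # reflection coefficient; removing it yields the next lower order.
    for m in range(p, 0, -1):
        cur = polys[m]
        k = cur[m]
        denom = 1.0 - k * k  # nonzero when 1/A(z) is stable (|k| < 1)
        polys[m - 1] = [1.0] + [(cur[i] - k * cur[m - i]) / denom
                                for i in range(1, m)]
    # The order-m normal equation gives lag m from the lower lags.
    r = [1.0]
    for m in range(1, p + 1):
        r.append(-sum(polys[m][i] * r[m - i] for i in range(1, m + 1)))
    return r


def levinson_durbin(r, order):
    """Solve the normal equations for a = [1, a1, ..., a_order].
    Returns (a, err), where err is the final prediction error energy."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        acc = r[m] + sum(a[i] * r[m - i] for i in range(1, m))
        k = -acc / err  # reflection coefficient of order m
        prev = a[:]
        for i in range(1, m):
            a[i] = prev[i] + k * prev[m - i]
        a[m] = k
        err *= 1.0 - k * k
    return a, err


if __name__ == "__main__":
    # Illustrative 2nd-order coefficients, not taken from either codec.
    a_in = [1.0, -1.2, 0.5]
    r = lpc_to_autocorr(a_in)
    a_out, err = levinson_durbin(r, len(a_in) - 1)
    print("autocorrelation:", r)
    print("recovered LPC:", a_out, "error energy:", err)
```

Without the interpolation step the two functions are inverses of each other, so the round trip should return the input coefficients up to rounding. In a transcoder following the abstract, the autocorrelation sequence would be resampled to the target rate's lag spacing between the two calls.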

Authors: Wang Shikui (王仕奎), Cai Weiping (蔡卫平), Yang Zhihong (杨志鸿), Wu Zhenyang (吴镇扬)

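Affiliations: School of Information Science and Engineering, Southeast University; College of Physics and Electronic Information, Anhui Normal University; Key Laboratory of Underwater Acoustic Signal Processing of the Ministry of Education (Class B, in preparation), Southeast University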

Source: Journal of Southeast University (Natural Science Edition), 2010, No. 4, pp. 676-681 (6 pages). Indexed in: EI, CAS, CSCD, Peking University Core Journals.

Funding: National Natural Science Foundation of China (60971098)

Keywords: transcoding; synthesis filter; Prony algorithm; autocorrelation; cubic spline interpolation

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]


