基于自适应加权谱内插的宽带语音编码算法

Wideband Speech Coding Algorithm Based on Adaptive Interpolation of Weighted Spectrum

下载PDF

导出

摘要提出了一种基于自适应加权谱内插 ( STRAIGHT)的宽带语音编码算法。输入的语音信号首先经过STRAIGHT分析得到精确的基频参数和谱参数 ,然后通过时域抽取和频域建模实现有效的编码压缩。在时域抽取时采用的区别于传统编码算法固定帧长的自适应可变帧长方法 ,使得编码存储量可以根据实际语音变化情况得到更加合理的分配。主观测听结果表明 ,该算法针对 1 6k Hz采样的语音信号 ,在 6kbps码率上可以取得与AMR-WB( G.72 2 .2 )在 8.85 kbps时的相当的音质效果。此外 ,该算法还具有对恢复语音的时长。 Based on speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT), a wideband speech coding algorithm is presented. The input speech signals are firstly decomposed into pitch parameters and spectral parameters by STRAIGHT,and then compressed effectively by sampling in temporal domain and modeling in frequency domain. Because of the introduction of adaptive sampling with variable frame lengths, the bitrates can be more reasonably allocated accoding to the actural movement of speech signals. Subjective listening test demonstrates that the decoded quality of proposed algorithm at 6 kbps for 16 kHz sampled speech signal corresponds to that of AMR-WB(G.722.2) at 8.85 kbps. Besides, the method has flexible modification ability on duration, pitch and spectrum of decoded speech. So it can be widely applied in the fields, such as speech synthesis with parametric modification, voice conversion and so on.

作者凌震华戴礼荣王仁华双志伟周斌

机构地区中国科学技术大学电子工程与信息科学系

出处《数据采集与处理》 CSCD 北大核心 2005年第1期28-33,共6页 Journal of Data Acquisition and Processing

关键词语音编码算法语音信号宽带内插自适应抽取码率加权编码压缩存储量 STRAIGHT paramatric speech coding all-pole model adaptive sampling

分类号 TP391 [自动化与计算机技术—计算机应用技术] TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献6

1Hamon C, Moulines E, Charpentier F. A diphone synthesis system based on time domain prosodic modifications of speech[A]. Proc IEEE ICASSP[C]. 1989.238～242.
2Kawahara H, Masuda-Katsuse I, Cheveignè A. Restructuring speech representations using a pitch adaptive time frequency smoothing and a instantaneous frequency based F0 extraction: possible role of a repetitive structure in sound[J]. Speech Communication, 1999, 27(3):187～207.
3Kawahara H. Speech representation and transformation using adaptive interpolation of weighted spectrum: vecoder revisited[A]. Proc IEEE ICASSP[C].1997.1303～1306.
4Kawahara H.STRAIGHT-TEMPO:a universal tool to manipulate linguistic and para-linguistic speech information[A]. Proc IEEE ICSMC[C]. 1997.1620～1625.
5MacAulay R J, Quatieri T F. Sinusoidal coding speech coding and synthesis[M]. Elsevier: Amsterdam, 1995.151～157.
6Paliwal K, Atal S. Efficient vector quantization of LPC parameters at 24 bits/frame[J]. IEEE Trans Speech and Audio Processing, 1993,1(1):3～7.

1马鸿飞,樊昌信,宋国乡.基于M-频带小波变换的宽带语音编码算法[J].通信学报,1998,19(6):20-25. 被引量：2
2康桂霞,林辉,王婷,张平.第三代移动通信系统中通用高速维特比译码器的设计与实现[J].电子学报,2000,28(z1):152-154. 被引量：3
3石海,毛哲.基于DSP实现RFID实时信号频谱分析[J].武汉工业学院学报,2008,27(3):69-72. 被引量：2
4吕声,尹俊勋.一种适应移动设备的CELP宽带语音编码算法[J].移动通信,2003,27(11B):61-64. 被引量：1
5陈亮,郑国宏,杨思祥.宽带语音编码技术专题讲座(一) 第1讲宽带语音编码算法发展概述[J].军事通信技术,2011,32(2):87-91.
6王晓晨,姜林.一种基于高斯混合模型的导谱频率参数量化算法[J].电视技术,2014,38(15):185-188.
7应娜,赵晓晖.一种基于正弦模型的变码率低速率宽带语音编码算法[J].吉林大学学报（工学版）,2005,35(4):403-408. 被引量：1
8郭小莉,黄钉劲,阮照军.MATLAB辅助DSP实现基2时域抽取法FFT[J].机电产品开发与创新,2008,21(5):134-135.
9林奕琳,李巧玲,李江源,韦岗.AMR-WB语音编码算法及仿真[J].计算机工程与应用,2003,39(29):67-69.
10KB／S和Kbps的区别[J].中学生电脑,2004(9):51-51.

数据采集与处理

2005年第1期

浏览历史

内容加载中请稍等...

基于自适应加权谱内插的宽带语音编码算法

参考文献6

相关作者

相关机构

相关主题

浏览历史