SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL

SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL

下载PDF

导出

摘要 This work is concerned with the development and optimization of a signal model for scalable perceptual audio coding at low bit rates. A complementary two-part signal model consisting of Sines plus Noise (SN) is described. The paper presents essentially a fundamental enhancement to the sinusoidal modeling component. The enhancement involves an audio signal scheme based on carrying out overlap-add sinusoidal modeling at three successive time scales, large, medium, and small. The sinusoidal modeling is done in an analysis-by-synthesis overlap- add manner across the three scales by using a psychoacoustically weighted matching pursuits. The sinusoidal modeling residual at the first scale is passed to the smaller scales to allow for the modeling of various signal features at appropriate resolutions.This approach greatly helps to correct the pre-echo inherent in the sinusoidal model. This improves the perceptual audio quality upon our previous work of sinusoidal modeling while using tile same number of sinusoids. Tile most obvious application for the SN model is in scalable, high fidelity audio coding and signal modification. This work is concerned with the development and optimization of a signal model for scalable perceptual audio coding at low bit rates. A complementary two-part signal model consisting of Sines plus Noise (SN) is described. The paper presents essentially a fundamental enhancement to the sinusoidal modeling component. The enhancement involves an audio signal scheme based on carrying out overlap-add sinusoidal modeling at three successive time scales,large, medium, and small. The sinusoidal modeling is done in an analysis-by-synthesis overlapadd manner across the three scales by using a psychoacoustically weighted matching pursuits.The sinusoidal modeling residual at the first scale is passed to the smaller scales to allow for the modeling of various signal features at appropriate resolutions. This approach greatly helps to correct the pre-echo inherent in the sinusoidal model. This improves the perceptual audio quality upon our previous work of sinusoidal modeling while using the same number of sinusoids. The most obvious application for the SN model is in scalable, high fidelity audio coding and signal modification.

作者 Al-Moussawy Raed

机构地区 Gollege of Electronic and Information Eng.

出处《Journal of Electronics(China)》 2004年第3期213-221,共9页 电子科学学刊（英文版）

基金 Supported by the National Natural Science Foundation of China(No.69802007) Motorola China Research Center(No.B38300) Natural Science Foundation of Guangdong(No.011611)

关键词 Multiresolution sinusoidal modeling Parametric audio coding Low-rate audio coding Signal modifications 正弦多解模型音频参数编码低速率编码 Signal 信号修改

分类号 TN912.32 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献5

1M. Goodwin.Matching pursuit with damped sinusoids, In Proc[].IEEE ICASSP M(?)nich.1997
2D. Ellis,B. Vercoe.A wavelet-based sinusoidal model of sound for auditory signal separation, In Proc.Int. Comp. Mus. Conf[].Montreal.1991
3M. Goodwin.Residual modeling in music analysis/synthesis, In Proc[].IEEE ICASSP Atlanta.1996
4X. Rodet,P. Depalle,Spectral envelopes and inverse FFT synthesis,In Proc.of the 93rd AES Conv[].San Francisco.1992
5M. Goodwin.Multiresolution sinusoidal modeling using adaptive segmentation, In Proc[].IEEE ICASSPSeattle.1998

1AL-MoussawyRaed,YINJunxun,HUANGJiancheng.A Perceptual Audio Representation for Low Rate Coding Based on Sines＋Noise Modeling[J].Chinese Journal of Electronics,2003,12(3):354-357.
2王晶,晋艳伟,赵胜辉,匡镜明.Bark-Band Residual Noise Model for Parametric Audio Coding[J].Journal of Beijing Institute of Technology,2004,13(S1):1-6.
3Gu Weiqing (Network Division of ZTE Corporation, Nanjing 210012, China).ZTE's Softswitch-Based Reconstruction and Optimization Solution for Evolution of PSTN to NGN[J].ZTE Communications,2005,3(2):46-50.
4褚为利,朱阳军,张杰,胡爱斌.SPTC^+-IGBT characteristics and optimization[J].Journal of Semiconductors,2013,34(1):45-48.
5华国刚,戴蓓倩.滤波器的相似度及其在基于分析-合成语音编码中的应用[J].信号处理,2001,17(6):558-562. 被引量：2
6Zhu, Xiaoguang, Hong, Bingrong, Wang, Dongmu.Implementation of Time-Scale Transformation Based on Continuous Wavelet Theory[J].Journal of Systems Engineering and Electronics,2000,11(1):32-37. 被引量：2
7余卫宇,田菁.Rate-oriented perceptual image coding using contrast-based quantization[J].Chinese Optics Letters,2010,8(4):381-383.
8谭建国,Zhang,Wenjun,LiuPeilin.Quantization of wavelet packet audio coding[J].High Technology Letters,2006,12(3):295-299.
9Yu-tang ZHU,Yong-bo ZHAO,Jun LIU,Peng-lang SHUI.Low complexity robust adaptive beamforming for general-rank signal model with positive semidefinite constraint[J].Frontiers of Information Technology & Electronic Engineering,2016,17(11):1245-1252.
10Zhou Ying Zhang Linghua.AN IMPROVED ALGORITHM OF GMM VOICE CONVERSION SYSTEM BASED ON CHANGING THE TIME-SCALE[J].Journal of Electronics(China),2011,28(4):518-523.

<12 >

Journal of Electronics(China)

2004年第3期

SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL

参考文献5

相关作者

相关机构

相关主题

SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL

参考文献5

相关作者

相关机构

相关主题

微信扫一扫：分享