SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL

Abstract: This work is concerned with the development and optimization of a signal model for scalable perceptual audio coding at low bit rates. A complementary two-part signal model consisting of Sines plus Noise (SN) is described. The paper presents a fundamental enhancement to the sinusoidal modeling component: overlap-add sinusoidal modeling is carried out at three successive time scales, large, medium, and small. The modeling is done in an analysis-by-synthesis overlap-add manner across the three scales using psychoacoustically weighted matching pursuits. The sinusoidal modeling residual at the first scale is passed to the smaller scales to allow for the modeling of various signal features at appropriate resolutions. This approach helps to correct the pre-echo inherent in the sinusoidal model, and it improves perceptual audio quality over our previous sinusoidal modeling work while using the same number of sinusoids. The most obvious applications for the SN model are scalable, high-fidelity audio coding and signal modification.
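The residual cascade described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain (unweighted) matching pursuit over a dictionary of bin-centered sinusoids, processes non-overlapping rectangular frames, and omits the psychoacoustic weighting, the overlap-add windows, and the noise component. The frame lengths (256, 64, 16), the number of sinusoids per frame, and all function names are assumptions chosen for illustration.

```python
import math

def energy(x):
    """Sum of squared samples."""
    return sum(v * v for v in x)

def best_sinusoid(frame):
    """Find the bin-centered sinusoid a*cos + b*sin that removes the most energy."""
    n = len(frame)
    best = (0.0, 0, 0.0, 0.0)  # (energy removed, bin, cos amplitude, sin amplitude)
    for k in range(1, n // 2):
        w = 2.0 * math.pi * k / n
        c = sum(x * math.cos(w * i) for i, x in enumerate(frame))
        s = sum(x * math.sin(w * i) for i, x in enumerate(frame))
        # For 0 < k < n/2 the cos and sin atoms are orthogonal, each with energy n/2.
        a, b = 2.0 * c / n, 2.0 * s / n
        gain = (a * a + b * b) * n / 2.0
        if gain > best[0]:
            best = (gain, k, a, b)
    return best

def pursue_frame(frame, n_sines):
    """Greedy matching pursuit: extract up to n_sines sinusoids from one frame."""
    residual = list(frame)
    n = len(residual)
    atoms = []
    for _ in range(n_sines):
        gain, k, a, b = best_sinusoid(residual)
        if gain <= 0.0:
            break
        w = 2.0 * math.pi * k / n
        residual = [x - a * math.cos(w * i) - b * math.sin(w * i)
                    for i, x in enumerate(residual)]
        atoms.append((k, a, b))
    return atoms, residual

def three_scale_model(signal, frame_lens=(256, 64, 16), n_sines=2):
    """Model the signal at the large scale, then pass the residual to smaller scales."""
    residual = list(signal)
    model = []
    energies = [energy(residual)]
    for n in frame_lens:
        for start in range(0, len(residual) - n + 1, n):
            atoms, res = pursue_frame(residual[start:start + n], n_sines)
            residual[start:start + n] = res
            model.extend((n, start, k, a, b) for k, a, b in atoms)
        energies.append(energy(residual))
    return model, residual, energies

# Toy input: a steady tone plus a short burst that a long frame resolves poorly.
signal = [math.sin(2.0 * math.pi * 8 * i / 256) for i in range(256)]
for i in range(96, 112):
    signal[i] += 0.5 * math.sin(2.0 * math.pi * 4 * (i - 96) / 16)

model, residual, energies = three_scale_model(signal)
print("residual energy per stage:", energies)
```

In this toy setup the long frame captures the steady tone, while the short burst is left largely to the medium and small frames, which localize it in time. That is the mechanism the abstract credits with reducing pre-echo.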
Source: Journal of Electronics (China) (电子科学学刊(英文版)), 2004, No. 3, pp. 213-221 (9 pages)
Funding: Supported by the National Natural Science Foundation of China (No. 69802007), Motorola China Research Center (No. B38300), and the Natural Science Foundation of Guangdong (No. 011611)
Keywords: Multiresolution sinusoidal modeling; Parametric audio coding; Low-rate audio coding; Signal modifications

