期刊文献+

参数音频编码回顾 被引量:2

The Review of Parametric Audio Coding
下载PDF
导出
摘要 音频编码主要有两类技术:波形编码,参数编码。前者适合高速率高质量的应用环境,后者适合带宽受限或存储容量受限的应用或环境。参数音频编码以源模型表示信号,运用基于心理声学原理的参数估计和量化方法,提取、量化感知重要的源模型参数,在保证重建信号质量的同时,有效减小了编码比特率。近年来,研究者将自适应时间分段、联合参数量化、参数立体声等新技术引入参数音频编码,使算法得到了优化,重建信号质量也得到了显著提升,其中某些技术还成为了国际标准,并获得商业应用。本文回顾了近十几年来参数音频编码的重要进展,评述、探讨了存在的问题和研究的难点,并给出了两个典型参数音频编码系统的主观测试数据,以定量显示其性能,最后,展望了参数音频编码发展的方向。 There are two basic techniques in audio coding:waveform coding and parametric coding.The first one is usually utilized in high rates and high quality coding applications,the last one utilized in bandwidth or storage capacity constrained applications and fields.Parametric audio coding decomposes source signal into several components by source models,and employs parameter estimation /quantization algorithms that based on psychoacoustic model,to extract model parameters and then quantize these parameters.These algorithms reduce the bit rate and preserve the quality of reconstructed signal.In recent years,some new techniques have been proposed, such as adaptive time segmentation,joint parameter quantization,parametric stereo,which optimize the coding algorithms and enhance the quality of reconstructed signal significantly.As a result,some of the schemes have now become international standards and have being used in commercial products.In the paper,an overview on the important advances of parametric audio coding in the past decades is introduced.While reviewing the aspects that should be improved,the existing difficult problems in parametric audio coding are discussed.Furthermore,the results of listening tests are presented for two special parameter audio coders in order to illustrate the performance of these techniques.In the end,possible directions of further improvements are indicated.
出处 《信号处理》 CSCD 北大核心 2011年第4期575-586,共12页 Journal of Signal Processing
基金 国家自然科学基金(No.60872027 No.61072089) 北京市自然科学基金(No.4082006) 北京市属高等学校人才强教计划资助项目
关键词 参数音频编码 心理声学模型 运筹率失真优化 参数建模 parametric audio coding psychoacoustic models operational rate-distortion optimization parameter modeling
  • 相关文献

参考文献100

  • 1Bosi M, Brandenburg K et al.. ISO/IEC MPEG-2 Advanced audio coding[ J ]. Journal of the Audio Engineering Society, 1997, 45(10) : 789-814.
  • 2Berger T, Gibson J D. Lossy source coding [ J ]. IEEE Transactions on Information Theory, 1998,44 (6) : 2693- 2723.
  • 3McAulay R J, Quatieri T F. Speech analysis/synthesis based on a sinusoidal representation [ J ]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1986, 34(4): 744-754.
  • 4McAulay R J, Quatieri T F. Sinusoidal coding[A]. W B Kleijn, K K Paliwal. Speech Coding and Synthesis [ M ]. Amsterdam, The Netherlands: Elsevier, 1995. 121-173.
  • 5Smith III J O, Serra X. Spectral modeling synthesis: a sound analysis/synthesis system based on a deterministic plus stochastic decomposition[ J ]. Computer Music Journal, 1990, 14(4): 12-24.
  • 6George E B, Smith M J T. Ana|ysis-by-synthesis/overlapadd sinusoidal modeling applied to the analysis-synthesis of musical tones [ J]. Journal of the Audio Engineering Society, 1992, 40(6) : 497-516.
  • 7Lindblom J, Hedelin P. Packet loss concealment based on sinusoidal modeling [ A]. in Proc. 1EEE Workshop on Speech Coding for Telecommunications '02 [ C ]. Delavan, USA: IEEE Signal Processing Society, IEEE Comnmnications Society, 2002. 65-67.
  • 8Rφdbro C A, Murthi M Net al.. Hidden markov model based framework for packet loss concealmenl in voice over IP[J]. IEEE Transactions on Audio, Speech, and Lan- guage Processing, 2006, 14(5): 1609-1623.
  • 9ISO/IEC 14496-3 : 1999, FPDAM1 : Audio extensions[S].
  • 10ISO/IEC 14496-3 : 2001, FPDAM2: Parametric coding [S].

二级参考文献16

  • 1W B Kleijn, K K Paliwal. Speech Coding and Synthesis[M]. Amsterdam, Netherlands: Elsevier, 1995.121 - 173.
  • 2C Roads,et al.X Serra.musical sound modeling with sinusoids plus noise [A]. Musical Signal Processing [C]. Lisse, Netherlands: Swets & Zeitlinger Publishers, 1997.91 - 122.
  • 3R Kumaresan, D Tufts. Estimating the parameters of exponenflatly damped sinusoids and pole-zero modeling in noise [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1982,30(6) : 833 - 840.
  • 4R Boyer,K Abed-Meraim. Audio transients modeling by damped and delayed sinusoids (DDS) [A]. Proceedings of ICASSP' 02 [C]. Orlando, USA: IEEE Signal Processing Society, 2002. 1729- 1732.
  • 5J Jensen, R Heusdens, S H Jensen. A perceptual subspace ap- proach for modeling of speech and audio signals with damped sinusoids[J]. IEEE Transactions on Speech and Audio Processing, 2004,12(2) : 121 - 132.
  • 6R Badeau,B David, et al.A new perturbation analysis for signal enumeration in rotational invariance techniques [J]. IEEE Transactions on Signal Processing,2006,54(2):450- 458.
  • 7P Stoica, Y Selen. Model - order selection: a review of informarion criterion rules [ J ]. IEEE Signal Processing Magazine, 2004,21 (4) :36 - 47.
  • 8R Roy, A Paulraj, T Kailath. ESPRIT-a subspace rotation approach to estimation of parameters of cisoids in noise[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1986,34(5) : 1340 - 1342.
  • 9S Van Huffel, J Vandewalle. The Total Least Squares Problem: Computational Aspects and Analysis[ M]. Philadelphia, USA: SIAM, 1991.27 - 95,213 - 220.
  • 10T J Abatzoglou, J M Mendel,et al. The constrained total least squares technique and its applications to harmonic superresolution[J]. IEEE Transactions on Signal Processing, 1991, 39 (5) : 1070 - 1087.

同被引文献16

  • 1赵超超.基于用户和基于项目结合的个性化推荐算法[J].内蒙古农业大学学报(社会科学版),2007,9(6):139-140. 被引量:3
  • 2数字音视频编解码技术标准工作组.GB/T20090信息技术先进音视频编码第10部分移动语音与音频编码标准[S].2013.
  • 3Zhan J,Choo K,Oh E.Bandwidth extension for China AVS-M standard[C]//IEEE International Conference on Acoustics,Speech and Signal Processing,2009:4149-4152.
  • 4Quackenbush S.MPEG unified speech and audio coding[J].IEEE Multi Media,2013,20(2):72-78.
  • 5Strutt J W.On our perception of sound direction[J].Philosophical Magazine,1907,13:214-232.
  • 6Schuijers E,Oomen W,den Brinker B,et al.Advances in parametric coding for high-quality audio[C]//Proceedings of AES Convention,2003.
  • 7Herre J,Dietz M.MPEG-4 high-efficiency AAC coding[J].IEEE Signal Processing Magazine,2008,25(3):137-142.
  • 83GPP TS 26.401 version12.0.0 Release 12.Enhanced aacPlus general audio codec:general description[Z].2014.
  • 93GPP TS 26.405 version 12.0.0 Release 12.Encoder specification parametric stereo part[Z].2014.
  • 10You S D,Cheng F Y.Spatial localization evaluation model for parametric stereo audio[J].Appl Math Inf Sci,2012,6:397-402.

引证文献2

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部