参数音频编码回顾被引量：2

The Review of Parametric Audio Coding

下载PDF

导出

摘要音频编码主要有两类技术:波形编码,参数编码。前者适合高速率高质量的应用环境,后者适合带宽受限或存储容量受限的应用或环境。参数音频编码以源模型表示信号,运用基于心理声学原理的参数估计和量化方法,提取、量化感知重要的源模型参数,在保证重建信号质量的同时,有效减小了编码比特率。近年来,研究者将自适应时间分段、联合参数量化、参数立体声等新技术引入参数音频编码,使算法得到了优化,重建信号质量也得到了显著提升,其中某些技术还成为了国际标准,并获得商业应用。本文回顾了近十几年来参数音频编码的重要进展,评述、探讨了存在的问题和研究的难点,并给出了两个典型参数音频编码系统的主观测试数据,以定量显示其性能,最后,展望了参数音频编码发展的方向。 There are two basic techniques in audio coding：waveform coding and parametric coding.The first one is usually utilized in high rates and high quality coding applications,the last one utilized in bandwidth or storage capacity constrained applications and fields.Parametric audio coding decomposes source signal into several components by source models,and employs parameter estimation /quantization algorithms that based on psychoacoustic model,to extract model parameters and then quantize these parameters.These algorithms reduce the bit rate and preserve the quality of reconstructed signal.In recent years,some new techniques have been proposed, such as adaptive time segmentation,joint parameter quantization,parametric stereo,which optimize the coding algorithms and enhance the quality of reconstructed signal significantly.As a result,some of the schemes have now become international standards and have being used in commercial products.In the paper,an overview on the important advances of parametric audio coding in the past decades is introduced.While reviewing the aspects that should be improved,the existing difficult problems in parametric audio coding are discussed.Furthermore,the results of listening tests are presented for two special parameter audio coders in order to illustrate the performance of these techniques.In the end,possible directions of further improvements are indicated.

作者王嵩鲍长春李晓明

机构地区北京工业大学电子信息与控制工程学院北京石油化工学院信息工程学院

出处《信号处理》 CSCD 北大核心 2011年第4期575-586,共12页 Journal of Signal Processing

基金国家自然科学基金(No.60872027 No.61072089) 北京市自然科学基金(No.4082006) 北京市属高等学校人才强教计划资助项目

关键词参数音频编码心理声学模型运筹率失真优化参数建模 parametric audio coding psychoacoustic models operational rate-distortion optimization parameter modeling

分类号 TN911.21 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献100

1Bosi M, Brandenburg K et al.. ISO/IEC MPEG-2 Advanced audio coding[ J ]. Journal of the Audio Engineering Society, 1997, 45(10) : 789-814.
2Berger T, Gibson J D. Lossy source coding [ J ]. IEEE Transactions on Information Theory, 1998,44 (6) : 2693- 2723.
3McAulay R J, Quatieri T F. Speech analysis/synthesis based on a sinusoidal representation [ J ]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1986, 34(4): 744-754.
4McAulay R J, Quatieri T F. Sinusoidal coding[A]. W B Kleijn, K K Paliwal. Speech Coding and Synthesis [ M ]. Amsterdam, The Netherlands: Elsevier, 1995. 121-173.
5Smith III J O, Serra X. Spectral modeling synthesis: a sound analysis/synthesis system based on a deterministic plus stochastic decomposition[ J ]. Computer Music Journal, 1990, 14(4): 12-24.
6George E B, Smith M J T. Ana|ysis-by-synthesis/overlapadd sinusoidal modeling applied to the analysis-synthesis of musical tones [ J]. Journal of the Audio Engineering Society, 1992, 40(6) : 497-516.
7Lindblom J, Hedelin P. Packet loss concealment based on sinusoidal modeling [ A]. in Proc. 1EEE Workshop on Speech Coding for Telecommunications '02 [ C ]. Delavan, USA: IEEE Signal Processing Society, IEEE Comnmnications Society, 2002. 65-67.
8Rφdbro C A, Murthi M Net al.. Hidden markov model based framework for packet loss concealmenl in voice over IP[J]. IEEE Transactions on Audio, Speech, and Lan- guage Processing, 2006, 14(5): 1609-1623.
9ISO/IEC 14496-3 : 1999, FPDAM1 : Audio extensions[S].
10ISO/IEC 14496-3 : 2001, FPDAM2: Parametric coding [S].

二级参考文献16

1W B Kleijn, K K Paliwal. Speech Coding and Synthesis[M]. Amsterdam, Netherlands: Elsevier, 1995.121 - 173.
2C Roads,et al.X Serra.musical sound modeling with sinusoids plus noise [A]. Musical Signal Processing [C]. Lisse, Netherlands: Swets & Zeitlinger Publishers, 1997.91 - 122.
3R Kumaresan, D Tufts. Estimating the parameters of exponenflatly damped sinusoids and pole-zero modeling in noise [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1982,30(6) : 833 - 840.
4R Boyer,K Abed-Meraim. Audio transients modeling by damped and delayed sinusoids (DDS) [A]. Proceedings of ICASSP' 02 [C]. Orlando, USA: IEEE Signal Processing Society, 2002. 1729- 1732.
5J Jensen, R Heusdens, S H Jensen. A perceptual subspace ap- proach for modeling of speech and audio signals with damped sinusoids[J]. IEEE Transactions on Speech and Audio Processing, 2004,12(2) : 121 - 132.
6R Badeau,B David, et al.A new perturbation analysis for signal enumeration in rotational invariance techniques [J]. IEEE Transactions on Signal Processing,2006,54(2):450- 458.
7P Stoica, Y Selen. Model - order selection: a review of informarion criterion rules [ J ]. IEEE Signal Processing Magazine, 2004,21 (4) :36 - 47.
8R Roy, A Paulraj, T Kailath. ESPRIT-a subspace rotation approach to estimation of parameters of cisoids in noise[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1986,34(5) : 1340 - 1342.
9S Van Huffel, J Vandewalle. The Total Least Squares Problem: Computational Aspects and Analysis[ M]. Philadelphia, USA: SIAM, 1991.27 - 95,213 - 220.
10T J Abatzoglou, J M Mendel,et al. The constrained total least squares technique and its applications to harmonic superresolution[J]. IEEE Transactions on Signal Processing, 1991, 39 (5) : 1070 - 1087.

同被引文献16

1赵超超.基于用户和基于项目结合的个性化推荐算法[J].内蒙古农业大学学报（社会科学版）,2007,9(6):139-140. 被引量：3
2数字音视频编解码技术标准工作组.GB/T20090信息技术先进音视频编码第10部分移动语音与音频编码标准[S].2013.
3Zhan J,Choo K,Oh E.Bandwidth extension for China AVS-M standard[C]//IEEE International Conference on Acoustics,Speech and Signal Processing,2009:4149-4152.
4Quackenbush S.MPEG unified speech and audio coding[J].IEEE Multi Media,2013,20(2):72-78.
5Strutt J W.On our perception of sound direction[J].Philosophical Magazine,1907,13:214-232.
6Schuijers E,Oomen W,den Brinker B,et al.Advances in parametric coding for high-quality audio[C]//Proceedings of AES Convention,2003.
7Herre J,Dietz M.MPEG-4 high-efficiency AAC coding[J].IEEE Signal Processing Magazine,2008,25(3):137-142.
83GPP TS 26.401 version12.0.0 Release 12.Enhanced aacPlus general audio codec:general description[Z].2014.
93GPP TS 26.405 version 12.0.0 Release 12.Encoder specification parametric stereo part[Z].2014.
10You S D,Cheng F Y.Spatial localization evaluation model for parametric stereo audio[J].Appl Math Inf Sci,2012,6:397-402.

引证文献2

1李诗晴,涂卫平.AVS-P10立体声编码算法的优化设计与实现[J].计算机工程与应用,2016,52(8):141-147.
2李龙生,艾均,苏湛,李妍妍.结合用户行为和物品标签的协同过滤推荐算法[J].计算机应用与软件,2018,35(6):248-253. 被引量：11

二级引证文献11

1郭雪梅.基于社会化标签的用户标注行为和时间因素的个性化推荐方法研究[J].情报科学,2020,0(2):68-74. 被引量：10
2白云,姜德录,卢造勋.中朝地台东北缘及邻区岩石层壳幔结构比研究[J].长春科技大学学报,2000,30(2):120-124. 被引量：6
3邓小燕,张晓彬.基于改进加权二部图和用户信任度的协同过滤推荐算法[J].计算机系统应用,2019,28(5):125-130. 被引量：2
4王刚,郭雪梅.融合用户行为分析和兴趣序列相似性的个性化推荐方法研究[J].情报理论与实践,2019,42(7):119-125. 被引量：10
5鲍凯丽,刘其成,牟春晓.融合朴素贝叶斯和协同过滤的外卖推荐并行算法研究[J].计算机应用与软件,2019,36(11):250-255. 被引量：2
6李顺勇,张钰嘉,张海玉.基于NKL和K-means聚类的协同过滤推荐算法[J].河南科学,2020,38(1):6-12. 被引量：6
7陆荣,周宁宁.基于MMTD和兴趣偏向系数的协同过滤推荐算法[J].计算机应用研究,2020,37(9):2600-2603. 被引量：1
8孔麟,黄俊,马浩,郑小楠.融合多层相似度与信任机制的协同过滤算法[J].计算机工程与设计,2020,41(12):3405-3411. 被引量：8
9张紫嫣,周驰.结合类别偏好的协同过滤推荐算法[J].计算机应用与软件,2021,38(1):293-296. 被引量：6
10刘国丽,廉孟杰,于丽梅,徐洪楠.融合专家信任的协同过滤推荐算法[J].计算机系统应用,2021,30(4):160-167. 被引量：1

1周宏,陈健.参数音频编码算法研究[J].计算机工程,2002,28(11):44-46.
2熊朋.增强型AAC+编码器中立体声参数的量化算法优化研究[J].电信科学,2010,26(S2):260-263.
3张志远.参数编码算法概述[J].北京广播学院学报（自然科学版）,2005,12(1):31-35.
4Scagl.,A 赖奕蓉.运用基于时频表示的参数建模对运动参数进行估计[J].空载雷达,1998(2):56-61.
5陶峻,陈健.参数音频编码算法的改进[J].通信技术,2002,35(12X):40-43.
6周宏,陈健.基于正弦波的参数音频编码算法[J].通信技术,2002,35(2X):4-6. 被引量：1
7夏田,窦维蓓.一种面向高采样率音频信号的参数音频编码系统[J].电声技术,2005,29(3):44-48.
8陶峻,陈健,曾文珺.参数音频编码中的正弦波参数提取[J].数据采集与处理,2003,18(2):161-164. 被引量：1
9孙凌.空天网络安全体系密钥管理技术研究[J].移动信息,2015(11):71-71.
10电声与音频设备[J].电子科技文摘,2000(10):58-58.

信号处理

2011年第4期

浏览历史

内容加载中请稍等...

参数音频编码回顾被引量：2

参考文献100

二级参考文献16

同被引文献16

引证文献2

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

参数音频编码回顾 被引量：2

参考文献100

二级参考文献16

同被引文献16

引证文献2

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

参数音频编码回顾被引量：2