Audio Codec Using Frequency Masking Based on Frequency Domain Linear Prediction
Abstract: Frequency Domain Linear Prediction (FDLP) gives an approximation of the Hilbert envelope of a signal, which has been shown to carry most of the speech information. An FDLP-based codec works on long temporal segments and preserves the time-domain envelope information very well. The codec reconstructs high-quality signals, but its coding efficiency is low. This paper introduces frequency masking into the FDLP-based codec to reduce the bit rate. Frequency masking is an auditory phenomenon in which the hearing threshold for a sound rises when a more intense sound is present at the same time. A psychoacoustic model is used to estimate the hearing threshold and the absolute threshold of hearing (ATH) for the FDLP carrier signals, and the bit allocation for the FDLP carrier signal in each frequency sub-band is computed from these two thresholds. Applying frequency masking reduces the bit rate by 6% (the Chinese-language abstract gives 5%). The method was evaluated with Perceptual Evaluation of Audio Quality (PEAQ) and MUSHRA tests.
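The threshold-driven bit allocation described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the ATH curve is Terhardt's widely used approximation, the masked threshold of each sub-band is assumed to be supplied by some psychoacoustic model, and the rule of roughly 6.02 dB of signal-to-mask ratio per bit is an assumption borrowed from uniform quantization. The names `ath_db` and `allocate_bits` are hypothetical.

```python
import math


def ath_db(freq_hz):
    """Terhardt's approximation of the absolute threshold of hearing (dB SPL)."""
    khz = freq_hz / 1000.0
    return (3.64 * khz ** -0.8
            - 6.5 * math.exp(-0.6 * (khz - 3.3) ** 2)
            + 1e-3 * khz ** 4)


def allocate_bits(band_centers_hz, band_levels_db, masked_thresholds_db,
                  max_bits=16):
    """Assign bits per sub-band from the signal-to-mask ratio (SMR).

    Assumption (not from the paper): each bit buys about 6.02 dB of SMR,
    and a band whose level is below its threshold gets no bits.
    """
    bits = []
    for freq, level, masked in zip(band_centers_hz, band_levels_db,
                                   masked_thresholds_db):
        # The effective threshold is the larger of the masked threshold
        # and the absolute threshold of hearing at this frequency.
        threshold = max(masked, ath_db(freq))
        smr = level - threshold
        if smr <= 0:
            bits.append(0)  # inaudible band: spend no bits on it
        else:
            bits.append(min(max_bits, math.ceil(smr / 6.02)))
    return bits


if __name__ == "__main__":
    centers = [1000.0, 3300.0, 16000.0]   # hypothetical sub-band centers (Hz)
    levels = [60.0, 20.0, 10.0]           # hypothetical carrier levels (dB SPL)
    masked = [30.0, 40.0, -100.0]         # hypothetical masked thresholds (dB SPL)
    print(allocate_bits(centers, levels, masked))
```

In this sketch a band receives bits only when its level exceeds both the masked threshold and the ATH, which is the mechanism by which masking can lower the overall bit rate.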
Source: 《工业控制计算机》 (Industrial Control Computer), 2014, No. 6, pp. 75-77 (3 pages)
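Funding: Shenzhen Special Fund for the Development of the Biotechnology, Internet, New Energy, and New Materials Industries, Basic Research Program, "Research on Key Technologies of Mobile Multimedia Systems Based on AVS-P10 Technology" (JC201104220203A)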
Keywords: psychoacoustic model, frequency masking, audio coding, Frequency Domain Linear Prediction (FDLP)

