期刊文献+

基于耳蜗滤波器倒谱参数的音频频带扩展方法 被引量:1

Bandwidth extension of audio signals based on cochlear filter cepstral coefficients
原文传递
导出
摘要 音频频带扩展是根据接收的宽带信号在解码端人为地重建出丢失的高频成分,以提升音频听觉质量。该文基于耳蜗滤波器倒谱参数提出了一种盲目式音频频带扩展方法。该方法模拟外耳听觉系统,提取耳蜗滤波器倒谱系数来描述宽带音频频谱信息,并利用Gauss混合模型对高频谱包络进行估计。结合基于最近邻匹配的谱细节恢复方法,实现了宽带向超宽带音频的有效扩展。主客观测试表明,该方法的重建音频质量优于基于传统音频特征的扩展方法。 Bandwidth extension of audio signals artificially restores the truncated high-frequency components from the transmitted wideband signal at the decoder to improve the auditory quality of the reproduced audio signals.This paper describes a blind bandwidth extension method based on the cochlear filter cepstral coefficients.The system emulates auditory peripheral hearing to calculate cochlear filter cepstral coefficients that more accurately describe the spectral information of the wideband audio.The Gaussian mixture model is used to estimate the spectral envelope of high frequencies.Finally,nearest neighbor mapping used to recover the fine structure at high frequencies is combined with the bandwidth extension of wideband audio signals to give superior wideband audio.Objective and subjective tests both indicate that the method achieves better performance than bandwidth extension methods based on conventional audio features.
作者 刘鑫 鲍长春
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2013年第6期913-916,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金项目(61072089 60872027) 北京工业大学研究生科技基金项目(ykj-2012-7001)
关键词 音频信号处理 音频频带扩展 耳蜗滤波器倒谱系数 Gauss混合模型 最近邻匹配 audio signal processing audio bandwidth extension cochlear filter cepstral coefficient Gaussian mixture model nearest neighbor mapping
  • 相关文献

参考文献12

  • 1Larsen E, Aarts R M. Audio Bandwidth Extension: Application of Psychoacoustics, Signal Processing and Loudspeaker Design [M]. Chichester, UK~ John Wiley Sons, 2004.
  • 2Jax P, Vary P. On artificial bandwidth extension of telephone speech [J]. Signal Processing, 2003, 83(8) : 1707 - 1719.
  • 3Jax P, Vary P. Feature selection for improved bandwidth extension of speech signals [C]// IEEE International Conference on Acoustics, Speech, and Signal Processing. Montreal, Canada: IEEE Press, 2004:697-700.
  • 4Nilsson M, Gustafsson H, Andersen S V, ct al. Gaussian mixture model based mutual information estimation between frequency bands in speech I-C]// IEEE International Conference on Acoustics, Speech and Signal Processing. Orlando, FL, USA: IEEE Press, 2002.- 525-528.
  • 5Nour-Eldin A H, Kabal P. Mel-frequency cepstral coefficient-based bandwidth extension of narrowband speech [C]// 9th Annual Conference of the International Speech Communication Association. Brisbane, Australia: ISCA Press, 2008: 53-56.
  • 6Li Q. An auditory-based transform for audio signal processing [C]// IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY, USA: IEEE Press, 2009: 181 - 184.
  • 7Li Q, Huang Y. An auditory-based feature extraction algorithm for robust speaker identification under mismatched conditions [J]. IEEE Transactions on Audio Speech and Language Processing, 2011, 19(6): 1791- 1801.
  • 8Jax P, Vary P. An upper bound on the quality of artificial bandwidth extension of narrowband speech signals [C]// IEEE International Conference on Acoustics, Speech and Signal Processing. Orlando, FL, USA: IEEE Press, 2002.- 237 - 240.
  • 9Liu X, Bao C, Jia M, et al. A harmonic bandwidth extension based on Gaussian mixture model [C]// IEEE 10th International Conference on Signal Processing. Beijing, China: IEEE Press, 2010: 474-477.
  • 10Shlien S. The modulated lapped transform, its time-varying forms, and its applications to audio coding standards [J]. IEEE Transactions on Speech and Audio Processing, 1997, 5(4) : 359 - 366.

同被引文献10

  • 1窦庚欣,鲍长春.一种基于矢量量化的语音信号频带扩展方法[C]∥第十二届全国信号处理学术年会(CCSP-2005).苏州:信号处理,2005,21(z1).
  • 2Jax P, Vary P. On artificial bandwidth extension of telephone speech [J].Signal Processing,2003, 83(8): 1707-1719.
  • 3Nels Rohde, Svend Aage Vedstesen. Artificial bandwidth extension of narrowband Speech[D]. Aalborg: Aalborg University, 2007.
  • 4Kominek J, Black A W. The CMU Arctic speech databases[J]. Proc of Isca Speech Synthesis Workshop, 2004, 99(4):223--224.
  • 5Liu X, Bao C C. Audio bandwidth extension based on temporal smoothing cepslral coefficients[J]. Eurasip Journal on Audio Speech & Music Processing, 2014, 2014(1):1-16.
  • 6何勇军,韩纪庆.一种语音频带扩展的方法及其改进[C].乌鲁木齐:第十届全国人机语音通讯学术会议暨国际语音语言处理研讨会论文摘要集,2009:40-41.
  • 7Kitawaki N, Nagabuchi evaluation for low-bit-rate Journal on Selected Areas H, Itoh K. Objective quality speech coding systems[J]. IEEE in Communications, 1988, 6(2):242-248.
  • 8ITU-T. ITU-T Recommendation P.862, Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs[S]. Geneva:ITU-TP.862 Recommendation, 2001.
  • 9张丽燕,鲍长春,刘鑫,张兴涛.基于非线性音频特征分类的频带扩展方法[J].通信学报,2013,34(8):120-130. 被引量:2
  • 10张勇,刘轶.窄带语音带宽扩展算法研究[J].声学学报,2014,39(6):764-773. 被引量:4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部