Audio Bandwidth Extension Method Using Similarity Correlation Degree-Based Neural Network
Abstract: The limited bandwidth of wideband audio degrades its subjective quality and naturalness. This paper proposes a method for extending audio bandwidth from wideband to super-wideband using a similarity correlation degree-based neural network. The fine spectrum of the wideband audio is first reconstructed into a multi-dimensional phase space, and a similarity correlation degree-based neural network is then built to recover the fine spectrum of the high-frequency components. In addition, a Gaussian mixture model is used to estimate the high-frequency spectral envelope. The bandwidth extension is implemented on the ITU-T G.722.1 wideband codec. Evaluation results indicate that the proposed method outperforms the reference methods and achieves a subjective quality close to that of the G.722.1C super-wideband codec.
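The abstract does not say how the fine spectrum is mapped into a multi-dimensional phase space. A common way to reconstruct a phase space from a one-dimensional sequence is time-delay embedding, so the sketch below assumes that technique rather than reproducing the paper's procedure. The function name `delay_embed`, the parameters `dim` and `tau`, and the sample values are all hypothetical.

```python
from typing import List, Sequence


def delay_embed(series: Sequence[float], dim: int, tau: int) -> List[List[float]]:
    """Build delay vectors [x[i], x[i + tau], ..., x[i + (dim - 1) * tau]].

    Assumption: standard time-delay embedding, not the paper's exact method.
    """
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    # Number of complete delay vectors that fit inside the series.
    n_vectors = len(series) - (dim - 1) * tau
    if n_vectors <= 0:
        raise ValueError("series is too short for this dim and tau")
    return [[series[i + k * tau] for k in range(dim)] for i in range(n_vectors)]


# Hypothetical fine-spectrum values, for illustration only.
fine_spectrum = [0.1, 0.4, 0.3, 0.8, 0.5, 0.2]
points = delay_embed(fine_spectrum, dim=3, tau=2)
print(points)
```

Each row of `points` is one point in the reconstructed space. In a setup like the one the abstract describes, such points would be the inputs from which a predictor estimates the missing high-frequency fine spectrum.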
Authors: Liu Xin (刘鑫), Bao Changchun (鲍长春)
Source: Acta Electronica Sinica (《电子学报》), 2015, No. 4, pp. 816-821 (6 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (No. 61072089)
Keywords: audio coding; audio bandwidth extension; similarity correlation degree-based neural network; phase space reconstruction; Gaussian mixture model
