期刊文献+

Nonlinear Prediction with Deep Recurrent Neural Networks for Non-Blind Audio Bandwidth Extension 被引量:2

Nonlinear Prediction with Deep Recurrent Neural Networks for Non-Blind Audio Bandwidth Extension
下载PDF
导出
摘要 Non-blind audio bandwidth extension is a standard technique within contemporary audio codecs to efficiently code audio signals at low bitrates. In existing methods, in most cases high frequencies signal is usually generated by a duplication of the corresponding low frequencies and some parameters of high frequencies. However, the perception quality of coding will significantly degrade if the correlation between high frequencies and low frequencies becomes weak. In this paper, we quantitatively analyse the correlation via computing mutual information value. The analysis results show the correlation also exists in low frequency signal of the context dependent frames besides the current frame. In order to improve the perception quality of coding, we propose a novel method of high frequency coarse spectrum generation to improve the conventional replication method. In the proposed method, the coarse high frequency spectrums are generated by a nonlinear mapping model using deep recurrent neural network. The experiments confirm that the proposed method shows better performance than the reference methods. Non-blind audio bandwidth extension is a standard technique within contemporary audio codecs to efficiently code audio signals at low bitrates. In existing methods, in most cases high frequencies signal is usually generated by a duplication of the corresponding low frequencies and some parameters of high frequencies. However, the perception quality of coding will significantly degrade if the correlation between high frequencies and low frequencies becomes weak. In this paper, we quantitatively analyse the correlation via computing mutual information value. The analysis results show the correlation also exists in low frequency signal of the context dependent frames besides the current frame. In order to improve the perception quality of coding, we propose a novel method of high frequency coarse spectrum generation to improve the conventional replication method. In the proposed method, the coarse high frequency spectrums are generated by a nonlinear mapping model using deep recurrent neural network. The experiments confirm that the proposed method shows better performance than the reference methods.
出处 《China Communications》 SCIE CSCD 2018年第1期72-85,共14页 中国通信(英文版)
基金 supported by the National Natural Science Foundation of China under Grant No. 61762005, 61231015, 61671335, 61702472, 61701194, 61761044, 61471271 National High Technology Research and Development Program of China (863 Program) under Grant No. 2015AA016306 Hubei Province Technological Innovation Major Project under Grant No. 2016AAA015 the Science Project of Education Department of Jiangxi Province under No. GJJ150585 The Opening Project of Collaborative Innovation Center for Economics Crime Investigation and Prevention Technology, Jiangxi Province, under Grant No. JXJZXTCX-025
关键词 AUDIO CODING non-blind audiobandwidth EXTENSION context correlation deeprecurrent neural network audio coding non-blind audio bandwidth extension context correlation deep recurrent neural network
  • 相关文献

参考文献2

二级参考文献36

  • 1窦庚欣.4kb/s快速DP-CELP语音编码与频带扩展技术研究[D].北京:北京工业大学,2009.
  • 2Larsen E,Aarts R M.Audio Bandwidth Extension-Application of Psychoacoustics,Signal Processing and Loudspeaker Design[M].UK:John Wiley & Sore Ltd,2004:145-234.
  • 3Nagel F and Disch S.A Harmonic Bandwidth Extension Method for Audio Codecs[C],Taipei:IEEE International Conference on Acoustics,Speech and Signal Processing.2009:145-148.
  • 4Sha Y T,Bao C C,Jia M S,and Liu X.High Frequency Reconstruction of Audio Signal Based on Chaotic Prediction Theory[C],USA:IEEE International Conferences on Acoustics Speech and Signal Processing (ICASSP2010).2010:381-384.
  • 5Lin X,Bao C C,Jia M S and Sha Y T.Nonlinear Bandwidth Extension Based on Nearest-Neighbor Matching[C].Singapore:Journal Processing of the 2nd APSIPA ASC,2010:169-172.
  • 6Hua S,Lin Y.R/S analysis of China securities markets[J].Tsinghua Science and Technology.2003,8 (5):537-540.
  • 7Wang Z C,Zhao Y Q and Zhao C.Fractal Prediction Algorithm of L-PLC Channel Noise Based on Phase Space Reconstruction[C].CHINA:2010 International Conference on Intelligent Computation Technology and Automation (ICICTA2010),2010:343-346.
  • 8Mazel D S,Hayes M H.Using Iterated Function Systems to Model Discrete Sequences[J].IEEE Transactions on Signal Processing.1992,40(7):1724-1734.
  • 9LinX,Bao C C,Jia M S and Sha Y T.A Harmonic Bandwidth extension based on Gaussian Mixture Model[C].CHINA:10th International Conference on Signal Processing (ICSP2010).2010:474-477.
  • 10Deng J D,Simmermacher C,Cranefield S.A study on feature analysis for musical instrument classification[J].IEEE Transactions on Systems,Man,and Cybernetics,2008,38(2):429-438.

共引文献3

同被引文献15

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部