期刊文献+

基于局部最小二乘支持向量机的音频频带扩展方法 被引量:3

Audio Bandwidth Extension Method Based on Local Least Square Support Vector Machine
下载PDF
导出
摘要 在网络传输过程中宽带音频会由于高频信息的缺失导致音频质量下降,因此,本文提出了一种基于局部最小二乘支持向量机的宽带向超宽带音频频带扩展方法.根据音频频域序列的非线性特性,本文采用相空间重构和局部最小二乘支持向量机对音频信号的高频频谱细节进行预测,并结合高斯混合模型对高频子带能量进行估计,最后经过高频频谱包络调整,所提方法能够有效地恢复7k Hz^14k Hz频率范围内的高频成分.主客观测试结果表明,该方法改善了宽带音频的听觉质量,其性能优于参考音频频带扩展方法. The auditory quality of wideband audio is generally degraded due to the lack of the high-frequency in network transmission,so this paper presents a kind of audio bandwidth extension method from wideband to super wideband based on local least square support vector machine. In the light of the nonlinearity of audio spectrum,the high-frequency fine spectrum of audio signals is predicted by using phase space reconstruction and local least square support vector machine.Combining with the estimation of high-frequency sub-band energy based on Gaussian mixture model,the proposed method can effectively recover the high-frequency components in the frequency range 7k Hz ~ 14 k Hz through the envelope adjustment of high-frequency spectrum at last. Subjective and objective testing results indicate that the proposed method improves the auditory quality of wideband audio and outperforms the reference methods of audio bandwidth extension.
出处 《电子学报》 EI CAS CSCD 北大核心 2016年第9期2203-2210,共8页 Acta Electronica Sinica
基金 国家自然科学基金项目(No.61072089 No.61471014)
关键词 音频编码 频带扩展 高斯混合模型 局部最小二乘支持向量机 audio coding bandwidth extension Gaussian mixture model local least square support vector machine
  • 相关文献

参考文献2

二级参考文献14

  • 1俞一彪,王朔中.基于互信息匹配模型的说话人识别[J].声学学报,2004,29(5):462-466. 被引量:8
  • 2郎玥,赵胜辉,匡镜明.基于矢量量化的语音信号频带扩展[J].北京理工大学学报,2005,25(3):260-264. 被引量:4
  • 3党辰,戴葵,王苏峰,刘芸,王志英.高频重建技术SBR的研究与实现[J].电子学报,2004,32(F12):189-191. 被引量:2
  • 4俞一彪,王朔中.文本无关说话人识别的全特征矢量集模型及互信息评估方法[J].声学学报,2005,30(6):536-541. 被引量:7
  • 5Jax P, Vary P. Bandwidth extension of speech signals: a catalyst for the introduction of wideband speech coding. IEEE Communications Magazines, 2006; 44(5): 106--111.
  • 6Geiser B, Jax P. Bandwidth extension for hierarchical speech and audio coding in ITU-T rec. G.729.1. IEEE Transactions on Audio, Speech and Language Processing, 2007; 15(8): 2496--2509.
  • 7Dar Ghulam Raza, Cheung-Fat Chan. Enhancing quality of celp coded speech via wideband extension by using voic- ing GMM interpolation and HNM re-synthesis. Proceeding of IEEE International Conference on Acoustics, Speech~ Signal Processing. 2002; 4:1241--1244.
  • 8Nakatoh Y, Tuushima M, Norimatsu T. Generation of broadband speech from narrowband speech using piecewise linear mapping. In Proceeding of EUROSPEECH, 1997; 9: 1643--1646.
  • 9Enbom N, Klenijn W B. Bandwidth expansion of speech based on vector quantization of the reel frequency cepstral coefficients. IEEE Workshop on Speech Coding Proceedings, 1999; 2:171--173.
  • 10Park K Y, Kim H S. Narrowband to wideband conversion of speech using GMM based transformation. Proceeding of IEEE International Conference on Acoustics, Speech, Signal Processing, 2000; 4:1843--1846.

共引文献9

同被引文献19

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部