Bandwidth Expansion of Speech Based on Vector Quantization (cited by: 4)
Abstract: An improved method for bandwidth expansion based on vector quantization (VQ) is presented. A re-quantization method is proposed for building the codebook, and the gain is adjusted using the codebook together with the voicing degree. First, the narrowband input signal is classified as unvoiced, voiced, or silent according to its voicing degree and energy. Then a separate codebook is selected for each class, and VQ-based mapping converts the narrowband spectral envelope into a high-band spectral envelope: the high-band envelope is taken from the code vector that is closest in shape to the spectral envelope of the narrowband frame under analysis. Next, an excitation signal (Gaussian white noise) and the reconstructed high-band spectral envelope are used to synthesize the high-band speech. Finally, the high band is added to the original narrowband signal to give the wideband output. Simulations and comparisons with other methods indicate that the proposed method has a low computational cost and is suitable for real-time use.
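The classification and codebook-mapping steps described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the function names, the threshold values, the squared Euclidean distance, and the toy codebooks are all assumptions introduced here. The re-quantization training, the gain adjustment, and the noise-excited synthesis are not covered.

```python
# Illustrative sketch only: names, thresholds, and codebook values are assumptions.

def classify_frame(energy, voicing, energy_floor=1e-4, voicing_threshold=0.5):
    """Label a frame 'silence', 'voiced', or 'unvoiced' from energy and voicing degree."""
    if energy < energy_floor:
        return "silence"
    return "voiced" if voicing >= voicing_threshold else "unvoiced"


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def map_envelope(nb_envelope, codebook):
    """Return the high-band vector paired with the nearest narrowband code vector.

    codebook is a list of (narrowband_vector, highband_vector) pairs.
    """
    _, highband = min(codebook, key=lambda pair: squared_distance(nb_envelope, pair[0]))
    return highband


def extend_frame(nb_envelope, energy, voicing, codebooks):
    """Pick the class-specific codebook, then look up the high-band envelope."""
    label = classify_frame(energy, voicing)
    return label, map_envelope(nb_envelope, codebooks[label])


# Toy codebooks, one per class (hypothetical values, two entries each).
TOY_CODEBOOKS = {
    "voiced": [([1.0, 0.0], [0.5]), ([0.0, 1.0], [0.1])],
    "unvoiced": [([1.0, 0.0], [0.9]), ([0.0, 1.0], [0.7])],
    "silence": [([0.0, 0.0], [0.0])],
}

if __name__ == "__main__":
    print(extend_frame([0.9, 0.1], energy=0.2, voicing=0.8, codebooks=TOY_CODEBOOKS))
```

Keeping one codebook per class means each lookup searches only the entries trained on frames of the same type, which is in line with the small computational load the abstract reports.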
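Source: 《北京理工大学学报》 (Transactions of Beijing Institute of Technology), 2005, No. 3, pp. 260-264 (5 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: International cooperation project with Ericsson.
Keywords: vector quantization; bandwidth expansion; speech signal processing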