Research on a Bandwidth Extension Algorithm for Narrowband Speech (cited by: 5)

Narrowband speech wideband extension algorithm research
Abstract To reduce spectral distortion, a narrowband speech bandwidth extension algorithm based on a hidden Markov model (HMM) is proposed. First, the parameters that share high mutual information with the wideband spectral envelope are selected to form the feature vector, and the conditional posterior probability is estimated from the joint prior probability of the hidden Markov state and the past observed feature vectors. Second, based on this conditional posterior probability, the wideband spectral envelope is estimated by combining Bayesian conditional parameter estimation with the minimum mean square error criterion. For the wideband excitation signal, an intermediate-frequency excitation extension algorithm is proposed that exploits the harmonic correlation between the low-frequency and high-frequency parts of the signal. Experimental results show that, compared with the conventional HMM-based bandwidth extension algorithm, the proposed algorithm lowers the average spectral distortion by 0.187 dB and reduces the number of speech frames with spectral distortion above 10 dB by 34.3%.
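The two estimation steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a discrete-state HMM whose state posterior is updated frame by frame with the standard forward recursion, and it assumes each state is represented by one wideband envelope centroid (its conditional mean), so that the minimum mean square error estimate is the posterior-weighted sum of centroids. The function names `forward_posteriors` and `mmse_envelope` and all parameter names are hypothetical.

```python
from typing import List, Sequence


def forward_posteriors(
    prior: Sequence[float],
    trans: Sequence[Sequence[float]],
    likelihoods: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Return P(state | observations up to frame t) for every frame t.

    prior[i]          -- initial state probability (assumed given)
    trans[i][j]       -- P(state j at frame t | state i at frame t-1)
    likelihoods[t][i] -- p(feature vector of frame t | state i)
    """
    n = len(prior)
    posteriors: List[List[float]] = []
    previous: List[float] = []
    for frame_likelihood in likelihoods:
        if not previous:
            # First frame: the prediction is just the initial state prior.
            predicted = list(prior)
        else:
            # Propagate the previous posterior through the transition matrix.
            predicted = [
                sum(previous[i] * trans[i][j] for i in range(n))
                for j in range(n)
            ]
        # Joint probability of state and current observation, then normalize.
        joint = [frame_likelihood[j] * predicted[j] for j in range(n)]
        total = sum(joint)
        if total <= 0.0:
            raise ValueError("observation has zero likelihood under all states")
        previous = [value / total for value in joint]
        posteriors.append(previous)
    return posteriors


def mmse_envelope(
    posterior: Sequence[float],
    centroids: Sequence[Sequence[float]],
) -> List[float]:
    """MMSE estimate: posterior-weighted sum of per-state envelope centroids."""
    dim = len(centroids[0])
    return [
        sum(p * centroid[k] for p, centroid in zip(posterior, centroids))
        for k in range(dim)
    ]
```

Under these assumptions, a caller would compute `forward_posteriors(...)` once per utterance and pass each frame's posterior to `mmse_envelope(...)`. Weighting all centroids by their posterior, rather than picking the single most likely state, is what makes the estimate a conditional mean and therefore optimal in the mean square error sense.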
Authors 张勇 (Zhang Yong), 刘轶 (Liu Yi)
Source 《声学学报》 (Acta Acustica), indexed in EI, CSCD, and the Peking University Core Journals list, 2014, No. 6, pp. 764-773 (10 pages)
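Keywords extension algorithm; band spectrum; posterior probability; hidden Markov; minimum mean square error; signal estimation; prior probability; state space; probability value; feature parameters; Bandwidth; Frequency estimation; Hidden Markov models; Markov processes; Probability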