Design of an ultra-low bit rate vocoder
Cited by: 1
Abstract: Building on the mixed excitation linear prediction (MELP) model, this paper implements a vocoder with a bit rate of 600 bit/s. Speech is processed in superframes, and a multi-frame joint coding technique jointly quantizes the speech feature parameters of the subframes, mode by mode. To further reduce quantization error, a predictive switched split vector quantizer based on a Gaussian mixture model (GMM-PSSVQ) is designed. It quantizes the line spectrum frequencies of some subframes in the superframe and uses inter-frame prediction and linear interpolation to improve coding efficiency. The vector quantizer is evaluated by spectral distortion and compared with multistage vector quantization and predictive split vector quantization. The vocoder is evaluated with the objective perceptual evaluation of speech quality (PESQ) and the subjective Diagnostic Rhyme Test (DRT). The results show that the designed vector quantizer has the lowest average spectral distortion of the three, and that the speech synthesized by the vocoder has high clarity and intelligibility.
Authors: LI Qiang (李强); ZHANG Ling (张玲); ZHU Lan (朱兰); MING Yan (明艳). Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
Source: Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition) (《重庆邮电大学学报(自然科学版)》), indexed in CSCD and the Peking University Core Journals list, 2018, No. 6, pp. 776-782 (7 pages)
Funding: National High Technology Research and Development Program of China ("863" Program) (2012AA01A508)
Keywords: mixed excitation linear prediction (MELP); multi-frame joint quantization; vector quantization; performance test
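The abstract evaluates the quantizer by spectral distortion. As a rough illustration only, the sketch below computes the commonly used full-band RMS log-spectral distortion (in dB) between an original and a quantized all-pole (LPC) model. The function names, the FFT size, and the toy coefficients are assumptions made for this example; the paper's exact definition, frequency range, and data are not given in this record.

```python
import numpy as np

def lpc_log_spectrum_db(a, n_fft=512):
    """Log power spectrum in dB of the all-pole filter 1/A(z).

    `a` holds the polynomial coefficients of A(z) with a[0] == 1.
    10*log10(1/|A|^2) equals -20*log10(|A|).
    """
    A = np.fft.rfft(np.asarray(a, dtype=float), n_fft)  # zero-padded to n_fft
    return -20.0 * np.log10(np.abs(A))

def spectral_distortion_db(a_ref, a_quant, n_fft=512):
    """RMS difference (dB) between two LPC log spectra over the full band."""
    diff = lpc_log_spectrum_db(a_ref, n_fft) - lpc_log_spectrum_db(a_quant, n_fft)
    return float(np.sqrt(np.mean(diff ** 2)))

# Toy first-order models (hypothetical values, not taken from the paper).
a_ref = [1.0, -0.90]
a_quant = [1.0, -0.85]
print(f"SD = {spectral_distortion_db(a_ref, a_quant):.3f} dB")
```

Averaging this per-frame value over a test set gives the "average spectral distortion" figure that quantizer comparisons usually report.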