
2 kb/s Bayesian Ying-Yang Waveform Interpolative Speech Coding Based on Non-Negative Matrix Factorization

Cited by: 3
Abstract: This paper proposes an improved characteristic waveform (CW) decomposition algorithm based on nonnegative matrix factorization (NMF). First, the rival penalized competitive learning (RPCL) algorithm and the Bayesian Ying-Yang (BYY) harmony learning algorithm are used to compute the NMF factorization rank, which reduces encoder complexity without noticeably degrading speech quality. Second, based on the relationship between the energy of the CW and the energy of the coding matrix, a mixed autoregressive synthesis method for the phase spectrum is proposed, which improves the naturalness of the decoded speech. Finally, an improved low-complexity 2 kb/s NMF-WI speech coding method is developed. It uses an NMF iteration based on the Kullback-Leibler (K-L) divergence together with a faster-converging Mel-scale band-partitioning initialization of the basis vectors, and it classifies the characteristic waveforms into six classes according to the statistical distribution of the pitch period. In the CW decomposition module, computational complexity drops by 10 MOPS, and speech quality improves to a level comparable to that of a 2.16 kb/s NMF-WI speech coder that uses 4-bit dispersion vector quantization of the phase spectrum.
Source: Acta Electronica Sinica (《电子学报》; indexed in EI, CAS, CSCD, PKU Core), 2009, No. 5, pp. 1146-1152, F0003 (8 pages in total)
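Funding: Science and Technology Development Program of the Beijing Municipal Education Commission (No. KM200710005001); National Natural Science Foundation of China (No. 60372063); Beijing Natural Science Foundation (No. 4042009)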
Keywords: speech coding; waveform interpolation; characteristic waveform; non-negative matrix factorization
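
The abstract names an NMF iteration based on the K-L divergence but does not give the update rules. The following is a minimal sketch, assuming the standard multiplicative updates for the generalized K-L divergence (Lee and Seung), which may differ from the authors' exact iteration. The function names `matmul`, `kl_divergence`, and `nmf_kl` are hypothetical, the factorization rank is passed in by hand rather than chosen by RPCL or BYY harmony learning, and the basis vectors are initialized at random rather than with the Mel-scale band-partitioning scheme described in the paper.

```python
import math
import random


def matmul(A, B):
    """Plain-Python matrix product of A (m x k) and B (k x n)."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def kl_divergence(V, WH, eps=1e-12):
    """Generalized K-L divergence D(V || WH) = sum(V * log(V / WH) - V + WH)."""
    total = 0.0
    for v_row, wh_row in zip(V, WH):
        for v, wh in zip(v_row, wh_row):
            if v > 0:
                total += v * math.log(v / (wh + eps))
            total += wh - v
    return total


def nmf_kl(V, rank, n_iter=200, seed=0, eps=1e-12):
    """Factor a nonnegative matrix V (m x n) as W (m x rank) times H (rank x n).

    Assumption: random positive initialization and a fixed iteration count;
    neither is taken from the paper.
    """
    rng = random.Random(seed)
    m, n = len(V), len(V[0])
    W = [[rng.uniform(0.1, 1.0) for _ in range(rank)] for _ in range(m)]
    H = [[rng.uniform(0.1, 1.0) for _ in range(n)] for _ in range(rank)]

    for _ in range(n_iter):
        # Update H: H[a][j] *= sum_i W[i][a] * V[i][j] / (WH)[i][j] / sum_i W[i][a]
        WH = matmul(W, H)
        Q = [[V[i][j] / (WH[i][j] + eps) for j in range(n)] for i in range(m)]
        for a in range(rank):
            col_sum = sum(W[i][a] for i in range(m)) + eps
            for j in range(n):
                H[a][j] *= sum(W[i][a] * Q[i][j] for i in range(m)) / col_sum

        # Update W: W[i][a] *= sum_j H[a][j] * V[i][j] / (WH)[i][j] / sum_j H[a][j]
        WH = matmul(W, H)
        Q = [[V[i][j] / (WH[i][j] + eps) for j in range(n)] for i in range(m)]
        for a in range(rank):
            row_sum = sum(H[a][j] for j in range(n)) + eps
            for i in range(m):
                W[i][a] *= sum(H[a][j] * Q[i][j] for j in range(n)) / row_sum

    return W, H


if __name__ == "__main__":
    # Toy nonnegative matrix standing in for a block of CW magnitude spectra.
    V = [
        [1.0, 2.0, 3.0, 4.0],
        [2.0, 4.0, 6.0, 8.0],
        [1.0, 1.0, 2.0, 2.0],
    ]
    W, H = nmf_kl(V, rank=2)
    print("K-L divergence after fitting:", kl_divergence(V, matmul(W, H)))
```

In this sketch a smaller `rank` means fewer multiply-accumulate operations per update, which is the kind of complexity saving the abstract attributes to choosing the factorization rank automatically.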
