期刊文献+

基于二维非负矩阵分解的1kb/s WI语音编码算法 被引量:3

1kb/s Waveform Interpolative Speech Coding Based on Two-Dimensional Nonnegative Matrix Factorization
下载PDF
导出
摘要 本文针对波形内插(WI)语音编码模型和参数量化等技术进行了研究,并最终提出了一种基于二维非负矩阵分解的1kb/s波形内插(2DNMF-WI)语音编码算法.文中采用二维非负矩阵分解(2D-NMF)方法来分解语音特征波形(CW),该分解方法在行和列两个方向上同时压缩CW幅度谱矩阵的维数,使得CW幅度谱矩阵降维后得到的编码矩阵维数较小,易于量化.此外,在甚低速率语音编码中,由于没有足够的比特数来描述编码参数,往往很难得到高质量的合成语音.本算法采用两帧联合编码、帧间后向预测三级矢量量化、离散余弦变换(DCT)和分裂式矩阵量化等技术来降低编码速率和改善音质.非正式主观听觉测试显示,1kb/s 2DNMF-WI编码器合成语音的质量稍差于2kb/s的NMF-WI语音编码算法. This paper is focused on the model of waveform interpolation(WI) and its parameters quantization,then a waveform interpolation speech coding algorithm based on two-dimensional nonnegative matrix factorization at 1kb/s is presented.This method makes the dimensions of CW magnitude matrix much lower in columns and rows,so it is convenient for quantizing the coding matrix.In addition,speech coders at very low bit rates can hardly get good performance,for there are no sufficient bits to express these coding parameters.Then two-frame joint,inter-frame backward prediction three-stage vector quantization,discrete cosine transform(DCT) and split matrix quantization techniques are promoted in this paper,in order to reduce the speech coding bit rates as well as to improve the quality of the speech.The results of informal subjective listening test show that the performance of 1kb/s 2DNMF-WI coder is a little worse than that of 2kb/s NMF-WI coder.
出处 《电子学报》 EI CAS CSCD 北大核心 2010年第7期1574-1579,共6页 Acta Electronica Sinica
基金 北京市教委科技发展计划(No.KM200710005001) 国家自然科学基金(No.60372063) 北京市自然科学基金(No.4042009) 北京市属高校人才强教计划
关键词 语音编码 波形内插 特征波形 二维非负矩阵分解 两帧联合 speech coding waveform interpolation characteristic waveform two-dimensional nonnegative matrix factorization two-frame joint
  • 相关文献

参考文献21

  • 1鲍长春.数字语音编码原理[M].西安:西安电子科技大学出版社,2007.
  • 2W B Kleijn,Haagen J.Waveform Interpolation for Coding and Synthesis.Speech coding and Synthesis[M].Holland:Elsevier Science,1995.175-207.
  • 3W B Kleijn,J Haagen.Transformation and decomposition of the speech signal for coding[J].IEEE signal processing letters,1994,1 (9):136-139.
  • 4N R Chong,I S Burnett,J F Chicharo.Use of pitch synchor wavelet transform as a new decomposition method for WI[A].Proceeding of IEEE International Conferance on Acoustics,Speech,Signal Processing[C].Seattle,Wash,USA:IEEE,1998.513-516.
  • 5J Lukasiak,I S Burnett.Scalable decomposition of speech waveforms[A].2002 IEEE Speech Coding Workshop Proceedings[C].Tsukuba City,Ibaraki,Japan:IEEE,2002.135-137.
  • 6王贵平,鲍长春,张鹏.基于奇异值分解的低速率波形内插语音编码算法[J].电子学报,2006,34(1):135-140. 被引量:13
  • 7张鹏,鲍长春.基于SVD的低复杂度语音特征波形分解方法[J].信号处理,2005,21(z1):160-163. 被引量:2
  • 8张鹏,鲍长春,郭莉莉.基于非负矩阵分解的2kb/s波形内插语音编码算法[J].电子学报,2008,36(4):632-638. 被引量:5
  • 9Peng Zhang,Changchun BAO.A novel 2kb/s waveform interpolation speech coder based on non-negative matrix factorization[A].Interspeech[C].Antwerp,Belgium:ICSA,2007.1661-1664.
  • 10D D Lee,H S Seung.Learning the parts of objects by nonnegative matrix factorization[J].Nature,1999,401:788 -791.

二级参考文献79

共引文献68

同被引文献48

  • 1高兴斌,刘永坦.ISAR目标象的特征提取和特征选择[J].哈尔滨工业大学学报,1994,26(5):77-81. 被引量:6
  • 2许人灿,刘朝军,黄小红,陈曾平.基于超分辨ISAR成像的空中目标自动识别[J].系统工程与电子技术,2006,28(1):46-48. 被引量:10
  • 3高宏娟,潘晨.基于(2D)^2NMF及其改进算法的人脸识别[J].计算机应用,2007,27(7):1660-1662. 被引量:7
  • 4鲍长春.数字语音编码原理[M].西安:西安电子科技大学出版社,2007.
  • 5Toumi A,Hoeltzener B,Khenchaf A. Using watersheds segmentation on ISAR image for automatic target recognition[A].Lyon,France:IEEE,2007.285-290.
  • 6Lin Bo,Yan Fengxia,Zhu Jubo. Feature extraction of 2D radar profile via double-sides 2DPCA for target recognition[A].Tianjin,China:IEEE,2009.1-5.
  • 7Lee D D,Seung H S. Learning the parts of objects by non-negative matrix factorization[J].{H}NATURE,1999,(6755):788-791.
  • 8Lin Chinjen. Projected gradient methods for non-negative matrix factorization[J].{H}Neural Computation,2007,(10):2756-2779.
  • 9Zhang Daoqiang,Chen Songcan,Zhou Zhihua. Two-di-mensional non-negative matrix factorization for face representation and recognition[A].Beijing,China:Springer,2005.350-363.
  • 10Kim K T,Seo D K,Kim H T. Efficient classification of ISAR images[J].{H}IEEE Transactions on Antennas and Propagation,2005,(05):1611-1621.

引证文献3

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部