期刊文献+

采用模型自适应的语音转换方法 被引量:2

Voice Conversion Method Based On Model Adaptation
下载PDF
导出
摘要 针对非对称语音库情况下的语音转换,提出了一种有效的基于模型自适应的语音转换方法。首先,通过最大后验概率(Maximum A Posteriori,MAP)方法从背景模型分别自适应训练得到源说话人和目标说话人的模型;然后,通过说话人模型中的均值向量训练得到频谱特征的转换函数;并进一步与传统的INCA转换方法相结合,提出了基于模型自适应的INCA语音转换方法,有效实现了源说话人频谱特征向目标说话人频谱特征的转换。通过客观测试和主观测听实验对提出的方法进行评价,实验结果表明,与INCA语音转换方法相比,本文提出的方法可以取得更低的倒谱失真、更高的语音感知质量和目标倾向度;同时更接近传统基于对称语音库的高斯混合模型(Gaussian Mixture Model,GMM)的语音转换方法的效果。 In order to realize voice conversion using non-parallel corpus,an efficient voice conversion method based on model adaptation is proposed in the paper.Firstly,the source and target speaker models were trained from background model using Maximum a Posteriori (MAP) adaptation algorithm,respectively.Then,a conversion function was trained by using mean vectors of adapted speaker models,and in order to improve the conversion performance,the conversion function was combined with INCA conversion algorithm,and a model adaptation based INCA method was further presented.The proposed method could efficiently transform the spectral features from source speaker to target one.Subjective and objective experiments were carried out to evaluate the performance of the proposed method,the results demonstrate that the proposed method obtains lower cepstral distortion,higher perceptual quality and similarity than INCA method.Meanwhile,compared with INCA algorithm,the proposed method using non-parallel speech corpus can achieve more comparable performance to Gaussian Mixture Model (GMM) based voice conversion method using parallel speech corpus.
出处 《信号处理》 CSCD 北大核心 2013年第10期1294-1299,共6页 Journal of Signal Processing
基金 国家自然科学基金(面向非特定说话人的实用情感语音特征分析与识别的关键技术及应用研究 61273266 汉语数字助听器语音处理核心算法研究 60872073)
关键词 模型自适应 语音转换 非对称语音库 model adaptation voice conversion non-parallel speech corpus
  • 相关文献

参考文献13

  • 1Abe M, Nakamura S, Shikano K, et al. Voice conversionthrough vector quantization [ C ]. In: Acoustics, Speechand Signal Processing ( ICASSP),1988 IEEE Interna-tional Conference on. 1988 : 655-658.
  • 2Stylianou Y, Cappe 0,Moulines E. Continuous probabi-listic transform for voice conversion [ J]. IEEE Transac-tions on Speech and Audio Processing, 1998, 6(2) :131-142.
  • 3Kain A, Macon M W. Spectral voice conversion for text- to-speech synthesis [ C ]. In: Acoustics, Speech and Sig- nal Processing (ICASSP), 1998 IEEE International Con- ference on. 1998 : 285-288.
  • 4Toda T, Saruwatari H, Shikano K. Voice conversion al-gorithm based on Gaussian mixture model with dynamicfrequency warping of STRAIGHT spectrum [ C] . In: A-coustics, Speech and Signal Processing ( ICASSP) , 2001IEEE International Conference on. 2001 : 841-844.
  • 5Godoy E, Rosec 0,Chonavel T. Voice conversion usingdynamic frequency warping with amplitude scaling, forparallel or nonparallel corpora [ J]. Audio, Speech,andLanguage Processing, IEEE Transactions on, 2012: 20(4):1313-1323.
  • 6Qiao Y, Saito D, Minematsu N. HMM-based sequence-to-frame mapping for voice conversion [ C] . In: Acous-tics, Speech and Signal Processing ( ICASSP) , 2010IEEE International Conference on. 2010. 4830-4833.
  • 7Desai S, Black A, Yegnanarayana B, et al. Spectralmapping using artificial neural networks for voice conver-sion [J], Audio, Speech, and Language Processing,IEEE Transactions on, 2010, 18(5) :954-964.
  • 8Mouchtaris A, Van der Spiegel J, Mueller P. Nonparalleltraining for voice conversion based on a parameter adapta-tion approach [ J ] . Audio,Speech,and Language Pro-cessing, IEEE Transactions on, 2006, 14(3) :952-963.
  • 9Popa V,Silen H, Nurminen J, et al. Local linear trans-formation for voice conversion [ C ]. In: Acoustics,Speech and Signal Processing ( ICASSP),2012 IEEE In-ternational Conference on. 2012 ; 4517-4520.
  • 10徐小峰,俞一彪.基于说话人独立建模的语音转换系统研究[J].信号处理,2009, 25(8A) :171-174.

同被引文献6

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部