期刊文献+

基于高斯混合模型和残差预测的说话人转换系统 被引量:4

A voice conversion system based on GMM and residual prediction
下载PDF
导出
摘要 说话人转换是将源说话人的语音特征转换成目标说话人的特征,使得听起来像是目标说话人的语音。提出的说话人转换系统分为2个部分,第一部分利用高斯混合模型进行谱包络的转换,训练采用时间对齐的源说话人和目标说话人的语音数据进行。第二部分基于一个分类器和残差码本对残差信号预测。该系统在现有的说话人转换系统的基础上做了一些改进,改进后不再需要说话人模仿别人的语调,并且在某些性能上超过了现有的系统。 Voice conversion is the process of transforming the characteristics of speech uttered by a source speaker, such that a listener would believe that the speech was uttered by a target speaker. In this paper, the system is divided into two main parts. By using a Gaussian mixture model, which is trained on aligned speech from source and target speakers, the first part transforms the spectral envelope. The second part of the system predicts the spectral detail from the transformed LPC parameters, which is based on a classifier and residual codebooks. The system has some similarities with some existing systems, however, this system is not restricted to speech spoken in a monotone and with mimicked prosody. Also, on the basis of some performance metrics it outperforms existing systems.
出处 《电声技术》 北大核心 2004年第6期33-36,共4页 Audio Engineering
关键词 说话人转换 高斯混合模型 残差预测 谱包络 voice conversion Gaussian mixture model residual prediction
  • 相关文献

参考文献4

  • 1Kain A., Macon M.W. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. In IEEE International Conference on Acoustics, Speech, and Signal Processing, Proceedings,2001,2:813-816.
  • 2Arslan L. Speaker transformation algorithm using segment codebook. Speech Communication Journal. 1999, 28:211-226.
  • 3Y. Stylianou, O. Cappe, E. Moulines. Statistical method for voice quality transformation. In Proc. EUROSPPECH, 1995.
  • 4Y. Stylianou, O. Cappe, E. Moulines. Continuous probabilistic transform for voice conversion In IEEE Transaction on speech and audio processing, 1998,6 (2):131-142.

同被引文献17

  • 1康永国,双志伟,陶建华,张维.基于混合映射模型的语音转换算法研究[J].声学学报,2006,31(6):555-562. 被引量:13
  • 2张凯 朱立新 赵义正.改进的基于高斯混合模型的语音转换方法研究.声学技术,2008,27(3):392-397.
  • 3Yannis Stylianou, Olivier Cappe, Eric Moulines. Continuous probabilistic transform for voice conversion[J]. Transactions on Speech and Audio Processing, 1998, 6(2): 131-142.
  • 4Kain. High resoulation voice transformation[D]. Computer Science and Mathematics, Rockford College, 1995, 47-52.
  • 5QIN Long, CHEN Gaopeng, LING Zhenghua. An improved spectral and prosodic transformation methed in STRAIGHT-based voice conversion[A]. ICASSP[C]. 2005, 21-24.
  • 6Toda T,Alan W B,Kellchi.Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter.Proceedings of ICASSP2005,2005,1:9-12.
  • 7Chen Yining,Chu Min,Chang E,et al.Voice conversion with smoothed GMM and MAP adaptation.Proc Eurospeech Geneva,Switzerland:ISCA,Sept,2003:2413-2416.
  • 8Toda T.High-quality and flexible speech synthesis with segment selection and voice conversion.Graduate School of Information Science,Nara Institute of Science and Technology,2003.
  • 9Stylianou Y, Cappe O, Moulines E. Continuous Probabilistic Transformation for Voice Conversion. Speech and Audio Processing IEEE, 1998, (6) : 131-142.
  • 10Abe M. A Segment-based Approach to Voice Conversion. Proc IEEE ICASSP, 1991,(2):765-768.

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部