期刊文献+

基于STRAIGHT模型的语音转换系统研究

Research on Speech Conversion System Based on STRAIGHT Model
下载PDF
导出
摘要 语音转换是将源说话人的个性特征转换为目标说话人个性特征的过程。主要研究了基于STRAIGHT模型的语音转换系统原理及实现过程。通过STRAIGHT模型提取目标语音和源语音的基本频率以及平滑的声道频谱作为特征参数,并将声道频谱转换为LSF参数,进行时间对齐和GMM训练。从实验结果数据分析可以看出:由STRAIGHT模型提取的参数很好地避免了声道谱过平滑的现象,合成后的目标语音与源语音的相似度较高。 Speech conversion is the process of transforming the personality characteristics of the source speaker into the personality characteristics of the target speaker.This paper mainly studies the principle and implementation process of speech conversion system based on STRAIGHT model.The STRAIGHT model is used to extract the basic frequency and smooth channel spectrum of target and source speech as feature parameters,and the channel spectrum is converted into LSF parameters for time alignment and GMM training.The data analysis of the experimental results shows that the parameters extracted by the STRAIGHT model can avoid the phenomenon of too smooth channel spectrum,and the synthesized target speech has a high similarity with the source speech.
作者 祝琼珂 王光艳 江淇 罗雨章 ZHU Qiongke;WANG Guangyan;JIANG Qi;LUO Yuzhang
出处 《山西科技》 2020年第5期60-66,共7页 Shanxi Science and Technology
基金 国家级大学生创新创业训练计划项目(项目编号:201810069005)。
关键词 语音转换 STRAIGHT模型 GMM LSF参数 speech conversion STRAIGHT model GMM LSF parameter
  • 相关文献

参考文献4

二级参考文献27

  • 1Abe M, Nakamura S, Shikano K, et al. Voice conversionthrough vector quantization [ C ]. In: Acoustics, Speechand Signal Processing ( ICASSP),1988 IEEE Interna-tional Conference on. 1988 : 655-658.
  • 2Stylianou Y, Cappe 0,Moulines E. Continuous probabi-listic transform for voice conversion [ J]. IEEE Transac-tions on Speech and Audio Processing, 1998, 6(2) :131-142.
  • 3Kain A, Macon M W. Spectral voice conversion for text- to-speech synthesis [ C ]. In: Acoustics, Speech and Sig- nal Processing (ICASSP), 1998 IEEE International Con- ference on. 1998 : 285-288.
  • 4Toda T, Saruwatari H, Shikano K. Voice conversion al-gorithm based on Gaussian mixture model with dynamicfrequency warping of STRAIGHT spectrum [ C] . In: A-coustics, Speech and Signal Processing ( ICASSP) , 2001IEEE International Conference on. 2001 : 841-844.
  • 5Godoy E, Rosec 0,Chonavel T. Voice conversion usingdynamic frequency warping with amplitude scaling, forparallel or nonparallel corpora [ J]. Audio, Speech,andLanguage Processing, IEEE Transactions on, 2012: 20(4):1313-1323.
  • 6Qiao Y, Saito D, Minematsu N. HMM-based sequence-to-frame mapping for voice conversion [ C] . In: Acous-tics, Speech and Signal Processing ( ICASSP) , 2010IEEE International Conference on. 2010. 4830-4833.
  • 7Desai S, Black A, Yegnanarayana B, et al. Spectralmapping using artificial neural networks for voice conver-sion [J], Audio, Speech, and Language Processing,IEEE Transactions on, 2010, 18(5) :954-964.
  • 8Mouchtaris A, Van der Spiegel J, Mueller P. Nonparalleltraining for voice conversion based on a parameter adapta-tion approach [ J ] . Audio,Speech,and Language Pro-cessing, IEEE Transactions on, 2006, 14(3) :952-963.
  • 9Popa V,Silen H, Nurminen J, et al. Local linear trans-formation for voice conversion [ C ]. In: Acoustics,Speech and Signal Processing ( ICASSP),2012 IEEE In-ternational Conference on. 2012 ; 4517-4520.
  • 10徐小峰,俞一彪.基于说话人独立建模的语音转换系统研究[J].信号处理,2009, 25(8A) :171-174.

共引文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部