
Voice Conversion Using Phoneme-Dependent HMMs
Abstract: This paper presents a voice conversion method that transforms the characteristic features of a source speaker toward those of a target speaker. Voice characteristic features are grouped into two main categories: (1) the spectral features at formants; (2) the pitch and intonation patterns. The signal modeling and transformation methods for each group are outlined. The spectral features at formants are modeled with a set of two-dimensional phoneme-dependent HMMs. The F0 contour is used to model the pitch and intonation patterns of speech. A PSOLA-based (pitch-synchronous overlap-add) method is employed to transform the pitch period, intonation patterns, and speaking rate.
Author: QIAN Kai-hua (钱开华), Signal and Information Processing, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
Source: Computer Knowledge and Technology (《电脑知识与技术》), 2008, No. 4, pp. 132-134 (3 pages)
Keywords: voice conversion; speech spectrum; pitch contour; glottal excitation
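
The abstract names a PSOLA-based method for changing pitch but gives no implementation detail. The following is only a minimal time-domain sketch of the general overlap-add idea, not the paper's algorithm. It assumes the pitch marks are already known, uses a Hann window two periods long, and changes pitch while keeping duration roughly constant. The names `hann` and `psola_pitch_shift`, the `factor` parameter, and the synthetic pulse signal are hypothetical and chosen for illustration; speaking-rate change and the HMM-based spectral conversion are not covered.

```python
import math

def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]

def psola_pitch_shift(signal, marks, factor):
    """Overlap-add two-period grains at re-spaced pitch marks.

    signal: list of float samples.
    marks:  sorted analysis pitch marks (sample indices), at least two.
    factor: > 1 shortens the synthesis period (raises pitch),
            < 1 lengthens it (lowers pitch). Assumed parameter.
    """
    out = [0.0] * len(signal)
    norm = [0.0] * len(signal)  # summed window weights, for normalisation
    t = float(marks[0])
    while t <= marks[-1]:
        # Pick the analysis mark closest to the current synthesis time.
        k = min(range(len(marks)), key=lambda i: abs(marks[i] - t))
        # Estimate the local pitch period from neighbouring marks.
        if k + 1 < len(marks):
            period = marks[k + 1] - marks[k]
        else:
            period = marks[k] - marks[k - 1]
        win = hann(2 * period)
        centre = int(round(t))
        # Copy a windowed grain centred on the analysis mark
        # to the synthesis position.
        for j in range(2 * period):
            src = marks[k] - period + j
            dst = centre - period + j
            if 0 <= src < len(signal) and 0 <= dst < len(out):
                out[dst] += win[j] * signal[src]
                norm[dst] += win[j]
        # Advance by the modified pitch period.
        t += period / factor
    return [o / w if w > 1e-8 else 0.0 for o, w in zip(out, norm)]

# Synthetic voiced-like signal: one decaying pulse every 100 samples.
signal = [math.exp(-(n % 100) / 5.0) for n in range(1000)]
marks = list(range(100, 1000, 100))
shifted = psola_pitch_shift(signal, marks, 2.0)  # pulses now every 50 samples
```

Grains are normalised by the summed window weights, so with `factor = 1.0` the interior of the signal is reproduced unchanged. A real system would also need pitch-mark detection and handling of unvoiced segments.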
