期刊文献+

基于基音同步的时频域插值的汉语语音合成

Mandarin speech synthesis based on pitch synchronous time-frequency interpolation
下载PDF
导出
摘要 针对TD-PSOLA韵律调整能力的不足,将基于基音同步的时频域插值(TFI)方法应用于汉语语音合成中,该方法能够保证基频调整和时长的调整不会相互影响.为了提高计算精度,在频谱的插值计算中还引入了差商型插值方法.实验结果表明,采用差商型插值的TFI方法能取得比较好的合成效果. The ability of wide range of prosody adjustment will be helpful to improve the naturalness and expressiveness of synthesized speech. The method of pitch synchronous time-frequency interpolation (TFI) is applied to Mandarin speech synthesis. Compared with TD-PSOLA, TFI can adjust the pitch without affecting the duration. The difference quotient interpolation is also used to improve the accuracy of the interpolation cal-culation, which shows satisfied quality of synthesized speech.
出处 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2007年第1期110-113,共4页 Journal of Harbin Institute of Technology
关键词 语音合成 基音同步 韵律调整 时频域插值 差商型插值 speech synthesis pitch synchronous prosody adjustment time-frequency interpolation difference quotient interpolation
  • 相关文献

参考文献5

  • 1HUANG Xuedong,ACERO A,HON Hsiao-Wuen,et al.Spoken Language Processing:A Guide to Theory,Algorithm and System Development[M].[s.l.]:Prentice Hall PTR,2001:800 -815.
  • 2MORAIS E S,VIOLARO F,BARBOSA P.Prosodic Speech Modifications Using Pitch -Synchronous Time-Frequency Interpolation[C]//Proceedings of the Internaional Telecommunication Symposium.SP,Brazil:S(a)o Paulo,1998:225-230.
  • 3KLEIJN W B.Encoding speech using prototype waveforms[J].IEEE Transactions on Speech and Audio Processing.1993,1(4):386 -399.
  • 4MORALS E S,TAYLOR P,VIOLARO F.Concatenative Text-To-Speech Synthesis Based on Prototype Waveform Interpolation[C]// Proceedings of ICSLP.Beijing:International Speech Communication Association,2000:387-391.
  • 5黄有谦,李岳生.数值逼近[M].北京:高等教育出版社,1987:27-38.

共引文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部