期刊文献+

人脸语音动画中基于PSOLA的情感语音合成系统

Emotional speech synthesis system based on PSOLA in facial speech animation
下载PDF
导出
摘要 提出一种基于时域基音同步叠加TD-PSOLA算法的情感语音合成系统。根据情感语音库分析总结情感规则,在此基础上利用TD-PSOLA算法对中性语音的韵律参数进行改变,并提出一种能够对基频曲线尾部形状改变的方法,使句子表达出丰富的情感。实验表明,合成出的语音具有明显的情感色彩,证明了该系统能以简单明了的方式实现情感语音的合成,有助于提高人脸语音动画表达的丰富性和生动性。 This paper proposed a emotional speech synthesis system based on pitch synchronous overlap-add(PSOLA).Prosodic parameters could be changed in this system freely.First,analyzing pre-recorded emotional speech samples it concluded some acoustic features associated closely with happiness,angry,surprise and sadness.Then it used TD(time domain)-PSOLA algorithm to change the speech prosodic parameters of neutral speeches.Especially,it proposed a approach to change the F0 contour.Experiments demonstrates that the system is effective,which helps to express the facial speech animation more vivi-dly.
作者 王华 樊养余
出处 《计算机应用研究》 CSCD 北大核心 2012年第3期1002-1004,共3页 Application Research of Computers
关键词 人脸语音动画 时域基音同步叠加 韵律参数 基频曲线 情感语音合成 facial speech animation TD-PSOLA prosodic parameters F0 contour emotional speech synthesis
  • 相关文献

参考文献8

  • 1VINE D S G, SAHANDI R. Synthesis of emotional speech using RP- PSOLA [ C ]//IEEE Seminar State of the Art in Speech Synthesis Pro- ceedings. 2000.
  • 2BURKHART F. Verification of acoustical correlates of emotional speech using formant synthesis [ C ]//Proc of ISCA Workshop on Speech and Emotion. 2000 : 151 - 156.
  • 3HIROSE K,TAGO J, MINEMATSU N. Speech generation from con- cept for realizing conversation with an agent in a virtual room[ C~// Proc of the 8th European Conference on Speech Communication and Technology. 2003 : 1693-1696.
  • 4MORIYAMA T, SAITO H, OZAWA S. Evaluation of relation between emotional concepts and emotional parameters in speech[ J]. Systems and Computers in Japan,2001,32(4) :59-68.
  • 5REN Rui, MIAO Zhen-jiang. Emotional speech synthesis and its appli- cation to pervasive E-learning[ C ]//Proc of the 1 st IEEE International Conference on Ubi-Media Computing and Workshops. 2008:431-435.
  • 6赵力,钱向民,邹采荣,吴镇扬.语音信号中的情感识别研究[J].软件学报,2001,12(7):1050-1055. 被引量:56
  • 7HYUN K H,KIM E H, KWAK Y K. Robust speech emotion recogni- tion using log frequency power ratio[ C ]//Proc of SICE-ICASE Inter- national Joint Conference. 2006 : 2586-2589.
  • 8陈明义,党培霞.基于情感基音模板的情感语音合成[J].中南大学学报(自然科学版),2010,41(6):2258-2263. 被引量:4

二级参考文献21

  • 1张立华,杨莹春.情感语音变化规律的特征分析[J].清华大学学报(自然科学版),2008,48(S1):652-657. 被引量:14
  • 2周迪伟 高东杰(译).计算机语音处理[M].国防工业出版社,1987..
  • 3唐守正.多元统计方法[M].北京:中国林业出版社,1987..
  • 4王学仁 王松桂.实用多元统计分析[M].上海:上海科学技术出版社,1995.150-187.
  • 5Vine D S G,Sahandi R.Synthesis of emotional speech using RP-PSOLA[C] //IEEE Seminar State of the Art in Speech Synthesis Proceedings.London,2000:8/1-8/6.
  • 6Murray I R.Emotion in concatenated speech[C] //IEEE Seminar State of the Arts in Speech Synthesis Proceedings.London,2000:7/1-7/8.
  • 7Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C] //Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
  • 8Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C]//Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
  • 9Hyun K H,Kim E H,Kwak Y K.Robust speech emotion recognition using log frequency power ratio[C] //SICE-ICASE International Joint Conference.Busan,2006:2586-2589.
  • 10GAO Hui,CHEN Shan-guang.Emotion classification of infant voice based on features derived from teenager energy operator[C] //IEEE Congress on Image and Signal Processing.Sanya,China,2008:333-337.

共引文献58

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部