期刊文献+

基于情感基音模板的情感语音合成 被引量:4

Synthesis of emotional speech based on emotional pitch template
下载PDF
导出
摘要 为了合成能够模拟表达说话人的情感状态的语音,提出一种基于情感基音模板的情感语音合成方法。该方法分别建立高兴、愤怒、悲伤和中立4种不同情感下的韵母基音模板库,建立4种声调模型,统计分析语音库中情感语音的韵律特征参数,运用基音同步叠加算法(PSOLA)合成含情感色彩的语音。实验以音节为合成单位,根据情感特征参数的统计分析结果调节合成语音的韵律特征,合成各种情感的语音。仿真实验结果表明:用情感基音模板合成的目标情感语音具有目标情感的音质色彩,再通过韵律参数调节,可合成较理想的情感语音。该方法可用于增加语音合成系统的智能化,提高人机交互的能力。 In order to synthesize the speech which can express the speaker's emotional state,a method of emotional speech synthesis based on the emotional pitch template was presented.By the method,happy,angry,sad and neutral vowel pitch template libraries were established,and four kinds of tone model were also established,the prosody characteristic parameters of the emotional speech were analyzed,and pitch synchronous overlap algorithm(PSOLA) to synthesis speech with emotional colors was used.Using the syllable as the synthetic unit,the prosodic parameters of the synthetic speech were adjusted according to the statistical analysis of the prosodic parameters to synthesize various emotional speech.Simulation results show that with the same prosodic parameters,the emotional speech synthesized with the targeted emotional pitch template has the tone color of the targeted emotion.After the adjustment of prosodic parameters,the ideal emotional speech can be gotten.The method can be used to increase the intelligence of speech synthesis system and improve the capabilities of human-computer interaction.
出处 《中南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2010年第6期2258-2263,共6页 Journal of Central South University:Science and Technology
基金 国家自然科学基金资助项目(50275150) 高等学校博士学科点专项科研基金资助项目(20040533035)
关键词 情感语音合成 情感基音模板 基音同步叠加算法 韵律参数 emotional speech synthesis emotional pitch template pitch synchronous overlap algorithm(PSOLA) prosodic parameters
  • 相关文献

参考文献14

  • 1Cahn J E.The generation of affect in synthesized speech[J].Journal of the American Voice I/O Society,1990,8(1):1-19.
  • 2Burkhart F.Verification of acoustical correlates of emotional speech using formant synthesis[C] //Proceedings of the ISCA Workshop on Speech and Emotion.Northern Ireland,2000:151-156.
  • 3Moriyama T,Saito H,Ozawa S.Evaluation of relation between emotional concepts and emotional parameters in speech[J].Systems and Computers in Japan,2001,32(4):59-68.
  • 4Vine D S G,Sahandi R.Synthesis of emotional speech using RP-PSOLA[C] //IEEE Seminar State of the Art in Speech Synthesis Proceedings.London,2000:8/1-8/6.
  • 5Murray I R.Emotion in concatenated speech[C] //IEEE Seminar State of the Arts in Speech Synthesis Proceedings.London,2000:7/1-7/8.
  • 6邵艳秋,韩纪庆,王卓然,刘挺.韵律参数和频谱包络修改相结合的情感语音合成技术研究[J].信号处理,2007,23(4):526-530. 被引量:7
  • 7Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C] //Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
  • 8张立华,杨莹春.情感语音变化规律的特征分析[J].清华大学学报(自然科学版),2008,48(S1):652-657. 被引量:14
  • 9Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C]//Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
  • 10Hyun K H,Kim E H,Kwak Y K.Robust speech emotion recognition using log frequency power ratio[C] //SICE-ICASE International Joint Conference.Busan,2006:2586-2589.

二级参考文献21

  • 1高慧,苏广川,陈善广.情绪化语音特征分析与识别的研究进展[J].航天医学与医学工程,2004,17(5):386-390. 被引量:11
  • 2陈建厦,李翠华.语音情感识别的研究进展[J].计算机工程,2005,31(13):35-37. 被引量:8
  • 3蒋丹宁,蔡莲红.基于语音声学特征的情感信息识别[J].清华大学学报(自然科学版),2006,46(1):86-89. 被引量:38
  • 4韩纪庆,邵艳秋.基于语音信号的情感处理研究进展[J].电声技术,2006,30(5):58-62. 被引量:11
  • 5M. Schroder. Emotional speech synthesis: A review. In: Proceedings of the 7th European Conference on Speech Communication and Technology Eurospeech 2001, Aalborg, 2001:561-564.
  • 6J. E. Cahn. Generating expression in synthesized speech. Master' s thesis, Massachusetts Institute of Technology, 1989.
  • 7I. R. Murray, J. L. Arnott. Implementation and testing of a system for producing emotion-by-rule in synthetic speech. Speech Communication. 1995,16 : 369 - 390.
  • 8Iida A, Campbell N, Higuchi F, Yasumura M, A Corpusbased Speech Synthesis System with Emotion, Speech Communication, 2003, 40,161-187.
  • 9Iida A, Campbell N, A Speech Synthesis System with Emotion for Assisting Communication, In: Proceedings of ISCA Workshop (ITRW) on Speech and Emotion. Newcastle, Northern Ireland, 2000, 167 - 172.
  • 10E. Rank and H. Pirker, "Generating emotional speech with a concatenative synthesizer", in Proceedings, ICSLP '98, Sydney, Australia, 1998, 3:671-674.

共引文献19

同被引文献23

  • 1张立华,杨莹春.情感语音变化规律的特征分析[J].清华大学学报(自然科学版),2008,48(S1):652-657. 被引量:14
  • 2蒋丹宁,蔡莲红.基于语音声学特征的情感信息识别[J].清华大学学报(自然科学版),2006,46(1):86-89. 被引量:38
  • 3VINE D S G, SAHANDI R. Synthesis of emotional speech using RP- PSOLA [ C ]//IEEE Seminar State of the Art in Speech Synthesis Pro- ceedings. 2000.
  • 4BURKHART F. Verification of acoustical correlates of emotional speech using formant synthesis [ C ]//Proc of ISCA Workshop on Speech and Emotion. 2000 : 151 - 156.
  • 5HIROSE K,TAGO J, MINEMATSU N. Speech generation from con- cept for realizing conversation with an agent in a virtual room[ C~// Proc of the 8th European Conference on Speech Communication and Technology. 2003 : 1693-1696.
  • 6MORIYAMA T, SAITO H, OZAWA S. Evaluation of relation between emotional concepts and emotional parameters in speech[ J]. Systems and Computers in Japan,2001,32(4) :59-68.
  • 7REN Rui, MIAO Zhen-jiang. Emotional speech synthesis and its appli- cation to pervasive E-learning[ C ]//Proc of the 1 st IEEE International Conference on Ubi-Media Computing and Workshops. 2008:431-435.
  • 8HYUN K H,KIM E H, KWAK Y K. Robust speech emotion recogni- tion using log frequency power ratio[ C ]//Proc of SICE-ICASE Inter- national Joint Conference. 2006 : 2586-2589.
  • 9田韶东.昆曲旦角演唱的用嗓特点[J].南昌高专学报,2008,23(5):68-71. 被引量:3
  • 10曾一鸣,朱杰.基于规则的汉语情感语音系统的设计与实现[J].电子测量技术,2009,32(11):62-64. 被引量:3

引证文献4

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部