Chinese Emotional Speech Synthesis Method Based on Deep Learning (基于深度学习的中文情感语音合成方法)

Cited by: 5
Abstract: Speech synthesis is an integral part of human-computer interaction and closes the loop in that process. When speaking, people convey emotional states such as happiness, sadness, and anger, which existing speech synthesis models do not fully reflect. To synthesize Chinese speech with emotional characteristics, this paper proposes a Chinese emotional speech synthesis method that uses an emotional speech corpus to optimize model training, adds a Chinese processing model, and corrects speech parameters to increase the emotional expressiveness of the synthesized speech. The results show that the mature model can synthesize high-quality Chinese speech in which emotion is effectively conveyed.
Authors: WANG Zhi (王智); LIU Yinhua (刘银华). Institute of Future, School of Automation, Qingdao University, Qingdao, Shandong 266071, China
Source: Automation & Instrumentation (《自动化与仪器仪表》), 2022, No. 9, pp. 10-15 (6 pages)
Keywords: speech synthesis; emotion; deep learning; neural networks
