Control Emotion Intensity for LSTM-Based Expressive Speech Synthesis

下载PDF

导出

摘要 To improve the performance of human-computer interaction interfaces, emotion is considered to be one of the most important factors. The major objective of expressive speech synthesis is to inject various expressions reflecting different emotions to the synthesized speech. To effectively model and control the emotion, emotion intensity is introduced for expressive speech synthesis model to generate speech conveyed the delicate and complicate emotional states. The system was composed of an emotion analysis module with the goal of extracting control emotion intensity vector and a speech synthesis module responsible for mapping text characters to speech waveform. The proposed continuous variable “perception vector” is a data-driven approach of controlling the model to synthesize speech with different emotion intensities. Compared with the system using a one-hot vector to control emotion intensity, this model using perception vector is able to learn the high-level emotion information from low-level acoustic features. In terms of the model controllability and flexibility, both the objective and subjective evaluations demonstrate perception vector outperforms one-hot vector.

作者 Xiaolian Zhu Liumeng Xue

机构地区 School of Computer Science Public Computer Education Center

出处《国际计算机前沿大会会议论文集》 2019年第2期654-656,共3页 International Conference of Pioneering Computer Scientists, Engineers and Educators（ICPCSEE）

基金 the results of the research project funded by Natural Science Foundation of Hebei University of Economics and Business (No. 2016KYQ05).

关键词 EMOTION INTENSITY Expressive SPEECH synthesis CONTROLLABLE TEXT-TO-SPEECH NEURAL networks

分类号 C [社会学]

引文网络
相关文献

1LI Aijun,CAO Mengxue,FANG Qiang,HU Fang,DANG Jianwu.ACOUSTIC AND ARTICULATORY ANALYSIS ON CHINESE AND JAPANESE VOWELS IN EMOTIONAL SPEECH[J].中国语音学报,2013(1):125-142. 被引量：1
2Haiyan Xu,Yuren You,Hongwu Yang.Donggan Speech Recognition Based on Convolution Neural Networks[J].国际计算机前沿大会会议论文集,2019(1):583-584.
3周天舒.The Oppression of Women in Feudal Society-Analysis of Raise the Red Lantern[J].校园英语,2019(36):241-242.
4Xin Liu,Lun Xie,Zhiliang Wang.Empathizing with Emotional Robot Based on Cognition Reappraisal[J].China Communications,2017,14(9):100-113. 被引量：3
5刘俊利.基于TensorFlow的Q-Learning算法研究与实现[J].现代计算机,2019,0(29):26-28. 被引量：1
6Cultural Exchange Along the Silk Road:Masterpieces of the Tubo Period[J].China & The World Cultural Exchange,2019,85(8):12-16.
7郑昊冉.The impact of the development of Guzheng on people[J].校园英语,2019(39):249-250.
8Wanxia Huang,Xiyue Zhang,Qianjin Wang,Maosheng Wang,Chaogang Li,Kuanguo Li,Xinyan Yang,Jianping Shi.Controllability of surface plasmon polariton far-field radiation using a metasurface[J].Photonics Research,2019,7(7):43-48.
9陶新民,李晨曦,李青,任超,刘锐,邹俊荣.不均衡最大软间隔SVDD轴承故障检测模型[J].振动工程学报,2019,32(4):718-729. 被引量：7
10王晶,傅松波,郭潇,范力.红景天乳膏联合木丹颗粒治疗2型糖尿病周围神经病变临床研究[J].新中医,2019,0(10):143-147. 被引量：4

国际计算机前沿大会会议论文集

2019年第2期

浏览历史

内容加载中请稍等...

Control Emotion Intensity for LSTM-Based Expressive Speech Synthesis

相关作者

相关机构

相关主题

浏览历史