基于情感基音模板的情感语音合成被引量：4

Synthesis of emotional speech based on emotional pitch template

下载PDF

导出

摘要为了合成能够模拟表达说话人的情感状态的语音,提出一种基于情感基音模板的情感语音合成方法。该方法分别建立高兴、愤怒、悲伤和中立4种不同情感下的韵母基音模板库,建立4种声调模型,统计分析语音库中情感语音的韵律特征参数,运用基音同步叠加算法(PSOLA)合成含情感色彩的语音。实验以音节为合成单位,根据情感特征参数的统计分析结果调节合成语音的韵律特征,合成各种情感的语音。仿真实验结果表明:用情感基音模板合成的目标情感语音具有目标情感的音质色彩,再通过韵律参数调节,可合成较理想的情感语音。该方法可用于增加语音合成系统的智能化,提高人机交互的能力。 In order to synthesize the speech which can express the speaker＇s emotional state,a method of emotional speech synthesis based on the emotional pitch template was presented.By the method,happy,angry,sad and neutral vowel pitch template libraries were established,and four kinds of tone model were also established,the prosody characteristic parameters of the emotional speech were analyzed,and pitch synchronous overlap algorithm（PSOLA） to synthesis speech with emotional colors was used.Using the syllable as the synthetic unit,the prosodic parameters of the synthetic speech were adjusted according to the statistical analysis of the prosodic parameters to synthesize various emotional speech.Simulation results show that with the same prosodic parameters,the emotional speech synthesized with the targeted emotional pitch template has the tone color of the targeted emotion.After the adjustment of prosodic parameters,the ideal emotional speech can be gotten.The method can be used to increase the intelligence of speech synthesis system and improve the capabilities of human-computer interaction.

作者陈明义党培霞

机构地区中南大学信息科学与工程学院

出处《中南大学学报（自然科学版）》 EI CAS CSCD 北大核心 2010年第6期2258-2263,共6页 Journal of Central South University:Science and Technology

基金国家自然科学基金资助项目(50275150) 高等学校博士学科点专项科研基金资助项目(20040533035)

关键词情感语音合成情感基音模板基音同步叠加算法韵律参数 emotional speech synthesis emotional pitch template pitch synchronous overlap algorithm（PSOLA） prosodic parameters

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献14

1Cahn J E.The generation of affect in synthesized speech[J].Journal of the American Voice I/O Society,1990,8(1):1-19.
2Burkhart F.Verification of acoustical correlates of emotional speech using formant synthesis[C] //Proceedings of the ISCA Workshop on Speech and Emotion.Northern Ireland,2000:151-156.
3Moriyama T,Saito H,Ozawa S.Evaluation of relation between emotional concepts and emotional parameters in speech[J].Systems and Computers in Japan,2001,32(4):59-68.
4Vine D S G,Sahandi R.Synthesis of emotional speech using RP-PSOLA[C] //IEEE Seminar State of the Art in Speech Synthesis Proceedings.London,2000:8/1-8/6.
5Murray I R.Emotion in concatenated speech[C] //IEEE Seminar State of the Arts in Speech Synthesis Proceedings.London,2000:7/1-7/8.
6邵艳秋,韩纪庆,王卓然,刘挺.韵律参数和频谱包络修改相结合的情感语音合成技术研究[J].信号处理,2007,23(4):526-530. 被引量：7
7Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C] //Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
8张立华,杨莹春.情感语音变化规律的特征分析[J].清华大学学报（自然科学版）,2008,48(S1):652-657. 被引量：14
9Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C]//Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
10Hyun K H,Kim E H,Kwak Y K.Robust speech emotion recognition using log frequency power ratio[C] //SICE-ICASE International Joint Conference.Busan,2006:2586-2589.

二级参考文献21

1高慧,苏广川,陈善广.情绪化语音特征分析与识别的研究进展[J].航天医学与医学工程,2004,17(5):386-390. 被引量：11
2陈建厦,李翠华.语音情感识别的研究进展[J].计算机工程,2005,31(13):35-37. 被引量：8
3蒋丹宁,蔡莲红.基于语音声学特征的情感信息识别[J].清华大学学报（自然科学版）,2006,46(1):86-89. 被引量：38
4韩纪庆,邵艳秋.基于语音信号的情感处理研究进展[J].电声技术,2006,30(5):58-62. 被引量：11
5M. Schroder. Emotional speech synthesis: A review. In: Proceedings of the 7th European Conference on Speech Communication and Technology Eurospeech 2001, Aalborg, 2001:561-564.
6J. E. Cahn. Generating expression in synthesized speech. Master' s thesis, Massachusetts Institute of Technology, 1989.
7I. R. Murray, J. L. Arnott. Implementation and testing of a system for producing emotion-by-rule in synthetic speech. Speech Communication. 1995,16 : 369 - 390.
8Iida A, Campbell N, Higuchi F, Yasumura M, A Corpusbased Speech Synthesis System with Emotion, Speech Communication, 2003, 40,161-187.
9Iida A, Campbell N, A Speech Synthesis System with Emotion for Assisting Communication, In: Proceedings of ISCA Workshop (ITRW) on Speech and Emotion. Newcastle, Northern Ireland, 2000, 167 - 172.
10E. Rank and H. Pirker, "Generating emotional speech with a concatenative synthesizer", in Proceedings, ICSLP '98, Sydney, Australia, 1998, 3:671-674.

共引文献19

1董理,谈笑.昆剧小生行当情感念白声学研究[J].中国语音学报,2021(2):52-60.
2张高媛,王韫佳,黄靖雯.声学线索掩蔽下普通话情感语音的听辨研究[J].中国语音学报,2020(1):14-23.
3徐露,徐明星,杨大利.面向情感变化检测的汉语情感语音数据库[J].清华大学学报（自然科学版）,2009(S1):1413-1418. 被引量：6
4陈明义,许玲玲,陈宁.基于高斯混合模型的情感LPC系数的研究[J].中南大学学报（自然科学版）,2013,44(9):3701-3706.
5孙凯,于俊清.面向观众的个性化电影情感内容表示与识别[J].计算机辅助设计与图形学学报,2010,22(1):136-144. 被引量：3
6韩文静,李海峰,王朝友.语音情感信息可视化建模研究与探析[J].燕山大学学报,2010,34(2):128-132.
7李东风,郑桂敏.潮州方言单字调的实验研究[J].辽宁师范大学学报（自然科学版）,2010,33(3):369-373. 被引量：3
8汪成亮,张玉维.基于共振峰合成和韵律调整的语音验证码方法研究[J].计算机应用研究,2011,28(7):2458-2461. 被引量：4
9任蕊,苗振江.基于PSOLA算法的情感语音合成[J].系统仿真学报,2008,20(S1):423-426. 被引量：2
10何凌,黄华,刘肖珩.基于韵律特征参数的情感语音合成算法研究[J].计算机工程与设计,2013,34(7):2566-2569. 被引量：8

同被引文献23

1张立华,杨莹春.情感语音变化规律的特征分析[J].清华大学学报（自然科学版）,2008,48(S1):652-657. 被引量：14
2蒋丹宁,蔡莲红.基于语音声学特征的情感信息识别[J].清华大学学报（自然科学版）,2006,46(1):86-89. 被引量：38
3VINE D S G, SAHANDI R. Synthesis of emotional speech using RP- PSOLA [ C ]//IEEE Seminar State of the Art in Speech Synthesis Pro- ceedings. 2000.
4BURKHART F. Verification of acoustical correlates of emotional speech using formant synthesis [ C ]//Proc of ISCA Workshop on Speech and Emotion. 2000 : 151 - 156.
5HIROSE K,TAGO J, MINEMATSU N. Speech generation from con- cept for realizing conversation with an agent in a virtual room[ C~// Proc of the 8th European Conference on Speech Communication and Technology. 2003 : 1693-1696.
6MORIYAMA T, SAITO H, OZAWA S. Evaluation of relation between emotional concepts and emotional parameters in speech[ J]. Systems and Computers in Japan,2001,32(4) :59-68.
7REN Rui, MIAO Zhen-jiang. Emotional speech synthesis and its appli- cation to pervasive E-learning[ C ]//Proc of the 1 st IEEE International Conference on Ubi-Media Computing and Workshops. 2008:431-435.
8HYUN K H,KIM E H, KWAK Y K. Robust speech emotion recogni- tion using log frequency power ratio[ C ]//Proc of SICE-ICASE Inter- national Joint Conference. 2006 : 2586-2589.
9田韶东.昆曲旦角演唱的用嗓特点[J].南昌高专学报,2008,23(5):68-71. 被引量：3
10曾一鸣,朱杰.基于规则的汉语情感语音系统的设计与实现[J].电子测量技术,2009,32(11):62-64. 被引量：3

引证文献4

1王华,樊养余.人脸语音动画中基于PSOLA的情感语音合成系统[J].计算机应用研究,2012,29(3):1002-1004.
2董理,梁晓静,黄慧怡.昆曲女性行当情感念白时长特征[J].语言学论丛,2021(2):272-290.
3张昕,胡航烨,曹欣怡,王蔚.基于Tacotron模型和韵律修正的情感语音合成方法[J].数据采集与处理,2022,37(4):909-916. 被引量：2
4王智,刘银华.基于深度学习的中文情感语音合成方法[J].自动化与仪器仪表,2022(9):10-15. 被引量：5

二级引证文献7

1王雨佳.基于语音合成的机器翻译机器人设计[J].自动化与仪器仪表,2023(4):185-190. 被引量：1
2房小绵.基于语音识别的英语智能对话机器人人机交互系统设计[J].自动化与仪器仪表,2023(4):225-228. 被引量：7
3何娟.基于深度学习网络的手写英文自动化识别模型在机器英汉互译中的应用研究[J].自动化与仪器仪表,2023(7):191-195.
4付志霞,王然然,刘伟,万国睿.铁路售票厅排队叫号系统设计与实现[J].铁路计算机应用,2024,33(2):53-56. 被引量：1
5高盛祥,杨元樟,王琳钦,莫尚斌,余正涛,董凌.面向域外说话人适应场景的多层级解耦个性化语音合成[J].广西师范大学学报（自然科学版）,2024,42(4):11-21.
6张凌益,朱甦,徐一通,汪燕,余昀锴,周浩辉,周磊.一种基于Jetson Nano的智能辅助导盲装置设计[J].物联网技术,2024,14(10):70-72.
7李瞳.基于虚拟仿真技术的配音技术研究[J].自动化与仪器仪表,2024(9):285-288.

1何凌,黄华,刘肖珩.基于韵律特征参数的情感语音合成算法研究[J].计算机工程与设计,2013,34(7):2566-2569. 被引量：8
2王敬华,刘建银,张国燕,赵新想.情感语音合成中韵律参数的基频研究[J].小型微型计算机系统,2013,34(9):2047-2050. 被引量：2
3李虎孬,赵晖.情感语音合成综述[J].现代计算机（中旬刊）,2014(7):31-37. 被引量：1
4陈坚红,李蔚,盛德仁,任浩仁.火电厂语音报警系统中的动态文语转换方法[J].浙江大学学报（工学版）,2007,41(12):1997-2001. 被引量：4
5林时来,刘光远,张慧玲.蚁群算法在呼吸信号情感识别中的应用研究[J].计算机工程与应用,2011,47(2):169-172. 被引量：5
6王秀,谢志成,张栋.一种基于特征差异度和SVM投票机制的数字音乐语音情感识别算法[J].福州大学学报（自然科学版）,2015,43(4):460-465. 被引量：2
7李杰,周萍.语音情感识别中特征参数的研究进展[J].传感器与微系统,2012,31(2):4-7. 被引量：2
8邵艳秋,穗志方,韩纪庆,王志伟.小规模情感数据和大规模中性数据相结合的情感韵律建模研究[J].计算机研究与发展,2007,44(9):1624-1631.
9鲁小勇,潘涛,高兰德.基于广义回归神经网络的情感语音韵律特征预测[J].自动化与仪器仪表,2015(2):145-146.
10辛贤龙.结合情感信息的个性化推荐算法[J].微型电脑应用,2014,30(4):38-40.

中南大学学报（自然科学版）

2010年第6期

浏览历史

内容加载中请稍等...

基于情感基音模板的情感语音合成被引量：4

参考文献14

二级参考文献21

共引文献19

同被引文献23

引证文献4

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

基于情感基音模板的情感语音合成 被引量：4

参考文献14

二级参考文献21

共引文献19

同被引文献23

引证文献4

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

基于情感基音模板的情感语音合成被引量：4