人脸语音动画中基于PSOLA的情感语音合成系统

Emotional speech synthesis system based on PSOLA in facial speech animation

下载PDF

导出

摘要提出一种基于时域基音同步叠加TD-PSOLA算法的情感语音合成系统。根据情感语音库分析总结情感规则,在此基础上利用TD-PSOLA算法对中性语音的韵律参数进行改变,并提出一种能够对基频曲线尾部形状改变的方法,使句子表达出丰富的情感。实验表明,合成出的语音具有明显的情感色彩,证明了该系统能以简单明了的方式实现情感语音的合成,有助于提高人脸语音动画表达的丰富性和生动性。 This paper proposed a emotional speech synthesis system based on pitch synchronous overlap-add（PSOLA）.Prosodic parameters could be changed in this system freely.First,analyzing pre-recorded emotional speech samples it concluded some acoustic features associated closely with happiness,angry,surprise and sadness.Then it used TD（time domain）-PSOLA algorithm to change the speech prosodic parameters of neutral speeches.Especially,it proposed a approach to change the F0 contour.Experiments demonstrates that the system is effective,which helps to express the facial speech animation more vivi-dly.

作者王华樊养余

机构地区西北工业大学电子信息学院

出处《计算机应用研究》 CSCD 北大核心 2012年第3期1002-1004,共3页 Application Research of Computers

关键词人脸语音动画时域基音同步叠加韵律参数基频曲线情感语音合成 facial speech animation TD-PSOLA prosodic parameters F0 contour emotional speech synthesis

分类号 TN912.33 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献8

1VINE D S G, SAHANDI R. Synthesis of emotional speech using RP- PSOLA [ C ]//IEEE Seminar State of the Art in Speech Synthesis Pro- ceedings. 2000.
2BURKHART F. Verification of acoustical correlates of emotional speech using formant synthesis [ C ]//Proc of ISCA Workshop on Speech and Emotion. 2000 : 151 - 156.
3HIROSE K,TAGO J, MINEMATSU N. Speech generation from con- cept for realizing conversation with an agent in a virtual room[ C~// Proc of the 8th European Conference on Speech Communication and Technology. 2003 : 1693-1696.
4MORIYAMA T, SAITO H, OZAWA S. Evaluation of relation between emotional concepts and emotional parameters in speech[ J]. Systems and Computers in Japan,2001,32(4) :59-68.
5REN Rui, MIAO Zhen-jiang. Emotional speech synthesis and its appli- cation to pervasive E-learning[ C ]//Proc of the 1 st IEEE International Conference on Ubi-Media Computing and Workshops. 2008:431-435.
6赵力,钱向民,邹采荣,吴镇扬.语音信号中的情感识别研究[J].软件学报,2001,12(7):1050-1055. 被引量：56
7HYUN K H,KIM E H, KWAK Y K. Robust speech emotion recogni- tion using log frequency power ratio[ C ]//Proc of SICE-ICASE Inter- national Joint Conference. 2006 : 2586-2589.
8陈明义,党培霞.基于情感基音模板的情感语音合成[J].中南大学学报（自然科学版）,2010,41(6):2258-2263. 被引量：4

二级参考文献21

1张立华,杨莹春.情感语音变化规律的特征分析[J].清华大学学报（自然科学版）,2008,48(S1):652-657. 被引量：14
2周迪伟高东杰（译）.计算机语音处理[M].国防工业出版社,1987..
3唐守正.多元统计方法[M].北京:中国林业出版社,1987..
4王学仁王松桂.实用多元统计分析[M].上海:上海科学技术出版社,1995.150-187.
5Vine D S G,Sahandi R.Synthesis of emotional speech using RP-PSOLA[C] //IEEE Seminar State of the Art in Speech Synthesis Proceedings.London,2000:8/1-8/6.
6Murray I R.Emotion in concatenated speech[C] //IEEE Seminar State of the Arts in Speech Synthesis Proceedings.London,2000:7/1-7/8.
7Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C] //Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
8Su Z,Wang Z.An approach to affective-tone modeling for mandarin[C]//Affective Computing and Intelligent Interaction.Beijing,2005:390-396.
9Hyun K H,Kim E H,Kwak Y K.Robust speech emotion recognition using log frequency power ratio[C] //SICE-ICASE International Joint Conference.Busan,2006:2586-2589.
10GAO Hui,CHEN Shan-guang.Emotion classification of infant voice based on features derived from teenager energy operator[C] //IEEE Congress on Image and Signal Processing.Sanya,China,2008:333-337.

共引文献58

1赵力,王治平,卢韦,邹采荣,吴镇扬.全局和时序结构特征并用的语音信号情感特征识别方法[J].自动化学报,2004,30(3):423-429. 被引量：15
2陈建厦,李翠华.语音情感识别的研究进展[J].计算机工程,2005,31(13):35-37. 被引量：8
3田岚,姜晓庆,侯正信.多语种下情感语音基频参数变化的统计分析[J].控制与决策,2005,20(11):1311-1313. 被引量：2
4周洁,赵力,邹采荣.情感语音合成的研究[J].电声技术,2005,29(10):57-59. 被引量：10
5WANG Zhiping ZHAO Li ZOU Cairong.Speech emotion recognition based on statistical pitch model[J].Chinese Journal of Acoustics,2006,25(1):87-96. 被引量：3
6王治平,赵力,邹采荣.基于基音参数规整及统计分布模型距离的语音情感识别[J].声学学报,2006,31(1):28-34. 被引量：26
7姜晓庆,田岚,崔国辉.多语种情感语音的韵律特征分析和情感识别研究[J].声学学报,2006,31(3):217-221. 被引量：8
8陈明义,余伶俐,朱晗,周昆湘.基于特征参数融合的语音情感识别方法[J].微电子学与计算机,2006,23(12):168-171. 被引量：10
9林奕琳,韦岗,杨康才.语音情感识别的研究进展[J].电路与系统学报,2007,12(1):90-98. 被引量：33
10余伶俐,蔡自兴,陈明义.语音信号的情感特征分析与识别研究综述[J].电路与系统学报,2007,12(4):76-84. 被引量：27

1谢贵武,杨继红,张雄伟,闵刚,肖勇.时域基音同步叠加(TD-PSOLA)算法研究[J].军事通信技术,2008(3):26-29.
2肖沛,孙霞,闫继红,丁泽军.电子束光刻中邻近效应校正的几种方法[J].电子显微学报,2005,24(5):464-468. 被引量：7
3林睿,樊养余.人脸语音动画中语音特征参数提取算法研究[J].现代电子技术,2011,34(6):74-77. 被引量：1
4刘颖,王成儒.用于人脸动画的语音特征提取算法研究[J].电声技术,2008,32(12):49-53. 被引量：2
5周洁,赵力,邹采荣.情感语音合成的研究[J].电声技术,2005,29(10):57-59. 被引量：10
6何峰,于东武,林嘉宇.一种语音更改技术的研究与实现[J].电声技术,2007,31(2):54-56. 被引量：1
7陈明义,许玲玲,陈宁.基于高斯混合模型的情感LPC系数的研究[J].中南大学学报（自然科学版）,2013,44(9):3701-3706.
8叶静,董兰芳,王洵.用于语音动画合成的语音特征提取和聚类技术[J].微型机与应用,2004,23(8):47-49. 被引量：4
9王亮,朱杰.基于时域基音同步叠加技术的普通话语音调节系统[J].电子测量技术,2009,32(12):74-76. 被引量：1
10邵艳秋,韩纪庆,王卓然,刘挺.韵律参数和频谱包络修改相结合的情感语音合成技术研究[J].信号处理,2007,23(4):526-530. 被引量：7

计算机应用研究

2012年第3期

浏览历史

内容加载中请稍等...

人脸语音动画中基于PSOLA的情感语音合成系统

参考文献8

二级参考文献21

共引文献58

相关作者

相关机构

相关主题

浏览历史