摘要
该文介绍了面向普通话情感变化检测的情感语音数据库CESD。该数据库的语音以对话形式录制,包括男女声情感对话语音1 200段。以生气、着急、中性、愉悦、高兴为基本情感,共包含20种情感变化模式。除语音文件外,还包含带有静音段/有效语音段、情感类别、情感变化段、情感质量等内容的标注文件。为了使更多的研究人员可以使用该数据库,利用P raat工具提取出67维常用声学特征,作为特征文件一同存储在该数据库中。对该数据库进行主观评价和情感变化检测的结果表明:语音情感状态自然、情感变化真实,能够满足语音情感识别和语音情感变化检测研究的双重需求。
This paper describes a database of emotional speech variations named CESD.The database contains 600 utterances in the form of dialogues with 20 emotional variation modes consisting of 3 different emotions including anger,impatience,neutral,joy,and happiness.Besides the utterances,the database also includes the corresponding label files which include silence or effective speech segments,emotional classes,emotional variation segments,and emotional quality. 67 normal acoustical features are extracted based on the Praat tool and stored in the database.Subjective assessments of the emotional variations demonstrate that the database is suitable for research on speech emotion recognition and emotional variations.
出处
《清华大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2009年第S1期1413-1418,共6页
Journal of Tsinghua University(Science and Technology)
基金
国家自然科学基金重点项目(60433030)
关键词
语音识别
情感识别
汉语
数据库
speech recognition
emotion recognition
Chinese
database