摘要
为了更加准确地进行语音情感识别,提出了一种基于去噪自编码器的语音情感识别模型。该模型用Open SMILE提取了语音中的声学特征,利用构建好的去噪自编码器获得更高阶的特征,用SVM分类器对语音中的情感进行识别分类。在EmoDB情感语料库上进行了实验,结果表明,与直接使用SVM进行分类相比,该模型对语音情感的识别准确率至少提高了2%。
In order to make the speech emotion recognition more accurate, a speech emotion recognition model based on denoising autoencoder is proposed. In this model, the OpenSMILE is used to extract the acoustic characteristics of speech, the built denoising autoencoder is used to acquire the higher-order characteristics, and SVM classifier is used for emotion recognition in speech classification. The results of experiments on the EmoDB emotional corpus show-that this model can improve the recognition accuracy of speech emotion by at least 2%, compared with the direct use of SVM.
作者
雷沛之
傅洪亮
LEI Peizhi;FU Hongliang(College of Information Science and Engineering,He'nan University of Technology,Zhengzhou,He'nan 450001,China)
出处
《计算机与网络》
2018年第18期67-68,共2页
Computer & Network
关键词
情感识别
语音特征
SVM
去噪自编码器
emotion recognition
voice features
SVM
denoising autoencoder