Abstract
In order to improve the efficiency of cross-corpus speech emotion recognition, a speech emotion transfer learning method based on a deep sparse auto-encoder is proposed. The algorithm first trains the deep sparse auto-encoder to reconstruct a small amount of data from the target domain, so that the encoder learns a low-dimensional structural representation of the target-domain data. Then, both the source-domain data and the target-domain data are encoded by the trained deep sparse auto-encoder to obtain reconstructed data whose low-dimensional structural representation is close to that of the target domain. Finally, part of the reconstructed labeled target-domain data is mixed with the reconstructed source-domain data to jointly train the classifier; this portion of the target-domain data serves to guide the source-domain data. Experiments on the CASIA and SoutheastLab corpora show that, after transferring only a small amount of data, the model's recognition rate on a DNN reaches 89.2% and 72.4%, respectively. Compared with the results of training on the complete original corpora, this is a decrease of only 2% on the CASIA corpus and only 3.4% on the SoutheastLab corpus. The experiments show that, in the extreme case where only a small amount of the data set is labeled, the algorithm can approach the performance obtained when all data are labeled.
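The three steps described above can be sketched in code. The following is a minimal illustrative sketch, not the authors' implementation: it uses a single-layer sparse auto-encoder with a KL-divergence sparsity penalty as a stand-in for the deep version, synthetic random feature vectors in place of real speech emotion features, and hypothetical hyperparameter values (`rho`, `beta`, `lr`).

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class SparseAutoencoder:
    """Single-layer sparse auto-encoder (illustrative stand-in for the
    deep sparse auto-encoder described in the abstract)."""
    def __init__(self, n_in, n_hidden, rho=0.05, beta=0.1, lr=0.1, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.1, (n_hidden, n_in))
        self.b2 = np.zeros(n_in)
        self.rho, self.beta, self.lr = rho, beta, lr

    def encode(self, X):
        return sigmoid(X @ self.W1 + self.b1)

    def reconstruct(self, X):
        return self.encode(X) @ self.W2 + self.b2

    def fit(self, X, epochs=200):
        n = X.shape[0]
        for _ in range(epochs):
            H = self.encode(X)
            err = (H @ self.W2 + self.b2) - X      # reconstruction error
            rho_hat = H.mean(axis=0)               # mean hidden activation
            # Gradient of the KL sparsity penalty w.r.t. rho_hat
            kl = self.beta * (-self.rho / rho_hat
                              + (1.0 - self.rho) / (1.0 - rho_hat))
            dH = err @ self.W2.T + kl              # backprop into hidden layer
            dZ = dH * H * (1.0 - H)                # sigmoid derivative
            self.W2 -= self.lr * H.T @ err / n
            self.b2 -= self.lr * err.mean(axis=0)
            self.W1 -= self.lr * X.T @ dZ / n
            self.b1 -= self.lr * dZ.mean(axis=0)

rng = np.random.default_rng(1)
target_small = rng.normal(0.5, 0.1, (20, 8))   # few labeled target-domain samples
source = rng.normal(0.3, 0.2, (100, 8))        # source-domain samples

ae = SparseAutoencoder(n_in=8, n_hidden=4)
ae.fit(target_small)                           # step 1: learn target-domain structure
source_rec = ae.reconstruct(source)            # step 2: map source data toward target
target_rec = ae.reconstruct(target_small)
train_X = np.vstack([source_rec, target_rec])  # step 3: mix for joint classifier training
```

The mixed `train_X` (with the corresponding labels) would then be fed to the classifier, e.g. the DNN mentioned in the abstract; the small labeled target-domain portion anchors the decision boundary in the target domain.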
Funding
The National Natural Science Foundation of China (No. 61871213, 61673108, 61571106)
The Six Talent Peaks Project in Jiangsu Province (No. 2016-DZXX-023)