期刊文献+

融合CNN和EWC算法的不平衡文本情绪分类方法 被引量:5

Emotion Classification Based on CNN and EWC Algorithm for Unbalanced Texts
下载PDF
导出
摘要 文本情绪分类是自然语言处理领域的一个基本任务。然而,基于不平衡数据的学习使得传统文本情绪分类方法的分类性能降低。针对这个问题,该文提出了一种融合CNN和EWC算法的不平衡文本情绪分类方法。首先,该方法使用随机欠采样方法得到多组平衡数据;其次,按顺序单独使用每一组平衡数据输入CNN训练,同时在训练过程中引入EWC算法用以克服CNN中的灾难性遗忘;最后,把使用最后一组平衡数据输入CNN训练得到的模型作为最终分类模型。实验结果表明,该方法在分类性能上明显优于基于欠采样和多分类算法的集成学习框架,且该方法比基于多通道LSTM神经网络的不平衡情绪分类方法在Accuracy和G-mean上分别提高了1.9%和2.1%。 Text emotion classification is a well-addressed task in the field of natural language processing. To deal with the unbalanced data which hurt the classification performance, this paper proposes an emotion classification method combining CNN and EWC algorithms. First, the method uses the random under-sampling method to obtain multiple sets of balanced data for training. Then it feeds each balanced dataset to CNN training in sequence, introducing EWC algorithm in the training process to overcome the catastrophic forgetting issue in CNN. Finally, the CNN model trained by the last data set is treated as the final classification model. The experimental results show that the proposed method is superior to the ensemble learning framework based on under-sampling and multi-classification algorithms, and outperforms the multi-channel LSTM neural network with 1.9% and 2.1% improvements in accuracy and G-mean, respectively.
作者 程艳 朱海 项国雄 唐天伟 钟林辉 王国玮 CHENG Yan;ZHU Hai;XIANG Guoxiong;TANG Tianwei;ZHONG Linhui;WANG Guowei(School of Computer Information Engineering,Jiangxi Normal University,Nanchang,Jiangxi 330022,China;School of Journalism and Communication,Jiangxi Normal University,Nanchang,Jiangxi 330022,China;Management Decision Evaluation Research Center,Jiangxi Normal University,Nanchang,Jiangxi 330022,China)
出处 《中文信息学报》 CSCD 北大核心 2020年第4期92-100,共9页 Journal of Chinese Information Processing
基金 国家自然科学基金(61967011) 江西省科技重点项目(20161BBE50086) 江西省教育厅科技重点项目(GJJ150299) 江西省教育厅人文社科重点(重大)项目(文)(JD19056) 江西省教育厅科学技术项目(GJJ170207)
关键词 情绪分类 不平衡分类 CNN EWC算法 emotion classification imbalanced classification CNN EWC algorithm
  • 相关文献

参考文献12

二级参考文献117

  • 1蒋盛益,谢照青,余雯.基于代价敏感的朴素贝叶斯不平衡数据分类研究[J].计算机研究与发展,2011,48(S1):387-390. 被引量:21
  • 2林传鼎,无.社会主义心理学中的情绪问题——在中国社会心理学研究会成立大会上的报告(摘要)[J].社会心理科学,2006,21(1):37-37. 被引量:15
  • 3赵积春,王志良,王超.情绪建模与情感虚拟人研究[J].计算机工程,2007,33(1):212-215. 被引量:11
  • 4Pang B.,Lee L.,Vaithyanathan S.Thumbs up?:Sentiment Classification using Machine LearningTechniques[C] //Proceedings of EMNLP.2002.
  • 5Blitzer J.,Dredze M.,Pereira F.Biographies.Bollywood,Boom-boxes and Blenders:DomainAdaptation for Sentiment Classification[C] //Proceedings of ACL.2007.
  • 6Li S.,Huang C.,Zhou G.,et al.EmployingPersonal/Impersonal Views in Supervised and Semi-supervised Sentiment Classification[C] //Proceedingsof ACL.2010.
  • 7Barandela R.,Sánchez J.S.,García V.,et al.Strategiesfor Learning in Class Imbalance Problems[J].PatternRecognition,2003,36:849-851.
  • 8Kubat M.,Matwin S.Addressing the Curse ofImbalanced Training Sets:One-Sided Selection[C] //Proceedings of ICML.1997.
  • 9Chawla N.,Bowyer K.,Hall L.,et al.SMOTE:Synthetic Minority Over-Sampling Technique[J].Journal of Artificial Intelligence Research,2002,16:321-357.
  • 10Juszczak P.,Duin R.Uncertainty Sampling Methodsfor One-Class Classifiers[C] //Proceedings of ICML,Workshop on Learning with Imbalanced Data Sets II.2003.

共引文献230

同被引文献69

引证文献5

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部