基于BERT-CW的公共卫生事件情绪识别模型研究

Research on Emotion Recognition Model of Public Health Events Based on BERT-CW

下载PDF

导出

摘要针对情绪识别任务中,单一存在的模型存在片面性,无法充分提取语义特征等问题,本文提出了一种领域情感词典与字词特征融合相结合的文本分类方法。首先构建并扩展有关于“传染性疾病”事件的领域情感词典,其次融合文本的字向量特征和词向量特征,最后将BERT模型应用于“传染性疾病”事件微博文本分类任务中。实验结果显示,相较于其它神经网络模型,BERT-CW(字词融合)模型的精确率、召回率和F1值各项评价指标的表现更好;相比于字划分或词划分的BERT-C模型和BERT-W模型,BERT-CW模型的可靠性更高,实验结果在微博用户评论数据集的网络情绪识别任务上准确率达到了94.59%,F1值达到了94.08%,证实了此模型的有效性。 In order to solve the problems of one-sidedness and inadequacy of semantic features in emotion recognition tasks,a text classification method combining domain sentiment dictionary and word feature fusion is proposed in this paper.Firstly,a domain sentiment dictionary about“infectious disease”event is constructed and extended.Secondly,word vector features and word vector features of text are integrated.Finally,Burt model is applied to the classification task of“infectious disease”event microblog post.The experimental results show that compared with other neural network models,the accuracy rate,recall rate and F1 value of BERT-CW model perform better.Compared with the BERT-C model and the BERT-W model of word division or word division,the BERT-CW model has higher reliability.The experimental results show that the accuracy of the Internet emotion recognition task in the microblog user comment data set reaches 94.59%,and the F1 value reaches 94.08%,which confirms the validity of this model.

作者曹涛白书臣 Cao Tao;Bai Shuchen(Dalian Polytechnic University,Dalian,China)

机构地区大连工业大学

出处《科学技术创新》 2023年第10期72-76,共5页 Scientific and Technological Innovation

关键词情绪识别字词融合文本分类 BERT 领域词典 emotion recognition word fusion text classification BERT domain dictionary

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1刘忠宝,秦权,赵文娟.微博环境下新冠肺炎疫情事件对网民情绪的影响分析[J].情报杂志,2021,40(2):138-145. 被引量：18
2曾子明,万品玉.基于双层注意力和Bi-LSTM的公共安全事件微博情感分析[J].情报科学,2019,37(6):23-29. 被引量：31
3刘继,顾凤云.基于BERT与BiLSTM混合方法的网络舆情非平衡文本情感分析[J].情报杂志,2022,41(4):104-110. 被引量：26
4钟佳娃,刘巍,王思丽,杨恒.文本情感分析方法及应用综述[J].数据分析与知识发现,2021,5(6):1-13. 被引量：76
5赵宏,傅兆阳,王乐.基于特征融合的中文文本情感分析方法[J].兰州理工大学学报,2022,48(3):94-102. 被引量：7
6谭翠萍.文本细粒度情感分析研究综述[J].大学图书馆学报,2022,40(4):85-99. 被引量：11
7张继东,张慧迪.融合注意力机制的多模态突发事件用户情感分析[J].情报理论与实践,2022,45(11):170-177. 被引量：8
8栗雨晴,礼欣,韩煦,宋丹丹,廖乐健.基于双语词典的微博多类情感分析方法[J].电子学报,2016,44(9):2068-2073. 被引量：30
9何铠,管有庆,龚锐.一种基于权重预处理的中文文本分类算法[J].计算机技术与发展,2022,32(3):40-45. 被引量：4

二级参考文献124

1赵妍妍,秦兵,车万翔,刘挺.中文事件抽取技术研究[J].中文信息学报,2008,22(1):3-8. 被引量：105
2Melville P,Gryc W,Lawrence R D.Sentiment analysis of blogs by combining lexical knowledge with text classification[A] .Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining[C] .New York:ACM SIGKDD Explorations Newsletter,2009.1275-1284.
3Wan X.Bilingual co-training for sentiment classification of Chinese product reviews[J] .Computational Linguistics,2011,37(3):587-616.
4Meng X,Wei F,Liu X,et al.Cross-lingual mixture model for sentiment classification[A] .Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics:Long Papers-Volume 1[C] .Stroudsburg:Association for Computational Linguistics,2012.572-581.
5Pang B,Lee L.Opinion mining and sentiment analysis[J] .Foundations and Trends in Information Retrieval,2008,2(1-2):1-135.
6Li Y,Li X,Li F,et al.A lexicon-based multi-class semantic orientation analysis for microblogs[A] .Web Technologies and Applications[C] .Cham:Springer International Publishing,2014.81-92.
7Dong Z,Dong Q.HowNet and the Computation of Meaning[M] .Singapore:World Scientific,2006.
8Miller G A.WordNet:a lexical database for English[J] .Communications of the ACM,1995,38(11):39-41.
9Hu M,Liu B.Opinion extraction and summarization on the web[A] .Proceedings of the 21st National Conference on Artificial Intelligence(AAAI 2006)[C] .California:AAAI Press,2006.1621-1624.
10Zhu Y L,Min J,Zhou Y,et al.Semantic orientation computing based on HowNet[J] .Journal of Chinese Information Processing,2006,20(1):14-20.

共引文献191

1王君泽,詹若贤,李怡,杜洪涛.融合主题与细粒度情感特征的气候变化微博舆情分析研究[J].信息技术与管理应用,2023(4):87-104.
2张苑,祝小兰,杨东晓.基于深度学习的疫情情感分析[J].智能计算机与应用,2022,12(3):40-45. 被引量：1
3敦欣卉,张云秋,杨铠西.基于微博的细粒度情感分析[J].数据分析与知识发现,2017,1(7):61-72. 被引量：27
4ZHANG Yangsen,ZHANG Yaorong,JIANG Yuru,HUANG Gaijuan.Multi-feature-Based Subjective-Sentence Classification Method for Chinese Micro-blogs[J].Chinese Journal of Electronics,2017,26(6):1111-1117. 被引量：2
5张仰森,郑佳,黄改娟,蒋玉茹.基于双重注意力模型的微博情感分析方法[J].清华大学学报（自然科学版）,2018,58(2):122-130. 被引量：48
6陈志雄,王时绘,高榕.基于情感倾向性分析的微博意见领袖识别模型[J].计算机科学,2018,45(5):168-175. 被引量：9
7郝苗苗,徐秀娟,于红,赵小薇,许真珍.基于中文微博的情绪分类与预测算法[J].计算机应用,2018,38(A02):89-96. 被引量：15
8洪巍,李敏.文本情感分析方法研究综述[J].计算机工程与科学,2019,41(4):750-757. 被引量：84
9蔡晨,罗可.融合BTM和图论的微博检索模型[J].计算机工程与科学,2019,41(8):1512-1518. 被引量：2
10徐善山.基于领域词典和机器学习的影评情感分析[J].电脑知识与技术,2019,15(8Z):222-223. 被引量：1

1程健.面孔情绪识别及其机制的研究进展[J].中文科技期刊数据库（引文版）医药卫生,2022(2):4-8.
2唐烨伟,卜凡丽,赵一婷.学习者多模态情绪融合分析:动因、框架与路向[J].开放教育研究,2023,29(3):96-103. 被引量：2
3权威声音[J].中国金融,2023(8):2-3.
4王泽宇,李秦,唐云清,韩增林.海洋强国战略政策效应评估——基于HCW模型的实证分析[J].地理研究,2023,42(5):1215-1233. 被引量：3
5马晓荷,谭成仟,郭泽坤.博文盆地M煤层气田含气量主控因素分析及预测[J].科学技术与工程,2023,23(11):4586-4595.
6郑少毅,雷艺炎.从多组学角度探讨食管鳞状细胞癌的免疫相关特征[J].中山大学学报（医学科学版）,2023,44(3):519-527. 被引量：1
7常存,徐红慧,陈燕冰.特色旅游小镇游客情感特征研究——以深圳甘坑古镇为例[J].旅游与摄影,2023(4):54-56.
8曹善文.基于流程挖掘视角下的数据要素利用研究[J].信息通信技术与政策,2023,49(4):59-64.
9张草霞.“产出导向法”在民办高校英语专业写作教学中的应用研究[J].湖北开放职业学院学报,2023,36(9):183-185. 被引量：2
10刘鹏,任汀,谢韬,田汇冬,靳守锋,王青于.换流变阀侧套管表带触指接触电阻数值计算[J].高电压技术,2023,49(3):1184-1193. 被引量：5

科学技术创新

2023年第10期

浏览历史

内容加载中请稍等...

基于BERT-CW的公共卫生事件情绪识别模型研究

参考文献9

二级参考文献124

共引文献191

相关作者

相关机构

相关主题

浏览历史