摘要
针对目前基于单文本语义特征深度学习的微博谣言实时检测模型泛化能力不足的问题,提出一种融合多元用户特征和内容特征的实时检测模型.首先,在传统用户基本特征和内容统计特征的基础上,利用用户的历史行为数据,挖掘用户理性值和用户专业度两个深层次特征;然后,基于词向量和带有注意力机制的双向GRU神经网络构建文本语义特征学习模型;最后,采用分层特征级联和全连接的方式进行特征融合,把融合特征输入分类模型进行训练.实验结果表明,该模型的检测准确率达到了91.74%,相比其他只关注文本语义特征的深度学习实时检测模型具有更好的识别效果,相比于其他改进型的实时检测模型F1-Measure值也提高了2.19%.
In order to solve the problem of low generalization ability of Weibo rumor real-time detection model based on deep learning of text semantic features,a real-time detection model fusion of multi user features and content features is proposed.First of all,in addition to the traditional user basic features and content statistical features,two implicit features of user rationality and user professionalism are mined based on user′s historical behavior data;Then,a text semantic feature learning model is constructed based on word vector and bidirectional GRU neural network with attention mechanism;Finally,the hierarchical cascade and full connection are used for feature fusion,and the fused features are input into the classification model for training.The experimental results show that the accuracy of the model is 91.74%,which is better than other deep learning real-time detection models based on text semantic features.Compared with other improved real-time detection models,F1-Measure value also increased by 2.19%.
作者
黄学坚
王根生
罗远胜
闵潞
吴小芳
李志鹏
HUANG Xue-jian;WANG Gen-sheng;LUO Yuan-sheng;MIN Lu;WU Xiao-fang;LI Zhi-peng(School of Humanities,Jiangxi University of Finance and Economic,Nanchang 330013,China;Computer Practice Teaching Center,Jiangxi University of Finance and Economics,Nanchang 330013,China;School of International Trade and Economics,Jiangxi University of Finance and Economic,Nanchang 330013,China)
出处
《小型微型计算机系统》
CSCD
北大核心
2022年第12期2518-2527,共10页
Journal of Chinese Computer Systems
基金
国家自然科学基金项目(72061015,61562031)资助
江西省教育厅科技项目(GJJ200539)资助。
关键词
微博谣言
实时检测
特征融合
深层特征
深度学习
Weibo rumors
real-time detection
feature fusion
implied features
deep learning