Abstract
The spread of rumors can disrupt social order, endanger national stability, and cause public panic. The widespread use of social platforms lets information spread faster and reach further, which amplifies the negative impact of rumors, so identifying online rumors quickly and accurately has become a hot topic in the field of information dissemination. Rumor identification is essentially a binary classification problem. Based on the idea of Bayesian classification, a naive Bayesian classification algorithm for online rumor identification is therefore designed. A naive Bayesian classifier is built in Matlab, and the algorithm is validated experimentally on data collected from Weibo. By controlling the training set and comparing the accuracy, precision, recall, and F1 score of the identification results, the study explores how naive Bayesian classifiers trained under different conditions identify rumors and non-rumors, and what patterns underlie the results. The results show that the naive Bayesian classifier is effective for online rumor identification, that the selection and control of the training set strongly affect the identification results, and that identification accuracy fluctuates with the training conditions.
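The pipeline summarized in the abstract (train a naive Bayesian classifier, then score it by accuracy, precision, recall, and F1) can be illustrated with a minimal sketch. This is not the author's Matlab implementation. It is written in Python, and the bag-of-words features, the multinomial model, the Laplace smoothing constant `alpha`, the label convention (1 = rumor), and the toy posts are all assumptions made for illustration only.

```python
import math
from collections import Counter


def train_naive_bayes(docs, labels, alpha=1.0):
    """Fit a multinomial naive Bayes model on tokenized documents.

    docs:   list of token lists (tokenization is assumed, not from the paper)
    labels: list of class labels, here 1 = rumor and 0 = non-rumor
    alpha:  Laplace smoothing constant (an assumed value)
    """
    vocab = {tok for doc in docs for tok in doc}
    log_prior, log_likelihood = {}, {}
    for c in sorted(set(labels)):
        class_docs = [d for d, y in zip(docs, labels) if y == c]
        log_prior[c] = math.log(len(class_docs) / len(docs))
        counts = Counter(tok for d in class_docs for tok in d)
        total = sum(counts.values()) + alpha * len(vocab)
        log_likelihood[c] = {
            t: math.log((counts[t] + alpha) / total) for t in vocab
        }
    return log_prior, log_likelihood


def predict(model, doc):
    """Return the class with the highest posterior log-score for one document."""
    log_prior, log_likelihood = model
    scores = {}
    for c in log_prior:
        # Tokens that never appeared in training are skipped.
        scores[c] = log_prior[c] + sum(
            log_likelihood[c][t] for t in doc if t in log_likelihood[c]
        )
    return max(scores, key=scores.get)


def binary_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall, and F1 for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy, made-up training posts (not data from the paper).
train_docs = [
    ["shocking", "miracle", "cure"],
    ["forward", "shocking", "warning"],
    ["official", "weather", "report"],
    ["official", "traffic", "report"],
]
train_labels = [1, 1, 0, 0]
model = train_naive_bayes(train_docs, train_labels)
```

Changing which posts go into `train_docs` and re-running `binary_metrics` on a fixed test set is one simple way to observe the kind of training-set sensitivity the abstract reports.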
Author
李文丽
LI Wen-li (School of Management, Shanghai University, Shanghai 200444, China)
Source
Computer Engineering & Science (《计算机工程与科学》)
CSCD
Peking University Core Journals (北大核心)
2022, No. 3, pp. 495-501 (7 pages)
Keywords
Naive Bayesian classification
rumor identification
machine learning