期刊文献+

传播用户代表性特征学习的谣言检测方法 被引量:3

Rumor Detection Based on Representative User Characteristics Learning Through Propagation
下载PDF
导出
摘要 谣言的及时发现和有效管控,是互联网+政务服务中公共舆情治理的重要组成部分。互联网和移动互联网的发展,提高了民众沟通交流的便利度,同时也加速了谣言的传播速度和广度,极大地提高了谣言的影响力和危害力,给民众的生产生活带来干扰,也严重影响社会秩序。现有的网络平台辟谣工作大多依赖于人工举报筛查,往往耗费大量的时间和精力。而利用数据挖掘、机器学习技术实现的谣言检测算法大多基于文本信息,常用于追溯性谣言检测,不适用于谣言扩散早期数据量不足的情况。首先收集最新的网络平台数据进行标注构造数据集Weibo2020,对其中用户特征分布进行统计分析并选择具有代表性的用户特征,进而提出了基于传播用户代表性特征学习的早期谣言检测方法(RPPC)。经实验验证,RPPC与同样基于传播路径的算法在同等条件下,在输入数据规模减少50%的同时,将准确率提高了2.57个百分点。此外,该方法能对5 min内发布的消息进行检测,快速发现互联网内容中的疑似谣言且准确率达到近80%。因此可以认为提出的方法在现有的数据集中有较好的表现,能够在一定程度上辅助政府部门的舆情治理工作,从而提高政务服务的时效及质量。 Effective rumor detection and management has become an essential part of Internet plus government services initiative.The Internet era brings great convenience to people's communication as well as speeds up the propagation of rumors,which not only interferes people's normal living but also does harm to the social confidence system.Existing work of rumor debunking on the Internet is mostly based on manual work of public tip-offs and screening,which is time consuming and demanding.Meanwhile,work on algorithm of rumor detection based on data mining and machine learning depends heavily on text content,which is deficient during the early stage of rumor propagation.This paper constructs latest dataset Weibo2020,composed of both rumors and normal information,and extracts representative user characteristics from the perspective of statistics,then proposes an algorithm of early-stage rumor detection based on brief propagation path,named RPPC(representative propagation path classification).The experimental results indicate that the proposed method can improve the prediction accuracy by 2.57 percentage points while reducing the input data scale by 50%.Meanwhile,the proposed method can predict the authenticity of news released in 5 minutes and achieve an accuracy of about 80%.Therefore,the proposed method achieves good results in a limited size of dataset and can to some degree help with network public opinion governance and improve the efficiency and quality of government service.
作者 谢欣彤 胡悦阳 刘譞哲 赵耀帅 姜海鸥 XIE Xintong;HU Yueyang;LIU Xuanzhe;ZHAO Yaoshuai;JIANG Hai'ou(School of Electronics Engineering and Computer Science,Peking University,Beijing 100871,China;Key Laboratory of High Confidence Software Technologies of Ministry of Education(Peking University),Beijing 100871,China;TravelSky Technology Limited,Beijing 101318,China;Key Laboratory of Intelligent Passenger Service of Civil Aviation,Civil Aviation Administration of China,Beijing 101318,China;School of Software and Microelectronics,Peking University,Beijing 102600,China;Peking University Information Technology Institute(Tianjin Binhai),Tianjin 300452,China)
出处 《计算机科学与探索》 CSCD 北大核心 2022年第6期1334-1342,共9页 Journal of Frontiers of Computer Science and Technology
基金 国家重点研发计划(2018YFB1004400) 北京高等学校卓越青年科学家项目(BJJWZYJH01201910001004)。
关键词 谣言检测 机器学习 特征分析 传播路径 互联网+政务服务 舆情治理 rumor detection machine learning characteristic analysis propagation path Internet plus government services initiative public opinion management
  • 相关文献

参考文献2

二级参考文献19

  • 1奥尔波特.谣言心理学[M].沈阳:辽宁教育出版社,2003.
  • 2卡普费雷.谣言:世界最古老的传媒[M].郑若麟,译.上海:上海人民出版社,2008.
  • 3拉扎斯菲尔德.人民的选择[M].唐茜,译.北京:中国人民大学出版社,2012:139-150.
  • 4CNNIC.中国互联网络发展状况统计报告(2015年1月)[R].北京:中国互联网信息中心,2015.
  • 5Qazvinian V, Rosengren E, Radev D R, et al. Rumor has it: identi- fying misinformation in microblogs [ C ]//Proc of Conference on Em- pirical Methods in Natural Language Processing. [ S. t. ] : Association for Computational Linguistics, 2011: 1589-1599.
  • 6Castillo C, Mendoza M, Poblete B. Information credibility on twitter [ C]//Proc of the 20th International Conference on World Wide Web. New York:ACM Press, 2011: 675-684.
  • 7Takahashi T, Igata N. Rumor detection on twitter [ C ]//Proc of the 13th Intet~aational Symposium on Advanced Intelligent Systems, and the Joint 6th International Conference on Soft Computing and Intelli- gent Systems. [ S. 1. ] :IEEE Press, 2012: 452-457.
  • 8Yang Fan, Liu Yang, Yu Xiaohui, et al. Automatic detection of ru- mor on Sina Weibo[ C]//Proc of ACM SIGKDD Workshop on Mining Data Semantics. New York:ACM Press, 2012: 13.
  • 9Sun Shengyun, Liu Hongyan, He Jun, et al. Detecting event rumors on Sina Weibo automatically [ M ]//Web Technologies and Applica- tions. Berlin:Springer, 2013.. 120-131.
  • 10Mikolov T, Chen Kai, Con'ado G, et al. Efficient estimation of word representations in vector space [ C ]//Proc of International Conference on Learning Representations. 2013.

共引文献57

同被引文献26

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部