摘要
社交网络方便人们信息交流的同时也为谣言的传播提供了新的温床。由于社交媒体帖子通常十分精简,大多数基于内容语义特征的谣言检测方法面临着语义信息不足的挑战。同时,目前基于传播特征的谣言检测方法常常忽略了评论用户的个体特征,未能合理分配不同用户评论的权重。因此,提出一种结合文本语义增强和评论立场加权的网络谣言检测方法。通过外部知识图谱获取帖子中的实体和概念的解释,以提供更多上下文信息,从而增强语义理解。借助点互信息将增强后的文本转化为加权图表示,并利用加权图注意力网络学习帖子的增强语义特征。通过预训练的立场检测模型提取帖子中每条评论的立场信息,并根据评论用户的特征来学习立场信息的权重值。将评论立场的时序数据和相应的评论用户序列数据输入跨模态的Transformer,以学习评论立场的时序特征。将增强的语义特征与加权的评论立场时序特征进行自适应融合,并输入多层感知机中进行分类。在PHEME和Weibo两个数据集上的实验结果表明,该方法不仅准确率高于最先进的基线方法1.6个百分点以上,而且在早期谣言检测方面,比最好的基线方法提前12 h。
Social networks,while enabling information exchange among individuals,also serve as fertile grounds for the dissemination of rumors.The succinct nature of social media posts poses a challenge for most rumor detection methods reliant on content semantic features due to the insufficiency of semantic information.Additionally,numerous rumor detection techniques focusing on propagation features often disregard the unique attributes of commenters,leading to inadequate allocation of weights to different user comments.Thus,a network rumor detection approach is proposed,integrating text semantic enhancement and weighted comment stance.Initially,entities and concepts in posts are elucidated via an external knowledge graph to furnish additional contextual information,thereby augmenting semantic comprehension.Subsequently,leveraging pointwise mutual information,the enhanced text is translated into a weighted graph representation,and a weighted graph attention network is employed to assimilate enhanced semantic features of posts.Stance information for each comment within the post is then extracted using a pre-trained stance detection model,with weight values of stance information being learnt based on commenters’characteristics.Furthermore,temporal data of comment stances and corresponding commenter sequences are fed into a cross-modal Transformer to glean the temporal features of comment stances.Ultimately,the enhanced semantic features are adaptively merged with the weighted temporal features of comment stances and fed into a multilayer perceptron for classification.Experimental results on the PHEME and Weibo datasets demonstrate that this method not only achieves an accuracy improvement of over 1.6 percentage points compared with the state-of-the-art baseline method but also outperforms the best baseline method by at least 12 hours in early rumor detection.
作者
朱奕
王根生
金文文
黄学坚
李胜
ZHU Yi;WANG Gensheng;JIN Wenwen;HUANG Xuejian;LI Sheng(School of Finance,Taxation and Public Administration,Jiangxi University of Finance and Economics,Nanchang 330013,China;School of Information Management,Jiangxi University of Finance and Economics,Nanchang 330013,China;School of Humanities,Jiangxi University of Finance and Economics,Nanchang 330013,China)
出处
《计算机科学与探索》
CSCD
北大核心
2024年第12期3311-3323,共13页
Journal of Frontiers of Computer Science and Technology
基金
国家自然科学基金(72061015)
江西省自然科学基金(20242BAB20074)。
关键词
谣言检测
语义增强
评论立场
图神经网络
知识图谱
rumor detection
semantic enhancement
comment stance
graph neural network
knowledge graph