摘要
【目的】准确计算Wikipedia中词条的可信度。【方法】采用文本分析法将词条当前版本与其历史版本进行比较,获取各版本作者的有效编辑内容,并结合词条当前版本包含的参考文献数和图片数等结构信息,构建一个动态的词条信任评价模型。【结果】通过仿真实验表明该模型能够很好地区分Wikipedia中高信任词条和低信任词条。【局限】通过该算法得出的词条等级划分阈值对处于信任等级中间的B和C两类词条区分不明显。【结论】该算法简单有效,能够从微观层面了解词条的变化过程,动态计算其信任值。
[Objective] Accurately calculate the credibility of the Wikipedia entry. [Methods] This paper builds a trust evaluation model which makes a comparison between the current version and their historical version by the text analysis to obtain each version of the edior's effective edit content, and combined with the number of reference and image in the current version of the Wikipedia article. [Results] It shows that the model is able to distinguish the high trust Wikipedia article and low trust through empirical research. [Limitations] The entry level threshold by this algorithm is not very obvious to distinguish the two types of B level and C level. [Conclusions] The algorithm is simple and effective, and can understand the changing process of entry from the microscopic level, dynamically compute its trust value.
出处
《现代图书情报技术》
CSSCI
2015年第3期33-38,共6页
New Technology of Library and Information Service
基金
国家自然科学基金青年基金项目"基于可信语义Wiki的知识库构建方法与应用研究"(项目编号:71203173)的研究成果之一