摘要
链接数据为知识图谱的主要表现形式,但目前链接数据的发布缺少统一标准,导致数据质量参差不齐.本文回顾了链接数据质量评价的相关研究,并根据链接数据的发展过程,将衡量数据质量的维度划分为7个类型,分别描述每个数据质量维度的特性.同时,具体介绍了冗余度、可信性的量化评价方法及波动性对链接数据质量的影响.
Linked Data is the mainly expression form of knowledge graph, however. We have found that RDF datasets on the Web are of varying quality due to the lack of unified publishing principles, which influences the quality and accessibility of Linked Data. This paper divides the data quality dimensions into 7 types, and presents the charac- teristics of each data dimension. In addition, the quantitative calculation of redundancy and trustworthiness is de- scribed in detail. Our goal is to help the relevant researchers have a comprehensive understanding about the existing work and to promote the further development of Linked Data quality assessment.
出处
《武汉大学学报(理学版)》
CAS
CSCD
北大核心
2017年第1期22-38,共17页
Journal of Wuhan University:Natural Science Edition
基金
国家自然科学基金(61673304
61272110)
国家社会科学基金重大计划(11&ZD189)
武汉市科技攻关计划(201110821225)
软件工程国家重点实验室(武汉大学)开放基金SKLSE2012-09-07
关键词
链接数据
质量评价
质量维度
知识图谱
linked data
quality assessment
quality dimension
knowledge graph