期刊文献+

语义知识库构建中的异常数据发现

Discovering Abnormal Data in RDF Knowledge Base
下载PDF
导出
摘要 为了提高RDF知识库的数据质量,提出RDF图数据的异常检测及其自动修复的方法。首先,原创性地定义了基于图的条件函数依赖(GCFD),能够将属性值和语义结构的依赖关系统一表示;然后,提出有效的算法框架以及优化策略,挖掘RDF数据中的GCFD,并给出异常数据的自动修复流程;最后,在真实的数据集上,通过大量实验确认解决方案的可行性和优越性。 To effectively improve the data quality of RDF knowledge base, a solution is proposed about abnoraml data discovery and errouneous data repair in RDF graphs. Firstly, the authors innovatively define graph-based conditional functional dependency(GCFD) that can represent the attribute value and semantic structure dependencies of RDF data in a uniform manner. Then, an efficient framework and some novel pruning rules are proposed to discover GCFDs, and the workflow of auto-repairing errorneous data are given. Extensive experiments on several real-life RDF repositories confirm the superiority of proposed solution.
出处 《北京大学学报(自然科学版)》 EI CAS CSCD 北大核心 2015年第2期195-202,共8页 Acta Scientiarum Naturalium Universitatis Pekinensis
基金 国家自然科学基金(61370055)资助
关键词 RDF数据质量 基于图的条件函数依赖 条件函数依赖 函数依赖 RDF data quality graph-based conditional functional dependencies(GCFD) conditional functional dependency functional dependency
  • 相关文献

参考文献15

  • 1Auer S C, Kobilarov B G, Lehmann J, et al. Dbpedia: a nucleus for a web of open data // ISWC/ASWC. Busan, 2007:722-735.
  • 2Bollacker K D, Evans C, Paritosh P, et al. Freebase: a collaboratively created graph database for structuring human knowledge // SIGMOD Conference. Vancou- ver, 2008:1247-1250.
  • 3Suchanek F M, Kasneci G, Weikum G. Yago: a core of semantic knowledge//WWW. Banff, 2007:697-706.
  • 4Yu Y, Heflin J. Extending functional dependency to detect abnormal data in rdf graphs // International Semantic Web Conference. Heidelberg: Springer, 2011 : 794-809.
  • 5Serge A, Richard H, Victor V. Foundations of databases. Reading, Massachusetts: Addison-Wesley, 1995.
  • 6Huhtala Y, K/irkk/iinen J, Porkka P, et al. Tane: an efficient algorithm for discovering functional and approximate dependencies. Comput J, 1999, 42(2): 100-111.
  • 7Wyss C M, Giannella C, Robertson E L. Fastfds: a heuristic-driven, depth-first algorithm for mining functional dependencies from relation instances- extended abstract // DaWaK. Heidelberg: Springer, 2001:101 110.
  • 8Fan W, Geerts F, Li J, et al. Discovering conditional functional dependencies. IEEE Trans Knowl Data Eng, 2011, 23(5): 683-698.
  • 9Levene M, Poulovanssilis A. An object-oriented data model formalised through hypergraphs. Data Knowl Eng, 1991, 6(3): 205-224.
  • 10Weddell G E. Reasoning about functional dependen- cies generalized for semantic data models. ACM Trans Database Syst, 1992:32-64.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部