期刊文献+

一种模式匹配和实体统一相互促进的方法 被引量:3

An Improvement Method of Schema Matching and Entity Resolution
下载PDF
导出
摘要 在数据库研究领域,模式匹配和实体统一是被广泛关注的两个方向。随着对Web数据集成需求的增长,无论是在模式和实体层次,研究这两方面问题是很有实际意义的。当前的研究大多针对两项任务的其中之一。在文章中,基于模式匹配促进实体统一的新思路,提出了一种同时解决这两项任务的方法,实现了它们之间的相互促进机制。在现实的Web异构数据源场景中应用该方法,得到的查准率和查全率都很高,证明了该方法的正确性和有效性。 Schema matching and entity resolution have been two topics widely studied in the field of database research. With the rising demand in the Web data integration, both in schema and instance level, the study of the two tasks is becoming more practical importance. Most current study efforts at resolving one of the two matching tasks. In this paper, based on the new ideas of schema matching benefit from entity resolution, we propose a method that simultaneously attacks these two tasks and achieves a kind of improvement between them. By applying our method to a realistic Web heterogeneous data source scenario, we show that precision and recall are both quite high, and show this method's correctness and validity.
出处 《计算机与数字工程》 2009年第9期4-6,19,共4页 Computer & Digital Engineering
基金 国家自然科学基金(编号:60673130) 山东省自然科学基金(编号:Y2007G38) 山东省科技攻关计划(编号:2008GG30001005) 山东省科学技术发展计划(编号:2007GG1QX01036) 山东省博士后创新项目专项资金(200703084)资助
关键词 数据集成 模式匹配 实体统一 data integration, schema matching, entity resolution
  • 相关文献

参考文献6

  • 1David Guy Brizan, Abdullah Uz Tansel. A Survey of Entity Resolution and Record Linkage Methodologies [J]. Communications of the IIMA,2006,6(3):41-50.
  • 2郑文怡,鞠时光.模式匹配方法研究[J].计算机应用研究,2006,23(2):60-63. 被引量:10
  • 3Alexander Bilke, Felix Naumara Schema Matching using Duplicates[C]. Proe of the 21st International Conference on Data Engineering, ICDE, 2005 : 69-80.
  • 4李由,刘东波,张维明.基于数据实例分布特征的自动模式匹配方法[J].计算机科学,2005,32(11):85-87. 被引量:11
  • 5Mingchuan Guo, Yong Yu. Mutual enhancement of schema mapping and data mapping[C]. Proc of the ACM SIGKDD 2004 Workshop on Mining for and from the Semantic Web, USA: Seattle, 2004: 129-141.
  • 6Xuan Zhou, Julien Gaugaz, Wolf-Tilo Balke etc. Query relaxation using malleable schemas[C]. Proc of the 2007 ACM SIGMOD international conference on Management of data, China: Beijing, 2007 : 152-16.

二级参考文献13

  • 1史忠植.知识发现[M].北京:清华大学出版社,2004.137-140.
  • 2Doan A H. Learning to Map between Structured Representations of Data:[PhD thesis]. University of Washington.
  • 3Madhavan J, Bernstein P A, Rahm E. Generic Schema Matching with Cupid. VLDB 2001.
  • 4Doan A H,Domingos P, Halevy A. Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach. In: SIGMOD Record, 2001.
  • 5Melink S,Garcia-Molina H,Rahm E. Similarity Flooding: A Versatile Graph Matching Algorithm and its Application to Schema Matching. In:Proc 18^th Intl Conf. on Data Engineering,2002.
  • 6Dell'Erba M,Fodor O,Ricci F,et al. Harmonise: A Solution for Data Interoperability. IFIP I3E 2002.
  • 7Rahm E, Philip A. Bernstein. A survey of approaches to automatic schema matching. VLDB Journal, 2001,10: 334-350.
  • 8Do H H, Rahm E. COMA-A system for flexible combination of schema matching approaches. In:Proc. 28th VLDB Conference.
  • 9Madhavan J, P A Bernstein,E Rahm. Geueric Schema Matching Using Cupid[R]. MSR Tech. Report, MSR-TR-2001-58, 2001.
  • 10Miller R, I, Haas, M A Hernandez. Schema Mapping as Query Discovery[C]. VLDB, 2000. 77-88.

共引文献19

同被引文献24

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部