期刊文献+

基于实体分类的数据库模式匹配方法 被引量:8

An Approach to Database Schema Matching Based on Entity Classification
下载PDF
导出
摘要 模式匹配在诸如数据集成、数据仓库、信息共享和计算机网络交换等许多应用领域起到关键作用。目前,自动模式匹配方法也不能解决复杂模式情况下的匹配问题。本文提出一种基于关系模式领域中实体分类的数据库模式匹配方法。该方法通过朴素贝叶斯学习将实体分为不同的类(子模式),然后以同样的类来匹配子模式之间的模式元素。本方法在复杂模式条件下可有效提高匹配效率,减少匹配工作量,节省人力资源。 Schema matching plays a key role in many application domains,such as data integration,data warehouse, and information share and exchange on computer network. Currently,approaches of automatic schema matching cannot solve matching issue under the circumstance of complex schema well. This paper introduces an approach based on entity classification in the domain of relation schema. It divides entities into different categories (sub-schema) using Na ive Bayes Learning ,and then matches schema elements between the sub-schemas with the same category. It can effectively improve matching results, reduce the number of element-to-element comparisons and save user efforts under the circumstance of complex schema.
出处 《计算机科学》 CSCD 北大核心 2004年第10期157-159,F004,共4页 Computer Science
基金 国家"973"重点基础研究发展规划项目(G1999032705) 国家"十五"科技攻关计划(2001BA102A01)
关键词 数据库模式 模式匹配 子模式 朴素贝叶斯 匹配方法 关系模式 计算机网络 实体分类 关键作用 工作量 Schema matching,Entity,Sub-schema ,Naive bayes learning,Data warehouse
  • 相关文献

参考文献10

  • 1Rahm E, Bernstein P A. A survey of approaches to automatic schema matching. The VLDB Journal,2001,10: 334-350
  • 2Berlin J, Motro A. Database Schema Matching Using Machine Learning with Feature Selection. In: Proc. of the 14th Intl.Conf. on Advanced Information Systems Engineering (CAiSE),2002
  • 3Kang J, Naughton J F. On Schema Matching with Opaque Column Names and Data Values. SIGMOD Conf. 2003
  • 4Kang J ,Naughton J F. On Schema Matching with Opaque Column Names and Data Values. SIGMOD Conf. 2003
  • 5Milo T,Zohar S. Using Schema Matching to Simplify Heterogeneous Data Translation. In Proc. of the Intl. Conf. on Very Large Databases(VLDB), 1998
  • 6Madhavan J, Bernstein P A, Rahm E. Generic Schema Matching with Cupid. In: Proc 27th Int Conf. on Very Large Data Bases (VLDB), 2001
  • 7Li W,Clifton C. SemInt: a Tool for Identifying attribute correspondences in heterogeneous databases using neural network. Data Knowledge Engineering, 2000,33(1):49-84
  • 8Berlin J, Motro A. Database Schema Matching Using Machine Learning with Feature Selection. In: Proc. of the 14th Intl.Conf. Advanced Information Systems Engineering (CAiSE),2002
  • 9Joachims T A. Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization:[ Computer Science Technical Report CMU-CS-96-118]. Carnegie Mellon University,1996
  • 10Velegrakis Y,Miller R J, Popa L. Mapping Adaptation Under Evolving Schemas. In: VLDB, Berlin, Germany, 2003

同被引文献44

引证文献8

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部