期刊文献+

数据集成中的一种数据合并技术 被引量:2

A Technology of Data Merging in Data Integration
下载PDF
导出
摘要 本文讨论了在数据集成过程中遇到的数据合并问题,主要包括重复记录判断(对象识别技术)和重复记录的冲突处理(冲突解决机制)等,提出了比较实用、有效的方法,并通过实验对多表合并的两种算法进行了比较,指出了需要进一步改进的方向。 This paper presentes some problems and their solutions when carrying out data integrating. The problems mainly include duplicated records identification (Object Identification Technique) and confilict processing for duplicated records (Conflict Resolution Mechanism). We propose a practical and available method and according our experiment results we compare two merging algorithms merging multiple tables into one target table.At last we pointe out some aspects that needs to improve.
出处 《现代计算机》 2003年第11期6-9,36,共5页 Modern Computer
关键词 数据集成 数据合并 数据质量 数据源 数据模式 数据处理 Information Integration Object Identification Conflict Resolution ETL
  • 相关文献

参考文献4

  • 1王能斌.数据库系统原理[M].北京:电子工业出版社,2001..
  • 2AnHai Doan, Pedro Domingos. Alon Y.Levy : Data Integration: A "Killer App" for Multistrategy Learning. In Proceedings of the Workshop on Multi-Strategy Learning (MSL--00),2000.
  • 3Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, Daniele Nardi, Riccardo Rosafi; A Principled Approach to Data Integration and Reconciliation in Data Warehousing in Design and Management of Data Warehouses,1999.
  • 4Felix Naumann, Matthias Haeussler. Declarative Data Merging with Conflict Resolution. In Proceedings of the International Conference on Information Quality, 2002.

共引文献16

同被引文献11

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部