期刊文献+

信息系统数据集成技术浅析

A Review of Data Integration Techniques in Information Systems
下载PDF
导出
摘要 根据目前信息化的发展现状和趋势,对信息系统中数据集成技术进行了综合分析。重点阐述了ETL(数据抽取、转换和装载)和数据清洗的工作过程,并对目前流行的几种主要商业ETL工具进行了评估和比较。这种评估比较能进一步指导信息系统的使用,同时也为信息系统的演化提供了理论支持。 This paper reviews the state-of-the-art of data integration and future trend of information techniques. Mainly we analyze the processes of ETL ( data extraction, transformation and loading) and data cleaning Besides, we also compare and evaluate many commercial ETL tools from the perspective of supporting data source and supporting data transformation. This comparison would help the using of information systems, and it also provides a theoretical support for the evolution of such systems.
作者 赵淼
出处 《电子工程师》 2006年第8期44-47,共4页 Electronic Engineer
关键词 信息化 数据集成 ETL 数据清洗 information data integration ETL data cleaning
  • 相关文献

参考文献20

  • 1HANG D, MANNILA H, SMYTI-I P. Principles of data mining[M]. Cambridge, MA, USA: The MIT Press, 2001.
  • 2DASU T, JOHNSON T. Exploratory data mining and data cleaning[M]. New York, NY, USA: John Wiley & Sons,2003.
  • 3THOMSEN E. OLAP solutions: Building multidimensional information systems[ M]. New York, NY, USA: John Wiley &Sons, 1997.
  • 4KIMBALL R, REEVES L, ROSS M, et al. The data warehouse lifecycle toolkit: Expert methods for designing, developing, and deploying data warehouses[ M ]. New York, NY,USA: John Wiley & Sons, 2000.
  • 5KIMBALL R, CASERTA, J. The data warehouse ETL toolkit: Practical techniques for extracting, Cleaning[ M]. New York,NY, USA : John Wiley & Sons, 2004.
  • 6MARCO D. Building and managing the meta data repository: A full lifecycle guide [ M ]. New York, NY, USA : Wiley &Sons, 2000.
  • 7RAHM E, DO H H. Data cleaning: Problems and current approaches[ J ]. IEEE Data Engineering Bulletin, 2000, 23(4):3-13.
  • 8MAYDANCHIK A. Challenges of efficient data cleansing[ J].DM Review, September 1999.
  • 9郭志懋,周傲英.数据质量和数据清洗研究综述[J].软件学报,2002,13(11):2076-2082. 被引量:265
  • 10GALHARDAS H, FLORESCU D, SHASHA D, et al. Declarative data cleaning: language, models, and algorithms[ C]//Proceedings of 27th International Conference on Very Large Data Bases(VLDB'O1 ), Sep 11-14, 2001, Roma, Italy. 2001 : 271-380.

二级参考文献24

  • 1Aebi, D., Perrochon, L. Towards improving data quality. In: Sarda, N.L., ed. Proceedings of the International Conference on Information Systems and Management of Data. Delhi, 1993. 273~281.
  • 2Wang, R.Y., Kon, H.B., Madnick, S.E. Data quality requirements analysis and modeling. In: Proceedings of the 9th International Conference on Data Engineering. Vienna: IEEE Computer Society, 1993. 670~677.
  • 3Rahm, E., Do, H.H. Data cleaning: problems and current approaches. IEEE Data Engineering Bulletin, 2000,23(4):3~13.
  • 4Galhardas, H., Florescu, D., Shasha, D., et al. AJAX: an extensible data cleaning tool. In: Chen, W.D., Naughton, J.F., Bernstein, P.A., eds. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. Texas: ACM, 2000. 590.
  • 5Hernandez, M.A., Stolfo, S.J. Real-World data is dirty: data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery, 1998,2(1):9~37.
  • 6Lee, M.L., Ling, T.W., Lu, H.J., et al. Cleansing data for mining and warehousing. In: Bench-Capon, T., Soda, G., Tjoa, A.M., eds. Database and Expert Systems Applications. Florence: Springer, 1999. 751~760.
  • 7Monge, A.E. Matching algorithm within a duplicate detection system. IEEE Data Engineering Bulletin, 2000,23(4):14~20.
  • 8Monge, A.E., Elkan, C. The field matching problem: algorithms and applications. In: Simoudis, E., Han, J.W., Fayyad, U., eds. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. Oregon: AAAI Press, 1996. 267~270.
  • 9Savasere, A., Omiecinski, E., Navathe, S.B. An efficient algorithm for mining association rules in large databases. In: Dayal, U., Gray, P., Nishio, S., eds. Proceedings of the 21st International Conference on Very Large Data Bases. Zurich: Morgan Kaufmann, 1995. 432~444.
  • 10Srikant, R., Agrawal, R. Mining Generalized Association Rules. In: Dayal, U., Gray, P., Nishio, S., eds. Proceedings of the 21st International Conference on Very Large Data Bases. Zurich: Morgan Kaufmann, 1995. 407~419.

共引文献264

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部