期刊文献+

分布式ETL中协同机制的研究与设计 被引量:10

Research and design of collaborative mechanism in distributed ETL tool
下载PDF
导出
摘要 ETL工具在构造数据仓库过程中负责数据抽取、转换和加载的工作。商用的ETL工具一般需要运行在高性能的服务器如小型机上进行大量的计算工作,造成数据仓库项目在硬件方面的成本必须大大的提高。针对计算效率的问题,提出了一种多计算服务器的协同计算模型,通过创建一协调中心来控制多计算服务器的协同处理计算来增加计算能力。与原有的ETL工具相比,可以在一定程度上降低ETL工具对服务器硬件性能的要求。 ETL tool do extraction, transformation and loading job in the process of building a data warehouse. Commercial ETL tool must run at high-powered server such as minicomputers to do large number of computation, It greatly increased the cost in purchasing hardware. According to the problem about computation efficiency, a cooperative computation model among multi-servers was put forward. A coordinator was created to control multi-server's cooperative process for reinforcing server's computation ability. Compared to the former ETL tools, it reduces the requirements for the server's performance in a certain extent.
出处 《通信学报》 EI CSCD 北大核心 2006年第11期177-182,共6页 Journal on Communications
关键词 ETL 协同工作 分布式计算 ETL cooperative work distributed computation
  • 相关文献

参考文献9

二级参考文献33

  • 1[美]Microsoft Corporation.Microsoft SQL Sever 2000数据操作和复制[M].北京:科学出版社,2001.547-552.
  • 2姜炜.ETL 应用浅析[EB/OL].http://www.dwway.com/document.phpid=375sortid=8,2003-05-12.
  • 3吴悦.如何选择ETL工具[EB/OL].http://www.dwway.com/document.phpid=376sortid=8,2003-05-12.
  • 4Common Warehouse Metamodel(CWM) Specification [S]. version 1.0,2001.
  • 5Aebi, D., Perrochon, L. Towards improving data quality. In: Sarda, N.L., ed. Proceedings of the International Conference on Information Systems and Management of Data. Delhi, 1993. 273~281.
  • 6Wang, R.Y., Kon, H.B., Madnick, S.E. Data quality requirements analysis and modeling. In: Proceedings of the 9th International Conference on Data Engineering. Vienna: IEEE Computer Society, 1993. 670~677.
  • 7Rahm, E., Do, H.H. Data cleaning: problems and current approaches. IEEE Data Engineering Bulletin, 2000,23(4):3~13.
  • 8Galhardas, H., Florescu, D., Shasha, D., et al. AJAX: an extensible data cleaning tool. In: Chen, W.D., Naughton, J.F., Bernstein, P.A., eds. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. Texas: ACM, 2000. 590.
  • 9Hernandez, M.A., Stolfo, S.J. Real-World data is dirty: data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery, 1998,2(1):9~37.
  • 10Lee, M.L., Ling, T.W., Lu, H.J., et al. Cleansing data for mining and warehousing. In: Bench-Capon, T., Soda, G., Tjoa, A.M., eds. Database and Expert Systems Applications. Florence: Springer, 1999. 751~760.

共引文献306

同被引文献58

引证文献10

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部