期刊文献+

A New ETL Approach Based on Data Virtualization 被引量:1

A New ETL Approach Based on Data Virtualization
原文传递
导出
摘要 ETL (Extract-Transform-Load) usually includes three phases: extraction, transformation, and loading. In building data warehouse, it plays the role of data injection and is the most time-consuming activity. Thus it is necessary to improve the performance of ETL. In this paper, a new ETL approach, TEL (Transform-Extract-Load) is proposed. The TEL approach applies virtual tables to realize the transformation stage before extraction stage and loading stage, without data staging area or staging database which stores raw data extracted from each of the disparate source data systems. The TEL approach reduces the data transmission load, and improves the performance of query from access layers. Experimental results based on our proposed benchmarks show that the TEL approach is feasible and practical. ETL (Extract-Transform-Load) usually includes three phases: extraction, transformation, and loading. In building data warehouse, it plays the role of data injection and is the most time-consuming activity. Thus it is necessary to improve the performance of ETL. In this paper, a new ETL approach, TEL (Transform-Extract-Load) is proposed. The TEL approach applies virtual tables to realize the transformation stage before extraction stage and loading stage, without data staging area or staging database which stores raw data extracted from each of the disparate source data systems. The TEL approach reduces the data transmission load, and improves the performance of query from access layers. Experimental results based on our proposed benchmarks show that the TEL approach is feasible and practical.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2015年第2期311-323,共13页 计算机科学技术学报(英文版)
关键词 cloud computing big data ETL heterogeneous database data virtualization cloud computing, big data, ETL, heterogeneous database, data virtualization
  • 相关文献

参考文献1

二级参考文献13

  • 1陈弦,陈松乔.基于数据仓库的通用ETL工具的设计与实现[J].计算机应用研究,2004,21(8):214-216. 被引量:26
  • 2韩京宇,徐立臻,董逸生.ETL执行的流水线优化[J].小型微型计算机系统,2005,26(6):1013-1017. 被引量:15
  • 3Simitsisl A,Vassiliadis P.A Methodology for the Conceptual Modeling of ETL Processes[].The th Conf on Advanced Information Systems Engineering(CAiSE’).2003
  • 4Vassiliadis P,Simitsis A,Georgantas P, et al.A Framework for the Design of ETL Scenarios[].CAiSE’.2003
  • 5Vassiliadis P,Simitsis A,Terrovitis M, et al.Blueprints and Measures for ETL Workflows[].The th Intl Conf on Conceptual Modeling.2005
  • 6Bleiholder J,Naumann F.Declarative Data Fusion Syntax, Semantics, and Implementation[].Advances in Databases and Information Systems.2005
  • 7Vassiliadis P,Simitsis A,Skiadopoulos S.On the Logical Modeling of ETL Processes[].The th Intl Conf on Advanced Information Systems Engineering (CAiSE’).2002
  • 8Business Objects Corporation.Data Integrator Introduce. http://www.businessobjects.com . 2005
  • 9IBM Corporation.Data Integration Software-IBM WebSphere DataStage. http://www.ibm.com . 2006
  • 10Informatica Corporation.Data Integration-Informatica. http://www.informatica.com . 2004

共引文献1

同被引文献1

引证文献1

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部