期刊文献+

一种面向大规模数据处理的数据库引擎 被引量:1

Database Engine for Large Scale Data Processing
下载PDF
导出
摘要 当数据量从GB级上升至TB级甚至PB级时,具有高性能的并行数据库在保证扩展性和容错性的同时计算代价会很高。针对该问题,设计一种面向大规模数据处理的并行数据库引擎FlexDB。利用Map Reduce的并行计算框架作为通信层,调度和协调集群中各节点的计算和通信。实验结果表明,FlexDB的系统性能接近于并行数据库,并且具有较好的扩展性和容错性。 When the amount of data from GB goes up to TB level or even PB level,parallel database with high performance cost too much in order to achieve scalability and fault tolerance.To address the problem,this paper designs a parallel database engine——FlexDB,which is based on Map Reduce.The parallel computing framework of Map Reduce is as a communication layer of FlexDB which is to assign computing tasks and coordinate communications among all nodes in cluster.Experimental results show that the FlexDB system performance is close to parallel database,and has good expansibility and fault tolerance.
出处 《计算机工程》 CAS CSCD 2012年第11期48-50,共3页 Computer Engineering
基金 国家自然科学基金资助项目(60803117)
关键词 海量数据 扩展性 容错性 Map Reduce框架 并行数据库 mass data scalability fault tolerance Map Reduce framework parallel database
  • 相关文献

参考文献12

  • 1EMC Corporation. IDC Digital Universe Study[EB/OL]. (2011- 07-11). http://www.emc.com/collateral/demos/microsites/digital- universe-2011/index.htm.
  • 2Azza A, Kamil B P, Daniel J A. HadoopDB: An Architectural Hybrid of Map Reduce and DBMS Technologies for Analytical Workloads[C]//Proc. of VLDB'09. Lyon, France: ACM Press, 2009.
  • 3Dean J, Ghemawat S. Map Reduce: Simplified Data Processing on Large Clusters[C]//Proc. of the 6th Symposium on Operating Systems Design and Implementation. San Francisco, USA: [s. n.], 2004.
  • 4Sanjay G, Howard G, Shun-Tak L. The Google File System[C]// Proc. of SOSP'03. New York, USA: ACM Press, 2003.
  • 5Apache Hadoop Organization. Hadoop[EB/OL]. (2010-08-16). http://www.hadoop.apache.org.
  • 6de Witt D J, Stonebraker M. Map Reduce: A Major Step Back- wards[EB/OL]. (2010-04-13). http://www.washington.edu/homes/ billhowe/mapreduce_a_maj orstep_backwards.html.
  • 7Michael S, Daniel A, David J D. Map Reduce and Parallel DBMSs: Friends or Foes?[EB/OL]. (2011-05-16). http://database.cs.brown. edu/papers/stonebraker-cacm2010.pdf.
  • 8Apache Hadoop Organization. Hive[EB/OL]. (2011-08-21). http:// www.hive.apache.org/.
  • 9EMC Corporation. Greenplum[EB/OL]. (2011-06-12), http://www. greenplum.com/.
  • 10周敏.Anthill:一种基于MapReduce的分布式DBMS[D].广州:暨南大学,2010.

同被引文献8

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部