
Cost-Based MapReduce Workflow Optimizers
Abstract: Optimizations at different layers of the MapReduce stack each have their own advantages and disadvantages. For the problem of optimizing MapReduce workloads, the relevant concepts are introduced. Cost-based optimization of MapReduce jobs and the techniques it relies on are described, and the approach is evaluated through comparison with RoT. Based on the dataflow and resource dependencies within a workflow, three workflow optimizers are proposed; cost-based workflow optimization is evaluated, and an end-to-end evaluation of the workflow optimizers is presented. The optimization overhead of the workflow optimizers is measured experimentally, and the advantages and disadvantages of the three optimizers are compared.
Author: 冯秋燕 (Feng Qiuyan)
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2015, No. 21, pp. 64-69 (6 pages)
Keywords: MapReduce workloads; optimization; dataflow dependencies; resource dependencies; workflow optimizer


