期刊文献+

基于动态带宽分配的Hadoop数据负载均衡方法 被引量:10

Hadoop Data Load Balancing Method Based on Dynamic Bandwidth Allocation
下载PDF
导出
摘要 数据负载均衡对Hadoop分布式文件系统(HDFS)性能有着重要的影响,针对HDFS中默认的数据负载均衡方法存在的效率低和缺乏灵活性的不足,文中提出了一种新的动态负载均衡方法,即通过控制变量来动态分配网络带宽以达到数据负载均衡.在此基础上建立了基于控制变量的数据负载均衡数学模型.实验结果表明,文中提出的方法既能保证HDFS的数据访问性能,又能提高集群加入新节点时的数据负载均衡效率. Data load balancing greatly affects the performance of the Hadoop distributed file system (HDFS). In order to overcome the inefficiency and inflexibility of the default data load balancing method in HDFS, this paper devises a novel dynamic load balancing method, which dynamically allocates network bandwidth to achieve the data load balancing by controlling variables. Then, the corresponding mathematical model is constructed based on the controlled variables. Experimental results show that the devised method can not only guarantee the performance of the HDFS data access system but also improve the data load balancing efficiency in the presence of a new cluster node.
作者 林伟伟 刘波
出处 《华南理工大学学报(自然科学版)》 EI CAS CSCD 北大核心 2012年第9期42-47,共6页 Journal of South China University of Technology(Natural Science Edition)
基金 广东省自然科学基金资助项目(10451064101005155 S2011010001754) 广东省科技计划项目(2012B010100030) 广东省战略性新兴产业核心技术攻关项目(2011A010801002) 广州市海珠区科技计划项目(x2jsB2120750)
关键词 HADOOP 负载均衡 带宽 Hadoop load balancing bandwidth
  • 相关文献

参考文献15

  • 1Apache. Hadoop [ EB/OL ]. [ 2012- 01- 03 ]. http ://lu- cene. apache, org/hadoop.
  • 2林伟伟.一种改进的Hadoop数据放置策略[J].华南理工大学学报(自然科学版),2012,40(1):152-158. 被引量:35
  • 3Prashant S, Kamalakar K. A multi-agent simulation frame- work on small Hadoop cluster [ J ]. Engineering Applica- tions of Artificial Intelligence ,2011,24 (7) : 1120-1127.
  • 4Qiu Zhi, Lin Zhao-wen, Ma Yan. Research of Hadoop- based data flow management system [ J ]. The Journal of China Universities of Posts and Telecommunications, 2011,18(2) :164-168.
  • 5Ye Xianglong, Huang Mengxing, Zhu Donghai, et al. A no- vel blocks placement strategy for Hadoop [ C ]//Procee- dings of the 1 l th International Conference on Computer and Information Science. Washington D C : IEEE, 2012 : 3-7.
  • 6Sadasivam G S, Selvaraj D. A novel parallel hybrid PSO- GA using MapReduce to schedule jobs in Hadoop data grids [ C] //Proceedings of the Second World Congress on Nature and Biologically Inspired Computing. Fukuoka:IEEE, 2010 : 15-17.
  • 7Dean J, Ghemawat S. MapReduce:simplified data proce- ssing on large clusters [ J]. Communications of the ACM, 2008,51 ( 1 ) : 107-113.
  • 8Ghemawat S, Gogioff H, Leung P T. The google file system [ C ]//Proceedings of the 19th ACM Symposium on Ope- rating Systems Principles. New York : ACM,2003 : 29- 43.
  • 9Jeremy Z. Yahoo!Launches world's largest Hadoop produc- tion application [ EB/OL ]. ( 2008 - 02-19 ) [ 2012- 01- 03 ]. http ://marcboucher. ws/2008/02/hadoop-scales-really-well-yahoo-launches-worlds-largest-hadoop-production-ap- plication, html.
  • 10Loughran Steve. Applications powered by Hadoop [ EB/ OL]. [2012-01-03]. http: //wiki. apache, org/hadoop! PoweredBy.

二级参考文献27

  • 1Deelman E,Chervenak A.Data management challenges of data-intensive scientific workflows//Proceedings of the IEEE International Symposium on Cluster Computing and the Grid(CCGRID).Lyon,France,2008:687-692.
  • 2Deelman E,Blythe J,Gil Y,Kesselman C,Mehta G,Patil S,Su M H,Vahi K,Livny M.Pegasus:Mapping scientific workflows onto the grid//Proceedings of the European Across Grids Conference(AxGrids).Nicosia,Cyprus,2004:11-20.
  • 3Ludascher B,Altintas I,Berkley C,Higgins D,Jaeger E,Jones M,Lee E A.Scientific workflow management and the Kepler system.Concurrency and Computation:Practice and Experience,2005,18(10):1039-1065.
  • 4Oinn T,Addis M,Ferris J,Marvin D,Senger M,Greenwood M,Carver T,Glover K,Pocock M R,Wipat A,Li P.Taverna:A tool for the composition and enactment of bioinformatics workflows.Bioinformatics,2004,20(17):3045-3054.
  • 5Ghemawat S,Gobioff H,Leung S T.The google file system.ACM SIGOPS Operating Systems Review,2003,37(5):29-43.
  • 6Wang L,Tao J,Kunze M,Castellanos A C,Kramer D,Karl W.Scientific cloud computing:Early definition and experience//Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications(HPCC).Dalian,China,2008:825-830.
  • 7Wieczorek M,Prodan R,Fahringer T.Scheduling of scientific workflows in the ASKALON grid environment.SIGMOD Record,2005,34(3):56-62.
  • 8Baru C,Moore R,Rajasekar A,Wan M.The SDSC storage resource broker//Proceedings of the IBMCentre for Advanced Studies Conference.Toronto,Canada,1998:1-12.
  • 9Churches D,Gombas G,Harrison A,Maassen J,Robinson C,Shields M,Taylor I,Wang I.Programming scientific and distributed workflow with Triana services.Concurrency and Computation:Practice and Experience,2006,18:1021-1037.
  • 10Chervenak A,Deelman E,Foster I,Guy L,Hoschek W,Iamnitchi A,Kesselman C,Kunszt P,Ripeanu M,Schwartzkopf B,Stockinger H,Stockinger K,Tierney B.Giggle:A framework for constructing scalable replica location services//Proceedings of the ACM/IEEE Conference on Supercomputing.Baltimore,Maryland,USA,2002:1-17.

共引文献153

同被引文献139

引证文献10

二级引证文献58

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部