期刊文献+

Hadoop异构集群中数据负载均衡的研究 被引量:6

RESEARCH ON DATA LOAD BALANCING IN HETEROGENEOUS HADOOP CLUSTER
下载PDF
导出
摘要 Hadoop平台下,数据的负载均衡对平台性能的发挥有着深远的影响。首先分析默认数据负载均衡的局限性,针对现有默认HDFS(Hadoop Distributed File System)数据负载均衡算法只考虑存储空间利用率,而未考虑节点间异构性的问题,提出一种量化异构集群数据负载均衡的数学模型。该模型根据节点的存储空间及节点性能计算得到各个节点的理论空间利用率,并根据当前集群存储空间利用率动态调整节点最大负载。实验结果表明,提出的数据负载均衡策略能够让异构集群达到更合理的均衡状态,提高集群的效率,并有效减少作业的执行时间。 In Hadoop,the data load balancing has profound effect on the exertion of platform performance. First we analysed the limitation of default data load balancing,aiming at the problem of current default HDFS( Hadoop distributed file system) that the data load balancing algorithm only focuses on the storage space utilisation but not considers the heterogeneity between nodes,we presented a mathematic model which quantifies the data load balancing of heterogeneous clusters. The model calculates the theoretical space utilisation of each node based on their allocated storage space and processing capacity,and dynamically adjusts the maximum load of each node according to current average utilisation of cluster storage space. Experimental result showed that the proposed data balancing strategy could enable the heterogeneous clusters to reach more reasonable balancing state so as to improve clusters efficiency,and to decrease the execution time of job effectively as well.
出处 《计算机应用与软件》 CSCD 2016年第5期31-34,共4页 Computer Applications and Software
基金 国家自然科学基金项目(61202350)
关键词 HADOOP HDFS 数据负载均衡 异构集群 Hadoop HDFS Data load balancing Heterogeneous cluster
  • 相关文献

参考文献10

二级参考文献26

  • 1分布式基础学习[EB/OL]2009-02-22.http://www.cnblogs.com/duguguiyu/archive/2009/02/22/1396034.html.
  • 2Caibinbupt.Hadoop源代码分析[EB/OL].http://eaibin-bupt.iteye.com/blog/262412#bc2244008.
  • 3Dhruba Borthakur. The hadoop distributed file sys- tem.- architecture and design [EB/OL]. (2008-09-02) [2010-08-25]. http://hadoop, apache, org/common/ docs/r0.16.0/hdfs_design, html.
  • 4Jeffrey Dean, Sanjay Ghemawat. MapReduce: sim- plied data processing on large elusters[C]// Proceed- ings of the 6th Symposium on Operating System De- sign and Implementation. New York: ACM Press, 2004:137-150.
  • 5Hadoop HDFS[EB/OL]. (2011 - 10-18) [2011 - 10- 25]. http://hadoop, apache, org/hdfs/.
  • 6Caibinbupt.Hadoop源代码分析(重读GFS的文章)[EB/OL].(2009-01-29)[2010-8-25].http://caibinbupt.javaeye.com/blog/318949.
  • 7Tom White. Hadoop.. the definitive guide[M]. United States of America: O'Reilly Media, Inc. 2009.
  • 8TOM WHITE Hadoop :The Definitive Guide[ M ]. U- nited States of America: O1Reilly Media,Inc,2009.
  • 9JEFFREY DEAN, SAN JAY GHEMAWAT. MapRe- duce : Simplied data processing on large clusters [ C ]// Proceedings of the 6th Symposium on Operating System Design and Implementation. New York: ACM Press. 2004 : 137-150.
  • 10HADOOP HDFS[EB/OL]. (2011-10-18) [2011-10- 25] http ://hadoop. apache, org/hdfs/.

共引文献330

同被引文献38

引证文献6

二级引证文献32

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部