摘要
数据负载均衡对Hadoop分布式文件系统(HDFS)性能有着重要的影响,针对HDFS中默认的数据负载均衡方法存在的效率低和缺乏灵活性的不足,文中提出了一种新的动态负载均衡方法,即通过控制变量来动态分配网络带宽以达到数据负载均衡.在此基础上建立了基于控制变量的数据负载均衡数学模型.实验结果表明,文中提出的方法既能保证HDFS的数据访问性能,又能提高集群加入新节点时的数据负载均衡效率.
Data load balancing greatly affects the performance of the Hadoop distributed file system (HDFS). In order to overcome the inefficiency and inflexibility of the default data load balancing method in HDFS, this paper devises a novel dynamic load balancing method, which dynamically allocates network bandwidth to achieve the data load balancing by controlling variables. Then, the corresponding mathematical model is constructed based on the controlled variables. Experimental results show that the devised method can not only guarantee the performance of the HDFS data access system but also improve the data load balancing efficiency in the presence of a new cluster node.
出处
《华南理工大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2012年第9期42-47,共6页
Journal of South China University of Technology(Natural Science Edition)
基金
广东省自然科学基金资助项目(10451064101005155
S2011010001754)
广东省科技计划项目(2012B010100030)
广东省战略性新兴产业核心技术攻关项目(2011A010801002)
广州市海珠区科技计划项目(x2jsB2120750)