期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
SMART: Speedup Job Completion Time by Scheduling Reduce Tasks
1
作者 jia-qing dong Ze-Hao He +6 位作者 Yuan-Yuan Gong Pei-Wen Yu Chen Tian Wan-Chun Dou Gui-Hai Chen Nai Xia Hao-Ran Guan 《Journal of Computer Science & Technology》 SCIE EI CSCD 2022年第4期763-778,共16页
Distributed computing systems have been widely used as the amount of data grows exponentially in the era of information explosion. Job completion time (JCT) is a major metric for assessing their effectiveness. How to ... Distributed computing systems have been widely used as the amount of data grows exponentially in the era of information explosion. Job completion time (JCT) is a major metric for assessing their effectiveness. How to reduce the JCT for these systems through reasonable scheduling has become a hot issue in both industry and academia. Data skew is a common phenomenon that can compromise the performance of such distributed computing systems. This paper proposes SMART, which can effectively reduce the JCT through handling the data skew during the reducing phase. SMART predicts the size of reduce tasks based on part of the completed map tasks and then enforces largest-first scheduling in the reducing phase according to the predicted reduce task size. SMART makes minimal modifications to the original Hadoop with only 20 additional lines of code and is readily deployable. The robustness and the effectiveness of SMART have been evaluated with a real-world cluster against a large number of datasets. Experiments show that SMART reduces JCT by up to 6.47%, 9.26%, and 13.66% for Terasort, WordCount and InvertedIndex respectively with the Purdue MapReduce benchmarks suite (PUMA) dataset. 展开更多
关键词 job scheduling job completion time MAPREDUCE HADOOP
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部