
SMART: Speedup Job Completion Time by Scheduling Reduce Tasks

Abstract: Distributed computing systems have been widely used as the amount of data grows exponentially in the era of information explosion. Job completion time (JCT) is a major metric for assessing their effectiveness. How to reduce the JCT for these systems through reasonable scheduling has become a hot issue in both industry and academia. Data skew is a common phenomenon that can compromise the performance of such distributed computing systems. This paper proposes SMART, which can effectively reduce the JCT through handling the data skew during the reducing phase. SMART predicts the size of reduce tasks based on part of the completed map tasks and then enforces largest-first scheduling in the reducing phase according to the predicted reduce task size. SMART makes minimal modifications to the original Hadoop with only 20 additional lines of code and is readily deployable. The robustness and the effectiveness of SMART have been evaluated with a real-world cluster against a large number of datasets. Experiments show that SMART reduces JCT by up to 6.47%, 9.26%, and 13.66% for Terasort, WordCount and InvertedIndex respectively with the Purdue MapReduce benchmarks suite (PUMA) dataset.
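The two steps named in the abstract, size prediction from a subset of finished map tasks and largest-first ordering of reduce tasks, can be illustrated with a minimal Python sketch. This is not the authors' Hadoop patch. The linear extrapolation in `predict_reduce_sizes`, the slot model in `makespan`, and all function names are assumptions made for illustration; the abstract does not specify the actual predictor.

```python
import heapq


def predict_reduce_sizes(completed_map_outputs, total_map_tasks):
    """Estimate the final input size of each reduce partition.

    completed_map_outputs: one list per finished map task, giving the
    bytes that task emitted for each reduce partition.
    Assumption: remaining map tasks follow the same distribution, so the
    observed totals are scaled up linearly.
    """
    if not completed_map_outputs:
        raise ValueError("need at least one completed map task")
    num_partitions = len(completed_map_outputs[0])
    scale = total_map_tasks / len(completed_map_outputs)
    totals = [0.0] * num_partitions
    for output in completed_map_outputs:
        for partition, size in enumerate(output):
            totals[partition] += size
    return [total * scale for total in totals]


def largest_first_order(predicted_sizes):
    """Return reduce partition indices sorted by predicted size, largest first."""
    return sorted(
        range(len(predicted_sizes)),
        key=lambda p: predicted_sizes[p],
        reverse=True,
    )


def makespan(order, sizes, num_slots):
    """Finish time when tasks are started in `order` on `num_slots` slots.

    Hypothetical cost model: run time is proportional to input size, and
    each task starts on whichever slot becomes free first.
    """
    slots = [0.0] * num_slots
    heapq.heapify(slots)
    for partition in order:
        earliest_free = heapq.heappop(slots)
        heapq.heappush(slots, earliest_free + sizes[partition])
    return max(slots)
```

With two of four map tasks finished and outputs `[[10, 10, 10, 70], [12, 8, 10, 70]]`, the sketch predicts sizes `[44.0, 36.0, 40.0, 280.0]` and orders the partitions as `[3, 0, 2, 1]`. On two reduce slots under this toy cost model, the default index order finishes at 324.0 while the largest-first order finishes at 280.0, because the skewed partition starts immediately instead of last.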
Authors: Jia-Qing Dong (董加卿); Ze-Hao He (何泽昊); Yuan-Yuan Gong (龚媛媛); Pei-Wen Yu (于沛文); Chen Tian (田臣); Wan-Chun Dou (窦万春); Gui-Hai Chen (陈贵海); Nai Xia (夏耐); Hao-Ran Guan (管浩然) (State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China; State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China; School of Computer Science, The University of Sydney, Sydney NSW 2006, Australia)
Source: Journal of Computer Science & Technology (计算机科学技术学报, English edition), 2022, Issue 4, pp. 763-778 (16 pages). Indexed in SCIE, EI, CSCD.
Funding: This work was supported by the National Key Research and Development Project of China under Grant No. 2020YFB1707600, the National Natural Science Foundation of China under Grant Nos. 62072228, 61972222 and 92067206, the Fundamental Research Funds for the Central Universities of China, the Collaborative Innovation Center of Novel Software Technology and Industrialization, and the Jiangsu Innovation and Entrepreneurship (Shuangchuang) Program.