期刊文献+

Heron环境下基于实例重分配的传输负载优化策略

Transmission load optimization strategy based on instance reallocation in Heron
下载PDF
导出
摘要 作为新一代大数据流式计算框架,Heron忽略了任务实例之间不同通信方式的差异以及节点资源利用率不均衡的问题导致系统性能下降。针对这一问题,设计了节点资源限制模型、通信开销优化模型和实例数据流关系模型,并在此基础上提出了Heron环境下基于实例重分配的传输负载优化策略(transmission load optimization strategy based on instance reallocation in Heron,TLIR-Heron)。该策略包括节点资源限制算法和实例重分配算法,通过判定实例重分配条件并执行重分配算法将节点间数据流转换为节点内数据流,从而降低通信开销。实验结果表明,在三组拓扑测试下,TLIR-Heron相较于Heron默认调度策略能够降低节点间通信开销和系统的计算延迟,并提升了计算节点资源利用的均衡性。 As a new platform in big data stream computing,Apache Heron ignores the difference in communication modes between task instances and the unbalance of processing load among nodes,which leads to the decline system performance.To address the problem,this paper designed the model of node resource limitation,the model of communication overhead optimization and the model of data stream relationships among instances,as the foundation to propose the TLIR-Heron.The strategy was composed of the node resource limitation algorithm and the instance reallocation algorithm.By judging the criteria for instance reallocation and executing instance reallocation algorithm,this strategy transformed the inter-node data streams into intra-node data streams and minimized the communication overhead of the system.The experimental results show that under the three sets of benchmarks,TLIR-Heron reduces the communication overhead between nodes and the response latency of the system compared with the default scheduling strategy,and improves the balance of resource utilization of computing nodes.
作者 刘宇 于炯 蒲勇霖 李梓杨 张译天 Liu Yu;Yu Jiong;Pu Yonglin;Li Ziyang;Zhang Yitian(School of Software,Xinjiang University,Urumqi 830091,China;College of Information Science&Engineering,Xinjiang University,Urumqi 830046,China)
出处 《计算机应用研究》 CSCD 北大核心 2021年第1期198-203,共6页 Application Research of Computers
基金 国家自然科学基金资助项目(61862060,61462079,61562086,61562078) 国家科技部科技支撑基金资助项目(2015BAH02F01) 新疆大学博士生科技创新资助项目(XJUBSCX-201902)。
关键词 大数据 流式计算 Apache Heron 资源限制 通信开销 big data stream computing Apache Heron resource limitation communication overhead
  • 相关文献

参考文献3

二级参考文献26

  • 1Zhang H,Chen G,Ooi B C,et al.Inmemory big data management and processing:a survey. IEEE Transactions on Knowledge and Data Engineering . 2015
  • 2Zaharia M,Das T,Li H,Hunter T,Shenker S,Stoica I.Discretized streams:fault-tolerant streaming computation at scale. ACM Symposium on OperatingSystems Principles (SOSP) . 2013
  • 3Rodrigo Agerri,Xabier Artola,Zuhaitz Beloki,German Rigau,Aitor Soroa.??Big data for Natural Language Processing: A streaming approach(J)Knowledge-Based Systems . 2014
  • 4Marcos D. Assun??o,Rodrigo N. Calheiros,Silvia Bianchi,Marco A.S. Netto,Rajkumar Buyya.??Big Data computing and clouds: Trends and future directions(J)Journal of Parallel and Distributed Computing . 2014
  • 5F. Dehne,Q. Kong,A. Rau-Chaplin,H. Zaboli,R. Zhou.??Scalable real-time OLAP on cloud architectures(J)Journal of Parallel and Distributed Computing . 2014
  • 6Mauro Andreolini,Michele Colajanni,Marcello Pietri,Stefania Tosi.??Adaptive, scalable and reliable monitoring of big data on clouds(J)Journal of Parallel and Distributed Computing . 2014
  • 7Karthik Kambatla,Giorgos Kollias,Vipin Kumar,Ananth Grama.??Trends in big data analytics(J)Journal of Parallel and Distributed Computing . 2014
  • 8C.L. Philip Chen,Chun-Yang Zhang.??Data-intensive applications, challenges, techniques and technologies: A survey on Big Data(J)Information Sciences . 2014
  • 9Yang F,Qian Z P,Chen X W,et al.Sonora:a platform for continuous mobile-cloud computing. http://research.microsoft.com/apps/pubs/default.aspx?id=161446 . 2012
  • 10Sfrent A,Pop F.Asymptotic scheduling for many task computing in big data platforms. Journal of Information Science . 2015

共引文献327

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部