期刊文献+

基于流网络的Flink平台弹性资源调度策略 被引量:15

Flow-network based auto rescale strategy for Flink
下载PDF
导出
摘要 为了解决大数据流式计算平台中存在计算负载波动上升,但集群无法有效应对负载变化的问题,提出了基于流网络的Flink平台弹性资源调度策略(FAR-Flink)。该策略首先建立流网络模型并通过构建算法计算每条边的容量值,其次通过弹性资源调度算法确定集群性能瓶颈并制定动态资源调度计划,最后通过基于数据分簇和分桶管理的状态数据迁移算法,实施调度计划并完成节点间的高效数据迁移。实验结果表明,该策略在状态数据复杂的应用场景中有较好的优化效果,在满足计算时延约束的前提下提高了集群的吞吐量,缩短了状态数据迁移的时间。由此可见,FAR-Flink策略有效提升了集群对负载波动的响应能力。 In order to solve the problem that the load of big data stream computing platform is increasing with fluctuation while the cluster was not able to rescale efficiently,the Flow-network based auto rescale strategy for Flink was proposed.Firstly,the flow-network model was set up and the capacity of each edge that was calculated by self-learning algorithm.Secondly,the bottleneck of the cluster was acquired by maximum-flow algorithm and the resource rescheduling plan was drawn up.Finally,the resource rescheduling plan was executed and the stateful data was migrated efficiently by the data migration algorithm based on the strategy of data partitioning by bulk and bucket.The experimental results show that the strategy can effectively provide performance promotion in the application with complex stateful data.It improved the throughput of the cluster and reduced the time overhead of the data migration on the premise of satisfying the latency constrain of the application,which means that the strategy promotes the scalability of the cluster efficiently.
作者 李梓杨 于炯 卞琛 张译天 蒲勇霖 王跃飞 鲁亮 LI Ziyang;YU Jiong;BIAN Chen;ZHANG Yitian;PU Yonglin;WANG Yuefei;LU Liang(School of Information Science and Engineering,Xinjiang University,Urumqi 830046,China;School of Software,Xinjiang University,Urumqi 830008,China;College of Internet Finance and Information Engineering,Guangdong University of Finance,Guangzhou 510521,China;School of Computer Science and Technology,Civil Aviation University of China,Tianjin 300300,China)
出处 《通信学报》 EI CSCD 北大核心 2019年第8期85-101,共17页 Journal on Communications
基金 国家自然科学基金资助项目(No.61862060,No.61462079,No.61562086,No.61562078) 国家科技部科技支撑基金资助项目(No.2015BAH02F01) 新疆维吾尔自治区自然科学基金资助项目(No.2017D01A20) 新疆维吾尔自治区高校科研计划基金资助项目(No.XJEDU2016S106)
关键词 流式计算 资源调度 弹性集群 负载迁移 Flink stream computing resource scheduling elastic cluster load migration Flink
  • 相关文献

参考文献8

二级参考文献26

  • 1Zhang H,Chen G,Ooi B C,et al.Inmemory big data management and processing:a survey. IEEE Transactions on Knowledge and Data Engineering . 2015
  • 2Zaharia M,Das T,Li H,Hunter T,Shenker S,Stoica I.Discretized streams:fault-tolerant streaming computation at scale. ACM Symposium on OperatingSystems Principles (SOSP) . 2013
  • 3Rodrigo Agerri,Xabier Artola,Zuhaitz Beloki,German Rigau,Aitor Soroa.??Big data for Natural Language Processing: A streaming approach(J)Knowledge-Based Systems . 2014
  • 4Marcos D. Assun??o,Rodrigo N. Calheiros,Silvia Bianchi,Marco A.S. Netto,Rajkumar Buyya.??Big Data computing and clouds: Trends and future directions(J)Journal of Parallel and Distributed Computing . 2014
  • 5F. Dehne,Q. Kong,A. Rau-Chaplin,H. Zaboli,R. Zhou.??Scalable real-time OLAP on cloud architectures(J)Journal of Parallel and Distributed Computing . 2014
  • 6Mauro Andreolini,Michele Colajanni,Marcello Pietri,Stefania Tosi.??Adaptive, scalable and reliable monitoring of big data on clouds(J)Journal of Parallel and Distributed Computing . 2014
  • 7Karthik Kambatla,Giorgos Kollias,Vipin Kumar,Ananth Grama.??Trends in big data analytics(J)Journal of Parallel and Distributed Computing . 2014
  • 8C.L. Philip Chen,Chun-Yang Zhang.??Data-intensive applications, challenges, techniques and technologies: A survey on Big Data(J)Information Sciences . 2014
  • 9Yang F,Qian Z P,Chen X W,et al.Sonora:a platform for continuous mobile-cloud computing. http://research.microsoft.com/apps/pubs/default.aspx?id=161446 . 2012
  • 10Sfrent A,Pop F.Asymptotic scheduling for many task computing in big data platforms. Journal of Information Science . 2015

共引文献119

同被引文献100

引证文献15

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部