基于事件驱动的MapReduce类流量产生方法与网络评测被引量：1

Event-Driven Method for MapReduce Traffic Generation and Network Evaluation

下载PDF

导出

摘要大规模网络结构设计是构建大规模分布式系统和E级高性能计算集群的核心技术之一,底层网络设计者需要结合顶层应用通信流量特征,进行网络结构选型与优化.不当的应用通信模型会引起网络结构设计与实际需求的背离,进而导致系统通信和整体性能的下降.传统基于"黑盒"数据分析的流量建模方法存在业务建模粒度粗和应用数据规模扩展性差等缺陷.该研究引入模拟业务内部逻辑的"事件驱动"思想,提出一种针对主流计算模式MapReduce进行流量建模与流量产生方法.与真实应用流量的对比评测显示,该方法能够准确体现MapReduce计算业务所产生网络流量的特征.基于正确的流量模型,该文对四种主流数据中心网络进行了性能模拟分析.结果表明:相较负载随机均匀分布流量,同一种网络在负载MapReduce特性流量时性能将下降超过30%,因此特性流量能更加明显地展现网络拥塞与瓶颈问题.仿真实验所得到的有关网络性能瓶颈、拓扑可扩展性以及网络性价比的结论,为大规模数据中心网络选型和性能优化提供了新的依据. Interconnection network design is one of the core technologies in the constructions of exascale clusters and large-scale distributed systems.Such large-scale computing system is expected to be achieved in the near future due to the rapid innovations of semiconductor logic and memory,architectures,interconnections and other industry technologies.Among these,due to performance and cost factors,interconnection network plays a critical role in such a large-scale computing system.In large-scale clusters or datacenter,the design of interconnection network is facing greater challenges.Firstly,the increasing computing capacity of a single node requires the network providing higher bandwidth and lower latency.Secondly,the increasing number of nodes requires the network has extremely better scalability.Thirdly,the increasing scale of system leads to worse performance of collective communication,which is harmful to the performance and scalability of applications.Fourthly,the increasing number of devices requires the network has better reliability.As the performance of compute nodes keep increasing,interconnection network has gradually become the bottleneck of large-scale computing system.However,switch chip,the core component of interconnection network,can offer limited aggregate bandwidth because of the constraint of physical processes and packaging technologies.The underlying network designers should consider the processing characteristics of the network traffic when selecting and optimizing the network architecture.Improper traffic model will cause the departure between network architecture and characteristics of communication,which will reduce the overall performance of data centers and clusters.Big data platform has the cost-effective advantage of data processing with the feature of simplified programming and parallel computing,which has being more and more recognized by the industry.In recent years,the community of high-performance computing is also increasingly using Big data platform for HPC data processing,which has become a powerful means for scientific data analysis gradually.Scientific data traffic generated by the application of HPC tends to have many requirements,including high quality processing,compute in communication link and huge date size,which is called as“high-throughput”traffic.The scale of data processing and the port cost of network need to be considered during the design of datacenter for distributed computer system.The most widely used model for computing and communication in distributed system is MapReduce.The traditional traffic generation method for“Black-Box”is coarse granularity with poor scalability.Therefore,this paper presents a methodology for MapReduce traffic modeling and generation based on the idea of“event-driven”.The accuracy evaluation,which compared our methodology with the real application traffic,indicates that the traffic generated by our method can accurately reflect the characteristics of the network traffic generated by MapReduce in distributed computing system.Our performance simulation analysis and bottleneck analysis of four major data center networks,which is conducted by using the characteristic flow in network simulator,shows that the difference of network performance between the one loaded with MapReduce traffic and the one loaded with uniform random traffic is more than 30%,indicating that characteristic traffic could more obviously reveal the issues of network congestion and bottleneck.The results of our simulation,related to the bottleneck of network performance,topology scalability and network cost-effectiveness,provide a new way for large-scale data center network selection and network performance optimization.

作者邵恩孙凝晖郭嘉梁元国军王展曹政 SHAO En;SUN Nin-Hui;GUO Jia-Liang;YUAN Guo-Jun;WANG Zhan;CAO Zheng(State Key Laboratory of Computer Architecture,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190;University of Chinese Academy of Sciences,Beijing 100049)

机构地区中国科学院计算技术研究所计算机体系结构国家重点实验室中国科学院大学

出处《计算机学报》 EI CSCD 北大核心 2018年第10期2265-2281,共17页 Chinese Journal of Computers

基金国家重点研发计划项目(2016YFB0200300 2016YFGX030148 2016YFB0200205 2016GZKF0JT006) 国家自然科学基金项目(61572464 61402444) 中国科学院战略性先导科技专项(XDB24060600)资助~~

关键词分布式系统 MAPREDUCE 数据中心网络事件驱动大规模网络模拟 distributed system MapReduce data center network event-driven large-scale network simulation

分类号 TP393 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1郭得科,罗来龙,李妍,胡智尧,任棒棒.数据中心内Incast流量的网内聚合研究[J].计算机研究与发展,2016,53(1):53-67. 被引量：3
2何高峰,杨明,罗军舟,张璐.Tor匿名通信流量在线识别方法[J].软件学报,2013,24(3):540-556. 被引量：34
3周爱平,程光,郭晓军.高速网络流量测量方法[J].软件学报,2014,25(1):135-153. 被引量：29

二级参考文献44

1郑军,胡铭曾,云晓春,郑仲.基于数据流方法的大规模网络异常发现[J].通信学报,2006,27(2):1-8. 被引量：17
2Condie T,Conway N,Alvaro P, et al. MapReduce online[C] //Proc of USENIX NSDI’10. Berkeley, CA: USENIXAssociation, 2010; 313-328.
3Yu Yuan, Isard M,Fetterly D,et al. DryadLINQ: A systemfor general-purpose distributed data-parallel computing using ahigh-level language [C] //Proc of USENIX OSDI'08. Berkeley,CA: USENIX Association, 2008: 1-14.
4Murray D G,Schwarzkopf M, Smowton C, et al. CIEL: Auniversal execution engine for distributed data-flowcomputing [C] //Proc of USENIX NSDI’ll. Berkeley, CA:USENIX Association, 2011.
5Malewicz G, Austern M H,Bik A J C, et al. Pregel: Asystem for large-scale graph processing [C] //Proc of ACMSIGMOD'IO. New York: ACM, 2010: 135-146.
6Zaharia M,Chowdhury M, Franklin M J,et al. Spark:Cluster computing with working sets [J]. Book of Extremes,2010, 15(1): 1765-1773.
7Chowdhury M? Zaharia M, Ma J, et al. Managing datatransfers in computer clusters with orchestra [C] //Proc ofACM SIGCOMM'll. New York: ACM, 2011: 98-109.
8Al-Fares A, Loukissas A,Vahdat A. A scalable, commoditydata center network architecture [C] //Proc of ACMSIGCOMM'08. New York: ACM, 2008; 63-74.
9Greenberg A, Jain N,Kandula S, et al. VL2: A scalableand flexible data center network [C] //Proc of ACMSIGCOMM'09. New York; ACM, 2009: 51-62.
10Mysore R,Pamboris A, Farrington N. PortLand: A scalablefault-tolerant layer 2 data center network fabric [C] //Proc ofACM SIGCOMM'09. New York: ACM,2009: 39-50.

共引文献62

1李振国,郑惠中.网络流量采集方法研究综述[J].吉林大学学报（信息科学版）,2014,32(1):70-75. 被引量：12
2姜开达,李霄,孙强.基于网络流量元数据的安全大数据分析[J].信息网络安全,2014(5):37-40. 被引量：15
3刘元珍.基于多级CBF的长流识别[J].微型电脑应用,2014,30(9):60-61. 被引量：1
4刘勇,雒江涛,邓生雄,王小平.基于Hadoop的网络分流和流特征计算[J].电信科学,2014,30(12):76-81. 被引量：6
5王啸,方滨兴,刘培朋,郭莉,时金桥.Tor匿名通信网络节点家族的测量与分析[J].通信学报,2015,36(2):80-87. 被引量：4
6侯颖,黄海,兰巨龙,李鹏,朱圣平.基于自适应超时计数布鲁姆过滤器的流量测量算法[J].电子与信息学报,2015,37(4):887-893. 被引量：3
7赵小欢,李明辉.基于CBF-SS策略的大流识别算法[J].中国科学院大学学报（中英文）,2015,32(3):391-397. 被引量：1
8王晶,汪斌强,张校辉.基于可重构测量模型的网络测量任务部署算法[J].电子与信息学报,2015,37(7):1598-1605. 被引量：1
9伊鹏,钱坤,黄万伟,王晶,张震.基于抽样流长与完全抽样阈值的异常流自适应抽样算法[J].电子与信息学报,2015,37(7):1606-1611. 被引量：3
10王东.计算机网络路由算法的理论与进展[J].河南理工大学学报（自然科学版）,2015,34(5):665-670. 被引量：3

同被引文献10

1李强,刘晓峰.基于模拟植物生长算法的云作业调度模型[J].系统仿真学报,2018,30(12):4649-4658. 被引量：9
2陈黄科,祝江汉,朱晓敏,马满好,张振仕.云计算中资源延迟感知的实时任务调度方法[J].计算机研究与发展,2017,54(2):446-456. 被引量：27
3宋杰,孙宗哲,毛克明,鲍玉斌,于戈.MapReduce大数据处理平台与算法研究进展[J].软件学报,2017,28(3):514-543. 被引量：96
4吕瑞,孙林夫.面向产业链云服务平台的分布式备件库存协同控制方法与软件工具研究[J].计算机工程与科学,2017,39(10):1812-1818. 被引量：6
5萨日娜.基于蚁群粒子群优化算法的云计算资源调度方案[J].吉林大学学报（理学版）,2017,55(6):1518-1522. 被引量：12
6金顺福,郝闪闪,王宝帅.融合双速率和工作休眠的虚拟机调度策略及参数优化[J].通信学报,2017,38(12):10-20. 被引量：8
7束柬,梁昌勇,徐健.基于信任的云服务系统多目标任务分配模型[J].计算机研究与发展,2018,55(6):1167-1179. 被引量：11
8任桂山,刘梦泽,陈学梅,李红艳,徐朝农.面向MapReduce异构集群的低功耗调度技术研究[J].计算机应用与软件,2018,35(7):138-141. 被引量：6
9Jianjiang Li,Jie Wang,Bin Lyu,Jie Wu,Xiaolei Yang.An Improved Algorithm for Optimizing MapReduce Based on Locality and Overlapping[J].Tsinghua Science and Technology,2018,23(6):744-753. 被引量：5
10施巍松,张星洲,王一帆,张庆阳.边缘计算:现状与展望[J].计算机研究与发展,2019,56(1):69-89. 被引量：342

引证文献1

1邵孟良,齐德昱.大数据应用中节点休眠结合MapReduce作业的能量感知调度方法[J].计算机应用与软件,2020,37(6):40-47. 被引量：1

二级引证文献1

1关智华,郭志彪.基于遗传算法的通信网络节点自适应调度方法[J].现代传输,2023(5):63-66.

1陈秀锦,王岩,李英木,陆雯怡,杨峰,祝遵坤,何军委.管线网络的评测和优化[J].电信技术,2018(1):68-72.
2段树侠,胥俊丞,杨伟,张传熙.BBU集中化部署及本地传送网应对策略研究[J].邮电设计技术,2017(11):18-21. 被引量：7
3刘琦,尹祖新.UTN网络评测及优化[J].邮电设计技术,2017(11):35-38.
4郭燕梅.“互联网+”环境下开展儿童文学阅读的研究[J].教育信息技术,2018(4):24-26. 被引量：1
5刘梦薇.数据管理中通信及互联网技术运用分析[J].华东科技（综合）,2018,0(3):346-346.
6陆冬妹.基于FPGA的通信原理实验平台研制与应用探析[J].信息与电脑,2018,30(17):60-61.
7何亚桥,黄小军,李明佳.本地UTN测评及降本增效分析[J].电信技术,2018(8):82-84. 被引量：1
8蓝天.对电力综合自动化系统中数字化变电站特点和网络选型的探讨[J].电子测试,2018,29(3):146-147. 被引量：5
9邵恩,元国军,郇志轩,曹政,孙凝晖.面向大规模计算集群的多轨分割网络[J].计算机研究与发展,2017,54(11):2534-2546. 被引量：2
10徐文远,毛力,王晓锋.基于流量约简的网络模拟方法[J].计算机工程,2017,43(1):120-125. 被引量：1

计算机学报

2018年第10期

浏览历史

内容加载中请稍等...

基于事件驱动的MapReduce类流量产生方法与网络评测被引量：1

参考文献3

二级参考文献44

共引文献62

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于事件驱动的MapReduce类流量产生方法与网络评测 被引量：1

参考文献3

二级参考文献44

共引文献62

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于事件驱动的MapReduce类流量产生方法与网络评测被引量：1