BDSim:面向大数据应用的组件化高可配并行模拟框架被引量：5

BDSim:A Component-Based Highly Configurable Parallel Simulation Framework for Big-Data Application Evaluation

下载PDF

导出

摘要大规模并行模拟是研究大数据体系结构的重要方法,对大数据应用及众核体系结构的发展有着不可替代的推动作用.然而,目前的模拟技术不能满足大数据体系结构研究的需求,主要体现在模拟速度慢、配置过程复杂以及可扩展性差等方面.为了解决此问题,评估面向大数据应用的高通量众核体系结构的性能与功耗,该文提出了面向大数据应用的并行模拟框架——BDSim.该框架基于组件化思想,将功能组件与框架服务单元组成并行功能单元,并可根据负载情况,自由配置组件与框架服务单元之间的映射关系.为了提高组件之间的通信和同步效率,该文提出了一种非阻塞无锁通信优化方法,和一种CMB保守同步算法的优化算法——NMTRT-CMB同步算法.模拟不同并发规模的基于2D-Mesh网络的众核系统的实验结果表明,与基于锁的并行通信方法相比,框架采用的非阻塞无锁通信优化方法可以提高并行模拟速度约10%,该算法与CMB同步算法相比,NMTRT-CMB同步算法可以减少空消息数量达90%以上. Large-scale parallel simulation is an important method for big-data architecture research,which plays an irreplaceable role in promoting big data application and many-core architecture development.However,the simulation techniques cannot meet the needs of big dataarchitecture research currently,mainly reflected in respects of low simulation speed,complicate configuration,poor scalability,and so on.To address these problems,this paper proposed BDSim,a highly configurable parallel simulation framework for big data application simulation.This framework is able to evaluate the performance and energy consumption of high throughput computing architecture which targets to big data applications.The basic idea of BDSim is based on the thought of component.In BDSim,aparallel function unit consists of several function components and a framework service（FS）unit.FS unit is the service agent for function components which are attached to it. The mapping between function components and a framework service unit is depended on loadings of function units.To improve communication efficiency,this paper proposed an optimized non-block lock-free communication method.The NMTRT-CMB synchronization algorithm based on CMB conservative synchronization algorithm was also presented to improve synchronization efficiency.The experiments were conducted with many-core architecture based on 2D-Mesh NOC under different parallel scale.According to the result,non-block lock-free communication method can help improving simulation speedup by10%,compared to communication based on locking method. NMTRT-CMB reduces null messages by almost 90% when running with 16 threads,compared to CMB.

作者李文明叶笑春张洋宋风龙王达唐士斌范东睿谢向辉

机构地区中国科学院计算技术研究所计算机体系结构国家重点实验室中国科学院大学华为技术有限公司中央研究院高效能服务器和存储技术国家重点实验室数学工程与先进计算国家重点实验室

出处《计算机学报》 EI CSCD 北大核心 2015年第10期1959-1975,共17页 Chinese Journal of Computers

基金国家"九七三"重点基础研究发展规划项目基金(2011CB302501) 国家"八六三"高技术研究发展计划项目基金(2012AA010901 2015AA011204) "核高基"国家科技重大专项基金项目(2013ZX0102-8001-001-001) 国家自然科学基金(61173007 61204047 61332009)资助~~

关键词组件化并行模拟框架并行离散事件模拟非阻塞无锁通信 CMB算法高可配大数据 component modular parallel simulation framework PDES non-block lock-free communication CMB algorithm highly configurable big data

分类号 TP393 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献32

1李国杰.大数据对计算机系统的挑战.中国计算机学会通讯,2013,9(12):33-35.
2王元卓,靳小龙,程学旗.网络大数据:现状与展望[J].计算机学报,2013,36(6):1125-1138. 被引量：706
3Ferdman M, Adileh A, Kocberber O, et al. Clearing the clouds: A study of emerging scale-out workloads on modern hardware//Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems. London, UK, 2012:37-48.
4Chen Tian-Shi, Guo Qi, Tang Ke, et al. ArchRanker: A ranking approach to design space exploration//Proceedings of the 41st Annual International Symposium on Computer Architecuture. Minneapolis, USA, 2014:85-96.
5Wang Lei, Zhan Jian-Feng, Luo Chun-Jie, et al. BigDataBench: A big data benchmark suite from internet services/ /Proceedings of the 19th International Symposium on High Performance Computer Architecture. Orlando, USA, 2014:488-499.
6Ghasemi H R, Kim N S. RCS: Runtime resource and core scaling for power-constrained multi-core processors// Proceedings of the 23rd International Conference on Parallel Architectures and Compilation. Edmonton, Canada, 2014: 251-262.
7Nilmini A, Rcetuparna D, Li Qing-Kun, et al. Scaling towards kilo-core processors with asymmetric high-radix topologies//Proceedings of the 19th International Symposium on High Performance Computer Architecture. Shenzhen, China, 2013:496-507.
8Guthmuller E, Leti C E A, Grenoble F, et al. Architectural exploration of a fine-grained 3D cache for high performance in a manycore context//Proceedings of the 21st International Conference on Very Large Scale Integration. lstanbul, Turkey, 2013, 302-307.
9Kapil D, AbduUah N N, Sherief R. Power mapping and modeling of multi-core processors//Proceedings of the Inter- national Symposium on Low Power Electronics and Design. Beijing, China, 2013: 39-44.
10Sironi F, Maggio M, Cattaneo R, et al. ThermOS, System support for dynamic thermal management of chip multi- processors//Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques. Edinburgh, UK, 2013: 41-50.

二级参考文献93

1Bell S, Edwards B, Amann Jet al. TILE64 processor: A 64-core SoC with mesh inter-connect//Proceedings of the International Solid-State Circuits Conference. San Francisco, USA, 2008:88-598.
2Howard J, Dighe S, Hoskote Yet al. A 48-core IA-32 mes- sage-passing processor with DVFS in 45 nm CMOS//Pro- eeedings of the International Solid-State Circuits Conferenee. San Francisco, USA, 2010:108-109.
3Kelm John H, Johnson Daniel R, Johnson Matthew R et al. Rigel: An architecture and sealable programming//Proeeed- ings of the International Symposium of Computer Arehitee-ture. Saint-Malo, France, 2009: 140-151.
4Das S R, Fujimoto R, Panesar K S, Allison D, Hybinette M, GTW: A time warp system for shared memory multipro- eessors//Proceedings of the Winter Simulation Conference. Lake Buena Vista, USA, 1994:1332-1339.
5Chen J, Annavaram M, Dubois M. SlaekSim: A platform for parallel simulations of CMPs on CMPs. ACM SIGARCH Computer Architecture News, 2009, 37(2): 20-29.
6Miller J E, Kasture H, Kurian G et al. Graphite: A distributed parallel simulator for multieores//Proceedings of the 16th IEEE International Symposium on High-Performance Computer Architecture. Bangalore, India, 2010:1-12.
7Chiou D, Sunwoo D, Kim Jet al. FPGA-aeeelerated simula- tion technologies (FAST): Fast, full-system, cycle accurate simulators//Proeeedings of the 40th Annual IEEE/ACM In- ternational Symposium on Microarchiteeture. Porto Alegre, Brazil, 2007:249-261.
8Fujimoto R M. Parallel discrete event simulation. Communi- cations of the ACM, 1990, 33(10) : 30-53.
9Mukherjee S S, Reinhardt S, Falsafi Bet al. Wisconsin wind Tunnel II: A fast, portable parallel architecture simulator. IEEE Concurrency, 2000, 8(4): 12-20.
10Chandy K, Misra J. Distributed simulation: A case study in design and verification of distributed programs. IEEE Trans- actions on Software Engineering, 1979, 5(5): 440-452.

共引文献708

1张丛铄.基于大数据的研究生心理危机预警机制的构建[J].中国新通信,2020,0(2):80-81. 被引量：2
2吴嘉琪.一种基于ELK框架的地理信息动态时空数据获取与挖掘方法[J].测绘通报,2020(1):45-49. 被引量：2
3谢月锋,董现垒,陈卉,王燕,刘志成.利用网络痕迹信息即时预测儿童腹泻流行趋势[J].医学信息（医学与计算机应用）,2016,29(29):1-4.
4韩益亮,卢万谊,武光明,杨晓元.适用于网络大数据的属性基广义签密方案[J].计算机研究与发展,2013,50(S2):23-29. 被引量：2
5邓波,张玉超,金松昌,林旺群.基于MapReduce并行架构的大数据社会网络社团挖掘方法[J].计算机研究与发展,2013,50(S2):187-195. 被引量：10
6梁俊杰,熊亚军.以固态硬盘为缓存的存储技术研究[J].微电子学与计算机,2015,32(1):40-44. 被引量：2
7嵇梅.中国保健食品,明天还有“戏”吗?[J].新疆人大,2000(4):35-37.
8刘琼.大数据环境下图书馆面临的影响与挑战[J].理论观察,2013(8):112-113. 被引量：29
9马晓亭.大数据时代图书馆数据长期可用性保障研究[J].现代情报,2013,33(12):62-64. 被引量：7
10王晴.大数据时代企业竞争情报的机遇、挑战及对策研究[J].天津商务职业学院学报,2013,1(4):83-87. 被引量：1

同被引文献27

1宁津生,姚宜斌,张小红.全球导航卫星系统发展综述[J].导航定位学报,2013,1(1):3-8. 被引量：173
2S.Boccaletti,V.Latora,Y.Moreno,M.Chavezf,D.-U.Hwang,方爱丽,赵继军.复杂网络:结构和动力学[J].复杂系统与复杂性科学,2007,4(1):49-92. 被引量：7
3段亚军,武昌,李成恩.一种改进的卫星导航系统效能评估模型[J].火力与指挥控制,2008,33(5):133-136. 被引量：6
4周命端,郭际明,孟祥广.GPS对流层延迟改正UNB3模型及其精度分析[J].测绘信息与工程,2008,33(4):3-5. 被引量：21
5徐杰,孟黎,任超,徐军.对流层延迟改正中投影函数的研究[J].大地测量与地球动力学,2008,28(5):120-124. 被引量：26
6张勇,张斌,马能武.单频GPS接收机的电离层延迟改正模型研究[J].大地测量与地球动力学,2012,32(2):69-73. 被引量：14
7董龙明,王戟,陈立前,董威.基于局部堆内存抽象表示的堆操作程序内存泄露检测[J].计算机研究与发展,2012,49(9):1832-1842. 被引量：4
8徐慧,周建美,顾颀.强化课堂编程思维契合教学实践目标——《数据结构》教学方法探析[J].高教论坛,2013(1):24-28. 被引量：8
9吕慧伟,程元,白露,陈明宇,范东睿,孙凝晖.众核处理器和众核集群的并行模拟[J].计算机研究与发展,2013,50(5):1110-1117. 被引量：4
10李国伟,郭金运,原永东,杨磊,王方建.GPS测站多路径效应建模研究[J].测绘科学,2013,38(3):7-9. 被引量：13

引证文献5

1冯卫刚.数据结构、算法和程序之间关系研究[J].四川水泥,2016(9):275-275.
2方国庆,李文明,余洋,张洋,叶笑春,安虹.高通量众核并行模拟加速技术研究[J].计算机工程,2017,34(4):73-78.
3卢炼,阳爱民.大规模3D并行分层可扩展矩阵乘法的递阶优化方法[J].计算机应用研究,2017,34(6):1713-1717.
4袁磊,许劼,许广州.数据分析在汽车工业设备智能分析系统的应用[J].计算机应用与软件,2017,34(12):154-157. 被引量：1
5蒋炜,陈新鹏,赵光俊,王炳辉,潘飚.北斗卫星导航系统的定位效能评估方法[J].现代雷达,2022,44(6):70-76. 被引量：3

二级引证文献4

1曹永新.分析起重设备在港口备件智能化仓储系统中的应用[J].科技资讯,2020,18(4):60-60. 被引量：1
2马媛.基于北斗三号的气象应急通信系统应用分析[J].青海科技,2023,30(1):163-168. 被引量：2
3曹翔,史誉州,刘雅奇,赵仓龙,刘涛.北斗三号/GPS组合导航系统全球定位性能分析[J].珠江水运,2024(4):24-29.
4赵来定,廉佳鹏,张更新.COSPAS-SARSAT系统第二代示位标信号的软件无线电设计实现[J].现代雷达,2024,46(3):28-34.

1段平,罗笑南.分布式模拟中的CMB算法及其变型[J].计算机科学,1994,21(1):29-33.
2李越,钱德沛.基于NS的分布式并行网络模拟器[J].电子学报,2004,32(2):246-249. 被引量：13
3李俊红,解建军.并行离散事件模拟测试模型研究[J].计算机技术与发展,2007,17(8):113-116.
4LS Mtron公司部署Siemens PLM Software[J].CAD/CAM与制造业信息化,2012(6):6-6.
5王岩,吴悦,杨洪斌.并行离散事件模拟系统容错功能设计[J].计算机应用研究,2006,23(8):69-71. 被引量：1
6李俊红,解建军,王喜年,陈丽娟.并行离散事件模拟的同步机制研究[J].计算机工程与设计,2006,27(13):2375-2377. 被引量：3
7陆洋,赵卫东.一种面向服务的智能客户端组件技术[J].机电一体化,2010,16(3):66-71.
8D. Hamlet,陈涛.组装软件构件软件测试展望[J].国外科技新书评介,2012(3):14-15.
9张恺.电子商务中XML安全技术的应用研究[J].长春工程学院学报（自然科学版）,2010,11(1):97-100. 被引量：1
10郑纬民,余宏亮,施广宇,陈坚.基于并行离散事件模拟的大规模P2P系统行为预测[J].中国科学：信息科学,2010,40(10):1338-1350. 被引量：1

计算机学报

2015年第10期

浏览历史

内容加载中请稍等...

BDSim:面向大数据应用的组件化高可配并行模拟框架被引量：5

参考文献32

二级参考文献93

共引文献708

同被引文献27

引证文献5

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

BDSim:面向大数据应用的组件化高可配并行模拟框架 被引量：5

参考文献32

二级参考文献93

共引文献708

同被引文献27

引证文献5

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

BDSim:面向大数据应用的组件化高可配并行模拟框架被引量：5