多核编程模型运行时环境的自适应性研究被引量：3

On Adaptability of the Runtime Environment for Emerging Multi-Core Programming Models

下载PDF

导出

摘要针对多核编程模型运行时环境易造成处理器核资源竞争加剧以及可扩展性较差等弊端,基于动态反馈控制思想,将资源分配、运行时控制、任务执行视为有机整体,提出了自适应协同调度模型ACSM.ACSM采用集中式与分布式相结合的协同机制,动态调节处理器核资源在不同应用负载间及其内部的分配与管理.ACSM的优势在于充分体现了多核编程模型良好的可编程性和可移植性,消除了传统多核运行时环境显式指定核数的弊端,增强了处理器核资源分配的高效性和自适应性.实验结果表明,ACSM在提高多核编程模型易用性的同时,减少了系统处理器核资源的不良竞争,提升了系统的整体性能和资源利用率.与仅依赖多核编程模型运行时环境的调度算法相比,ACSM使应用程序的运行时间缩短了近50%,并且随着应用程序数量的增加效果更加显著. The adaptability and collaboration of the multi-core runtime system is studied to address the problems that the current multi-core runtime can easily lead to intensified competition for processor resources and the system scalability is inferior. An adaptive and collaborative scheduling model, named ACSM, is presented based upon the dynamic feedback-control principle by taking resource allocation, runtime control, and task execution as a holistic system. The ACSM dynamically reallocates and manages processor resources among and within workloads in both centralized and distributed manners. The superiorities of ACSM over the current multi-core runtime system are as follows. The ACSM maintains good programmability and portability, enhances efficiency and adaptability in processor resources allocation, and eliminates the need of explicitly specifying the number of cores. The experiment results show that ACSM greatly reduces the competition of processor resources and improves both the overall system performance and the usability of the current multi-core programming models. Comparisons with the scheduling algorithm that relies only on the original multi-core runtime show that applications of ACSM reduce the run time by about 50% or even more, especially when the system load increases.

作者曹仰杰杨海兵钱德沛伍卫国

机构地区西安交通大学电子与信息工程学院北京航空航天大学计算机学院

出处《西安交通大学学报》 EI CAS CSCD 北大核心 2011年第6期130-134,共5页 Journal of Xi'an Jiaotong University

基金国家自然科学基金资助项目(61073011) 国家高技术研究发展计划资助项目(2009AA01A135 2009AA01A13) 中意国际合作项目(2009DFA12110)

关键词多核编程模型运行时环境协同调度 multi-core programming model runtime system collaborative scheduling

分类号 TP301 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献9

1HILL M, MARTY M. Amdahl's law in the multicore era [J]. Computer, 2008, 41(7):33-38.
2易会战,刘永鹏.改善系统能量效率的体系结构方法:并行处理[J].计算机学报,2009,32(12):2475-2481. 被引量：5
3CHAPMAN B, HUANG Lei. Enhancing OpenMP and its implementation for programming multicore systems [M]//Parallel Computing: Architectures, Algorithms, and Applications. Amsterdam, Netherlands: IOS Press, 2008 : 3-18.
4REINDERS J. Intel threading building blocks: outfitting C++ for multi-core processor parallelism [M]. Sebastopol, CA, USA: O'Reilly Media, 2007: 133- 168.
5FRIGO M, LEISERSON C E, RANDALL K H. The implementation of the Cilk-5 multithreaded language [C] // Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation.New York, USA. ACM, 1998: 212-223.
6龙国平,张军超,范东睿.众核体系结构对Cilk语言的硬件支持及评测研究[J].计算机学报,2008,31(11):1975-1985. 被引量：7
7BIENIA C, KUMAR S, SINGH J P, et al. The PAR- SEC benchmark suite: characterization and architectural implications [C] // Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques. New York, USA: ACM, 2008: 72-81.
8AGRAWAL K, LEISERSON C E, SUKHA J. Executing task graphs using Work-Stealing [C]//Proceedings of 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS). Piscataway, NJ, USA: IEEE, 2010: 1-12.
9AGRAWAL K, HE Y, LEISERSON C E. Adaptive work stealing with parallelism feedback [C]// Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). New York, USA:ACM, 2007: 112-120.

二级参考文献40

1Wentzlaff D, Griffin P, Hoffmann H, Bao L, Edwards B, Ramey C, Mattina M, Miao C C, Brown J F, Agarwal A. On-chip interconnection architecture of the Tile processor. IEEE Micro, 2007, 27(5): 15-31
2Tan G, Fan D, Zhang J, Russo A, Gao G R. Experience on optimizing irregular computation for memory hierarchy in manycore architecture//Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. Salt Lake City, Utah, USA, 2008: 279-280
3Long G P, Fan D R, Zhang J C, Song F L, Yuan N, Lin W. A performance model of dense matrix operations on manycore architectures//Proceedings of the European Conference on Parallel and Distributed Computing. 2008:120-129
4Lamport L. How to make a multiprocessor computer that correctly executes multiprocess programs. IEEE Transactions on Computers, 1979, 28(9): 690-691
5Adve S V, Gharachorloo K. Shared memory consistency models: A tutorial. IEEE Computer, 1996, 29(12): 66-76
6Lenoski D, Laudon J, Gharachorloo K, Gupta A, Hennessy J L. The directory-based cache coherence protocol for the DASH multiprocessor//Proceedings of the International Symposium on Computer Architecture. Seattle, WA, USA, 1990: 148-159
7Iftode L, Singh J P, Li K. Scope consistency: A bridge between release consistency and entry consistency. Theory Computing Systems, 1998, 31(4): 451-473
8胡伟武.共享存储体系结构.北京:高等教育出版社,2001
9Frigo M, Leiserson C E, Randall K H. The implementation of the Cilk-5 mnltithreaded language//Proceedings of the International Symposium on Programming Languages Design and Implementation. Montreal, Canada, 1998:212-223
10Blumofe R D, Leiserson C E. Scheduling multithreaded computations by work stealing//Proceedings of the Annual IEEE Symposium on Foundations of Computer Science. Santa Fe, New Mexico, 1994: 256-368

共引文献10

1白俊峰,邓祖朴.多核系统下的IPSec VPN网关的研究和实现[J].计算机工程与设计,2010,31(13):2992-2995. 被引量：5
2余磊,刘志勇,马宜科,宋风龙,徐卫志,叶笑春.众核结构上分块LU分解算法的研究[J].高技术通讯,2011,21(3):248-253.
3余磊,刘志勇,宋风龙,叶笑春.LU分解在众核结构仿真器上的指令级调度研究[J].系统仿真学报,2011,23(12):2603-2610. 被引量：5
4曹仰杰,钱德沛,伍卫国,董小社.众核处理器系统核资源动态分组的自适应调度算法[J].软件学报,2012,23(2):240-252. 被引量：14
5王蕾,崔慧敏,陈莉,冯晓兵.任务并行编程模型研究与进展[J].软件学报,2013,24(1):77-90. 被引量：29
6易会战,罗兆成.基于动态电压调节的高性能业务系统能耗优化[J].华中科技大学学报（自然科学版）,2013,41(1):25-29. 被引量：1
7周亦敏,沈云龙,曹丽东.基于异构多核平台H.264解码的DVFS算法[J].计算机工程,2013,39(11):268-271. 被引量：4
8罗章琪,黄昆,张大方,关洪涛,谢高岗.面向数据包处理的众核处理器核资源分配方法[J].计算机研究与发展,2014,51(6):1159-1166. 被引量：2
9彭慧.基于智能化分配算法的计算机负荷并行处理技术研究[J].赤峰学院学报（自然科学版）,2015,31(2):21-23. 被引量：1
10李旺,潘谜,王巍.基于Cilk的不确定机械手主控LM算法并行化研究[J].集美大学学报（自然科学版）,2017,22(3):55-59.

同被引文献14

1Kasanovic. The parallel computing laboratory at U.C.Berkeley:A research agenda based on the berkeley view[R].Berkeley:UCB,2008.1-25.
2LIU Duo,SHAO Zili,WANG Meng. Optimal loop parallelization for maximizing iteration-level parallelism[J].IEEE Transactions on Paralld and Distributed Systems,2012,(03):564-572.
3Hill M D,Marty M R. Amdahl's law in the multicore era[J].Computer,2008,(07):33-38.
4W Hwu,S Ryoo,SZ Ueog. Implicitly parallel programming models for thousand-core microprocessors[A].San Diego,CA,USA:ACM,2007.754-759.
5ZHANG Wangyuan,FU Xin,LI Tao. An analysis of microarchitecture vulnerability to soft errors on simultaneous multithreaded architectures[A].San Jose,CA,USA:IEEE,2007.169-178.
6Balakrishnan S,Sohi G S. Program demultiplexing:Data-flow based speculative parallelization of methods in sequential programs[A].Boston,MA,USA:IEEE,2006.302-313.
7Ben Lee. Pertormance evaluation of dynamic speculative multithreading with the cascadia architecture[J].IEEE Transactions on Parallel and Distributed Systems,2010,(01):47-59.
8Bridges M J,Vachharajani N,ZHANG Y. Revisiting the sequential programming model for multi-core[A].Chicago,IL,USA:IEEE,2007.69-84.
9Tian C,Feng M,Nagarajan V. Copy or discard execution model for speculative parallelization on multicores[A].Lake Como,Italy:IEEE,2008.300-341.
10伊君翰.基于多核处理器的并行编程模型[J].计算机工程,2009,35(8):62-64. 被引量：13

引证文献3

1谭海.基于对象的隐式并行编程众核体系结构研究[J].计算机工程与设计,2013,34(2):623-626.
2徐建.多核编程模型运行时环境的自适应性探讨[J].科技信息,2013(26):225-225. 被引量：1
3余娅梅.探讨计算机应用程序编程模型的发展方向[J].电脑编程技巧与维护,2015(16):16-17. 被引量：3

二级引证文献4

1高春明.c++数据库类型及转换应用分析[J].科技风,2015(24):96-96. 被引量：1
2王执源.计算机应用程序编程模型发展方向探析[J].信息与电脑,2016,28(17):55-56. 被引量：1
3邵富良,张嘉文,邢一.计算机应用程序编程模型的发展[J].电子技术与软件工程,2017(11):168-168.
4康海龙,曾麒麟,马天琦,范亚楠,陈志敏.开放式基础飞行系统架构设计[J].电子设计工程,2019,27(5):156-159. 被引量：2

1徐建.多核编程模型运行时环境的自适应性探讨[J].科技信息,2013(26):225-225. 被引量：1
2李斌,李海玉,孙延君.多核计算机上的并行计算[J].电脑编程技巧与维护,2011(18):34-35. 被引量：3
3王顺绪.多核计算机上并行计算的实现与分析[J].淮海工学院学报（自然科学版）,2009,18(3):30-33. 被引量：5
4欧阳璟.听Solaris CTO谈多核编程——2007 Sun科技日见闻[J].程序员,2007(12):34-34.
5董本清.多核编程,书写提高IT职业教育质量的新篇章——教育部高职高专计算机类专业教学指导委员会多核编程推广纪实[J].计算机教育,2009(7):122-124.
6滕英岩,高志君,张盈谦.高职高专计算机类专业开设《多核编程》课程的探索与实践[J].福建电脑,2010,26(1):184-185.
7张火林,李国庆,张江维.基于双核系统的快速排序效率分析[J].电脑知识与技术,2008(8):705-707. 被引量：2
8王冲,杨斌.OpenMP在Android多核编程中的研究与运用[J].单片机与嵌入式系统应用,2014,14(8):24-26.
9多核时代,嵌入式编程和应用之出路—“2007英特尔中国多核技术学术论坛”展开多核编程与应用的讨论[J].电子产品世界,2007,14(8):152-153. 被引量：1
10刘舒佳.不良竞争背后的“黑名单”[J].信息方略,2011(13):39-41.

西安交通大学学报

2011年第6期

浏览历史

内容加载中请稍等...

多核编程模型运行时环境的自适应性研究被引量：3

参考文献9

二级参考文献40

共引文献10

同被引文献14

引证文献3

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

多核编程模型运行时环境的自适应性研究 被引量：3

参考文献9

二级参考文献40

共引文献10

同被引文献14

引证文献3

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

多核编程模型运行时环境的自适应性研究被引量：3