期刊文献+

面向通用计算GPU集群的任务自动分配系统 被引量:2

Automatic Task Assignment System of General Computing Oriented GPU Cluster
下载PDF
导出
摘要 当前GPU集群的主流编程模型是MPI与CUDA的松散耦合,采用这种编程模型进行编程,存在编程复杂度大、程序的可移植性差、执行效率低等问题。为此,提出一种面向通用计算GPU集群的任务自动分配系统StreamMAP。对编译器进行改造,以编译制导的方式提供集群任务的计算资源需求,通过运行时系统动态地发现、建立并维护系统资源拓扑,设计一种较为契合GPU集群应用特征的任务分配策略。实验结果表明,StreamMAP系统能降低集群应用程序的编程复杂度,使之较为高效地利用GPU集群的计算资源,且程序的可移植性和可扩展性也得到了保证。 MPI+CUDA are the mainstream programming models of current GPU cluster architecture. However, by using such a low level programming model, programmers require detailed knowledge of the underlying architecture, which exerts a heavy burden. Besides, the program is less portability and inefficient. This paper proposes StreamMAP, an automatic task assignment system on GPU clusters. It provides powerful, yet concise language extension suitable to describe the compute resource demands of cluster tasks. It develops a run time system to maintain resource information, and supplies an automatic task assignment for GPU cluster. Experiments show that StreamMAP provides programmability, portability and scalability for GPU cluster application.
出处 《计算机工程》 CAS CSCD 2014年第3期103-107,119,共6页 Computer Engineering
关键词 GPU集群 异构 编程模型 任务分配 可移植性 可扩展性 GPU cluster heterogeneous programming model task assignment portability scalability
  • 相关文献

参考文献12

二级参考文献112

共引文献79

同被引文献14

  • 1Nickolls J,Buck I,Garland M,et al.Scalable Parallel Programming with CUDA[J].Queue,2008,6(2):40-53.
  • 2Yoo A B,Jette M A,Grondona M.SLURM:Simple Linux Utility for Resource Management[C]//Proceedings of JSSPP’03.Berlin,Germany:Springer,2003:44-60.
  • 3Staples G.TORQUE Resource Manager[C]//Proceed-ings of ACM/IEEE Conference on Supercomputing.New York,USA:ACM Press,2006:8.
  • 4沈莉,陈林.一种CPU+GPU资源调度系统的研究[J].高性能计算发展与应用,2011,(1):28-31.
  • 5Newall M,Holmes V,Lunn P.GPU Cluster for Accelerated Processing and Visualisation of Scientific and Engineering Data[C]//Proceedings of Science and Information Conference.Washington D.C.,USA:IEEE Press,2014:140-145.
  • 6Prabhakaran S,Iqbal M,Rinke S,et al.A Dynamic Resource Management System for Network-attached Accelerator Clusters[C]//Proceedings of the 42nd International Conference on Parallel Processing.Washington D.C.,USA:IEEE Press,2013:773-782.
  • 7张繁,王章野,姚建,吴韬,彭群生.应用GPU集群加速计算蛋白质分子场[J].计算机辅助设计与图形学学报,2010,22(3):412-419. 被引量:12
  • 8梁娟娟,任开新,郭利财,刘燕君.GPU上的矩阵乘法的设计与实现[J].计算机系统应用,2011,20(1):178-181. 被引量:7
  • 9刘进锋,郭雷.CPU与GPU上几种矩阵乘法的比较与分析[J].计算机工程与应用,2011,47(19):9-11. 被引量:7
  • 10马梦琦,刘羽,曾胜田.基于CUDA架构矩阵乘法的研究[J].微型机与应用,2011,30(24):62-64. 被引量:2

引证文献2

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部