期刊文献+

GPU异构系统中的存储层次和负载均衡策略研究 被引量:11

Research on Memory Hierarchy and Load Balance Strategy in Heterogeneous System Based on GPU
下载PDF
导出
摘要 GPU体系结构的革新和相应开发平台的发展使得GPU广泛地应用于科学计算领域。通过深入地分析GPU体系结构和存储层次的优缺点以及GPU上的关键性能特征,阐明了GPU体系结构、编程模型和存储层次之间的关系。针对GPU异构系统上的应用映射提出三种基本负载均衡优化策略:预取、流化、任务划分。试验结果揭示了不同的优化因子与优化效率之间的具体关联。 Owing to the revolution of GPU architecture and improvement of developing platforms, GPU is widely used in scientific computing nowadays. Relationships among GPU architecture, programming model and memory hierarchy are illustrated by analyzing memory hierarchy and exploring key performance featmes of GPU. Three basic load balance strategies on mapping applications onto GPU are presented: Prefetch, stream computing, task division. The effective relationships among different factors and optimization efficiency are tested and exposed by experiments.
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2009年第5期38-43,共6页 Journal of National University of Defense Technology
基金 国家自然科学基金资助项目(60873016) 国家863计划资助项目(2009AA01Z102) 教育部"高性能微处理器技术"创新团队资助项目(IRT0614)
关键词 GPGPU 存储层次 负载均衡策略 流计算 任务划分 GPGPU memory hierarchy load balance strategy stream computing task division
  • 相关文献

参考文献10

  • 1Hartley T D R,Catalyurek U,Ruiz A,et al.Biomedical Image Analysis on A Cooperative Cluster of Gpus and Multicores[].Proceedings of thendAnnual International Conference on Supercomputing.2008
  • 2Bond A.Havok F X:GPU-accelerated Physics for PC Games[].Proceedings of Game Developers Conference.2006
  • 3Hagen T R,Lie K A,Natvig J R.Solving the Euler Equations on Graphics Processing Units[].Proceedings of the thInternational Conference onComputational Science.2006
  • 4Elsen E,Houston M,Vishal V,et al.BN-body Simulation on GPUs[].Proc ACM/IEEE Confon Supercomputing.2006
  • 5Stone S S,Haldar J P,Tsao S C,et al.Accelerating Advanced MRI Reconstructions on GPUs[].ACM Computing Frontier Conference.2008
  • 6.OpenVIDIA:GPU-accelerated Computer Vision Library[EB][].openvidiasourceforgenet.2006
  • 7Volkov V,Demmel J W.Benchmarking GPUs to Tune Dense Linear Algebra[].SC’:Proceedings of the ACM/IEEE Conference on Su-per-computing.2008
  • 8Fatica M.Accelerating Linpack with CUDA on Heterogenous Clusters[].GPGPU’.2009
  • 9Halfhill T R.Parallel Processing with CUDA[].Microprocessor Report.2008
  • 10Gutowitz H.A Tutorial Introduction to Swarm[]..1993

同被引文献93

引证文献11

二级引证文献53

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部