
Using Memory Access Patterns to Build Efficient GPU Data Access (利用访存模式构建GPU高效率数据访问)

Employing Access Model to Structure GPU Efficient Data Access
Abstract: For applications that are relatively memory-access intensive, an efficient data access technique based on memory access patterns is proposed. The technique combines the memory access characteristics of the application with the GPU's high-speed on-chip shared memory to reduce the number of accesses to high-latency off-chip memory, thereby improving the efficiency of data access. Validated on GPUs of different architectures, it achieves speedups of up to 9x on NVIDIA GPUs and up to 8x on AMD GPUs. The reasons for the effect of each optimization strategy on the different GPU architectures are also analyzed.
Authors: Zhang Rui (张瑞); Tian Mi (田密) (School of Education Science, Yan'an University, Yan'an 716000, China; School of Computer Science & Technology, Beijing Institute of Technology, Beijing 100081, China; Network Information Center, Yan'an Vocational & Technical College, Yan'an 716000, China)
Source: Journal of Yan'an University: Natural Science Edition (《延安大学学报(自然科学版)》), 2020, No. 3, pp. 30-36 (7 pages)
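Funding: National Natural Science Foundation of China, Regional Science Fund Project (61866038); Scientific Research Project of the Shaanxi Provincial Department of Education (18JK0865).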
Keywords: memory access pattern; GPU; data starvation; efficient data access