利用访存模式构建GPU高效率数据访问

Employing Access Model to Structure GPU Efficient Data Access

下载PDF

导出

摘要针对访存相对密集的应用,提出了一种基于访存模式高效率数据访问技术。该技术结合应用程序的访存特性和GPU的片上高速共享存储器特性减少应用程序对高延迟片外存储访问的次数从而提高系统数据访问的效率,通过在不同架构的GPU上进行了验证,分别取得了N卡最高9倍和A卡最高8倍的加速效果,并对各个优化策略在不同架构GPU上取得效果的原因进行了分析。 Aiming at the relatively intensive applications of memory access,an efficient data access technology based on access model is proposed.This techonology combines the memory access characteristics of applications and the on-chip high-speed shared memory characteristics of GPU to reduce the number of applications accessing off-chip menory with high latency so as to improve the efficiency of system data access.By testifying on GPU of different architectures,the acceleration effects of N card up to nine times and A card up to eight times are achieved respectively,and the reasons for the effectiveness of each optimization strategy on different architecture GPU are analyzed.

作者张瑞田密 ZHANG RUI;TIAN MI(School of Education Scinence,Yan′an University,Yan′an 716000,China;School of Computer Sciense&Technology,Beijing Institute of Technology,Beijing 100081,China;Network Information Center,Yan′an Vocational&Technical College,Yan′an 716000,China)

机构地区延安大学教育科学学院北京理工大学计算机学院延安职业技术学院网络信息中心

出处《延安大学学报（自然科学版）》 2020年第3期30-36,共7页 Journal of Yan'an University：Natural Science Edition

基金国家自然科学基金地区科学基金项目(61866038) 陕西省教育厅科研项目(18JK0865)。

关键词访存模式 GPU 数据饥饿高效率数据访问 access model GPU data undersupplied efficient data access

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量：227

二级参考文献57

1Clark James H.The geometry engine:A VLSI geometry system for graphics[A].In:Computer Graphics Proceedings,Annual Conference Series,ACM SIGGRAPH,Boston,1982.127～133
2Fuchs Herry,Poulton John.Pixel-planes:A VLSI-Oriented design for a raster graphics engine[J].VLSI Design,1981,2(3):20～28
3Eyles John,Austin John,Fuchs Henry,et al.Pixel-plane 4:A summary,advances in computer graphics hardware II[A].Eurographic Seminars Tutorials and Perspectives in Computer Graphics,New York:Springer-Verlag,1988.183～208
4Fuchs Herry,Israel Laura,Poulton John,et al.Pixel-planes 5:A heterogeneous multiprocessor graphics system using processor-enhanced memories[A].In:Computer Graphics Proceedings,Annual Conference Series,ACM SIGGRAPH,Boston,1989.79～88
5http://www.nvidia.com/object/gpu.html[OL]
6http://developer.nvidia.com/[OL]
7http://www.ati.com/developer/[OL]
8http://www.gpgpu.org[OL]
9Joo Luiz Dihl Comba,Dietrich Carlos A,Pagot Christian A,et al.Computation on GPUs:From a programmable pipeline to an efficient stream processor[J].Revista de Informática Teóricae Aplicada,2003,X(2):41～70
10Krüger Jens,Westermann Rüdiger.Linear algebra operators for GPU implementation of numerical algorithms[J].ACM Transactions on Graphics,2003,22(3):908～916

共引文献226

1何红英,尉朝闻.基于逆滤波法的图像复原技术研究[J].西安文理学院学报（自然科学版）,2009,12(3):92-95. 被引量：1
2吴恩华.图形处理器用于通用计算的技术、现状及其挑战[J].软件学报,2004,15(10):1493-1504. 被引量：141
3张杨,诸昌钤,何太军.图形硬件通用计算技术的应用研究[J].计算机应用,2005,25(9):2192-2195. 被引量：6
4梁亮,张定华,毛海鹏,顾娟.一种基于可编程图形硬件的快速三维图像重建算法[J].计算机应用研究,2006,23(1):241-243. 被引量：5
5柳有权,刘学慧,吴恩华.基于GPU带有复杂边界的三维实时流体模拟[J].软件学报,2006,17(3):568-576. 被引量：54
6郝立巍,陈武凡.医学三维动态超声实时体绘制[J].南方医科大学学报,2006,26(3):275-278. 被引量：1
7李笑盈,吴恩华.过程性纹理映射的FPGA动态生成[J].计算机辅助设计与图形学学报,2006,18(5):630-637. 被引量：1
8张庆丹,戴正华,冯圣中,孙凝晖.基于GPU的串匹配算法研究[J].计算机应用,2006,26(7):1735-1737. 被引量：15
9李宏海,肖建海.CPU+GPU技术在非编系统中的应用[J].现代电视技术,2006(6):82-85. 被引量：4
10孔渊,陆虎敏,周坚锋,郭凡.计算机图形系统发展简述[J].航空电子技术,2006,37(2):10-14. 被引量：2

1郭嘉良.数据新闻产业化发展的现实困境与未来危机——基于国内三家数据新闻媒体栏目的分析[J].现代传播（中国传媒大学学报）,2020(7):61-67. 被引量：11
2刘志强.基于邻井数据耦合的压裂井干扰试井工艺研究与应用[J].石油管材与仪器,2020,6(1):77-80. 被引量：1

延安大学学报（自然科学版）

2020年第3期

浏览历史

内容加载中请稍等...

利用访存模式构建GPU高效率数据访问

参考文献1

二级参考文献57

共引文献226

相关作者

相关机构

相关主题

浏览历史