
Application of graphics processing unit in general purpose computation
Abstract: Based on the compute unified device architecture (CUDA) for graphics processing units (GPUs), this paper describes the principles and methods of general-purpose computation on the GPU. A matrix multiplication experiment was carried out on a GeForce 8800 GT. The results show that as the matrix order increases, processing slows down on both the GPU and the CPU. After the data quantity was increased 100 times, the operation time increased only 3.95 times on the GPU, but 216.66 times on the CPU.
Authors: Zhang Jian (张健), Chen Rui (陈瑞)
Source: Computer Engineering and Design (《计算机工程与设计》), 2009, No. 14, pp. 3359-3361 (3 pages). Indexed in CSCD and the Peking University Core Journals list.
Funding: Nanjing Institute of Technology Research Start-up Fund for Introduced Talent (KXJ07056)
Keywords: graphics processing unit (GPU); compute unified device architecture (CUDA); general purpose computation; matrix multiply; matrix order
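
This record does not include the authors' code. As a minimal sketch of why the matrix multiplication experiment in the abstract suits a GPU, the Python below assumes the naive algorithm, in which every output entry is computed independently; the paper's actual CUDA kernel may differ. The function names `matmul_element`, `matmul`, and `multiply_add_count` are hypothetical and chosen only for illustration.

```python
def matmul_element(a, b, row, col):
    """Compute one entry of the product.

    Assumption: in a CUDA version, this is the work a single thread
    could be given, since it reads only row `row` of a and column
    `col` of b and writes one output value.
    """
    return sum(a[row][k] * b[k][col] for k in range(len(b)))


def matmul(a, b):
    """Multiply two matrices given as lists of rows (naive algorithm).

    The entries are computed one after another here; on a GPU they
    could be computed in parallel because none depends on another.
    """
    if len(a[0]) != len(b):
        raise ValueError("inner dimensions must match")
    return [
        [matmul_element(a, b, i, j) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def multiply_add_count(n):
    """Multiply-add operations the naive method needs for order n."""
    return n ** 3


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    print(matmul(a, b))            # [[19, 22], [43, 50]]
    print(multiply_add_count(10))  # 1000
```

Under this naive method the operation count grows with the cube of the matrix order, which is consistent with the abstract's observation that both CPU and GPU slow down as the order increases.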

