期刊文献+

流式缩减技术在GPU上的研究与应用 被引量:1

Disquisition and application of streaming curtailment technology on GPU
下载PDF
导出
摘要 随着GPU通用计算技术应用的不断深入,如何把某些并行计算任务从传统的CPU平台向GPU平台转移,把串行编程模型向并行的流式编程模型转变等,已经成为了研究的热点。讨论了基于GPU的流式编程模型,探讨了基于流式编程模型的GPU与CPU编程之间的差别与联系,最后描述了一种在GPU上的流式缩减操作算法的设计与实现。为把图形处理器应用在通用计算领域提供参考和帮助。 With the rapid improvement of GPU (graphic process unit) general computation technology, how to transfer some parallel computation tasks from traditional CPU platform to GPU platform and transform from serial programming model to parallel model has been a hotspot. Stream programming model based on GPU is discussed. The differences and relation between GPU and CPU programming are analyzed. Finally, a style of stream programming of curtailment operation on GPU is designed and implied.
出处 《计算机工程与设计》 CSCD 北大核心 2008年第5期1268-1270,1275,共4页 Computer Engineering and Design
关键词 图形处理器通用计算技术 流式编程 流式缩减 GPGPU style of stream programming streaming curtailment
  • 相关文献

参考文献8

  • 1吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量:227
  • 2Michael Macedonia. The GPU enters computing's mainstream [J]. IEEE Computer Society, 2003,36(10):106-108.
  • 3吴恩华.图形处理器用于通用计算的技术、现状及其挑战[J].软件学报,2004,15(10):1493-1504. 被引量:141
  • 4Kr Uger Jens, Westermann R Oliger. Linear algebra operations for GPU implementation of numerical algorithms [J]. ACM Transactions on Graphics, 2003,22(3):908-916.
  • 5DominikGoddeke. GPGPU basic math tutorial [EB/OL].http:// www.mathematik.uni-dortmund.de/-goeddeke/gpgpu/tutorial.html.
  • 6DominikGoddeke. GPGPU reduction tutorial [EB/OL] .http:// www.mathematik.uni-dortmund.de/-goeddeke/gpgpu/tutofial2.html.
  • 7Dave Shreiner,Mason Woo.OpenGL编程指南[M].4版.邓效祥,译.北京:人民邮电出版社,2005.
  • 8RandI J Post. OpenGL着色语言[M].天宏工作室,译.北京:人民邮电出版社,2006.

二级参考文献58

  • 1吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量:227
  • 2Clark James H.The geometry engine:A VLSI geometry system for graphics[A].In:Computer Graphics Proceedings,Annual Conference Series,ACM SIGGRAPH,Boston,1982.127~133
  • 3Fuchs Herry,Poulton John.Pixel-planes:A VLSI-Oriented design for a raster graphics engine[J].VLSI Design,1981,2(3):20~28
  • 4Eyles John,Austin John,Fuchs Henry,et al.Pixel-plane 4:A summary,advances in computer graphics hardware II[A].Eurographic Seminars Tutorials and Perspectives in Computer Graphics,New York:Springer-Verlag,1988.183~208
  • 5Fuchs Herry,Israel Laura,Poulton John,et al.Pixel-planes 5:A heterogeneous multiprocessor graphics system using processor-enhanced memories[A].In:Computer Graphics Proceedings,Annual Conference Series,ACM SIGGRAPH,Boston,1989.79~88
  • 6http://www.nvidia.com/object/gpu.html[OL]
  • 7http://developer.nvidia.com/[OL]
  • 8http://www.ati.com/developer/[OL]
  • 9http://www.gpgpu.org[OL]
  • 10Joo Luiz Dihl Comba,Dietrich Carlos A,Pagot Christian A,et al.Computation on GPUs:From a programmable pipeline to an efficient stream processor[J].Revista de Informática Teóricae Aplicada,2003,X(2):41~70

共引文献344

同被引文献20

  • 1单莹,吴建平,王正华.基于SMP集群的多层次并行编程模型与并行优化技术[J].计算机应用研究,2006,23(10):254-256. 被引量:25
  • 2ZHE F, FENG Q, ARIE K, et al. GPU cluster for high performance computing [C]∥Proceedings of the ACM/IEEE Conference on Supercomputing, Pittsburgh, Pennsylvania. USA: IEEE Computer Society, 2004: 4-7.
  • 3IROYUKI H T, IROAKI H K. Hierarchical parallel processing of large scale data clustering on a PC cluster with GPU co-processing [J]. The Journal of Supercomputing, 2006, 36(3): 219-234.
  • 4DOMINIK G, ROBERT S, JAMALUDIN M, et al. Exploring weak scalability for FEM calculations on a GPU-enhanced cluster [J]. Parallel Computing, 2007, 33(10/11): 685-699.
  • 5MICHAEL S, JEREMY E, AVNEESH P, et al. QP: A heterogeneous multi-accelerator cluster [C]∥Proceeding of the 10th LCI International Conference on High-Performance Clustered Computing. Boulder, Colorado, USA: LCI, 2009: 34-41.
  • 6JAMES P, JOHN S, KLAUS S. Adapting a message-driven parallel application to GPU-accelerated clusters [C]∥ Proceedings of the ACM/IEEE Conference on Supercomputing. Austin, Texas, USA : IEEE Computer Society, 2008: 19.
  • 7ONEPPO M. HLSL shader model 4.0 [C]∥ACM SIGGRAPH 2007 Courses. San Diego, California, USA: ACM, 2007: 112-152.
  • 8ERIK L, JOHN N, STUART O, et al. NVIDIA Tesla: A unified graphics and computing architecture [J]. IEEE Micro, 2008, 28(2): 39-55.
  • 9JOHN N, IAN B, MICHAEL G, et al. Scalable parallel programming with CUDA [J]. Queue, 2008, 6(2): 4053.
  • 10MICHAEL M, STEFANUS T, TIBERIU P, et al. Shader algebra [C]∥ ACM SIGGRAPH. Los Angeles, California, USA: ACM, 2004: 787-795.

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部