
FILiC: A Framework for Interactive Library on CUDA
Abstract: CUDA is the GPU programming model introduced by NVIDIA, and it provides strong support for using GPU computing power efficiently. However, CUDA threads cannot directly access peripherals such as I/O devices and network cards, and CUDA's support for interaction between CUDA threads and peripherals is currently very limited: the only such library function available is a non-real-time screen print (printf). This paper therefore proposes FILiC, a framework for interactive library functions, which uses interaction between the device and the host to efficiently implement real-time and fairly complete I/O and related functions for CUDA threads. The framework also offers good scalability: CUDA programmers and compiler developers can build on it to develop new functions through which CUDA threads interact with peripherals, as their needs require.
Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2012, No. 3, pp. 124-127, 138 (5 pages)
Keywords: CUDA, FILiC, Interactive library, Scalability
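
The abstract says only that FILiC serves I/O for CUDA threads through interaction between the device and the host; it does not describe the mechanism. As a rough illustration of that general idea, and not of FILiC's actual design, the following host-only Python sketch uses ordinary threads as stand-ins for CUDA threads. Each worker cannot perform I/O itself, so it posts a request to a shared queue and blocks until a host service loop answers. All names here (`device_thread`, `host_service`, `run_demo`, the `"read_line"` operation) are hypothetical and chosen for this example.

```python
import queue
import threading


def device_thread(tid, requests, results):
    """Stand-in for a CUDA thread (assumption: it cannot do I/O on its own)."""
    reply = queue.Queue(maxsize=1)             # private slot for the host's answer
    requests.put(("read_line", tid, reply))    # post an I/O request to the host
    results[tid] = reply.get()                 # block until the host responds


def host_service(requests, lines, expected):
    """Host-side loop that performs the I/O on behalf of the workers."""
    for _ in range(expected):
        op, tid, reply = requests.get()
        if op == "read_line":
            reply.put(lines[tid])              # pretend this came from a real device
        else:
            reply.put(None)                    # unknown operation


def run_demo(num_threads=4):
    requests = queue.Queue()
    results = {}
    lines = [f"input for thread {t}" for t in range(num_threads)]

    host = threading.Thread(target=host_service,
                            args=(requests, lines, num_threads))
    workers = [threading.Thread(target=device_thread,
                                args=(t, requests, results))
               for t in range(num_threads)]

    host.start()
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    host.join()
    return results


demo_results = run_demo()
for tid in sorted(demo_results):
    print(tid, demo_results[tid])
```

In a real CUDA setting the shared queue would have to be replaced by memory visible to both the host and the device, and new interactive functions would correspond to new operation codes handled by the host loop; those details are assumptions and are not taken from the paper.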