期刊文献+

一种基于OPENACC指令加速的均值模糊算法

A Box Blur Algorithm Based Acceleration OPENACC Instruction
下载PDF
导出
摘要 目前,计算机系统普遍使用GPGPU设备加速计算,但使用底层API进行加速计算繁琐而又缺乏效率,运用基于指令的高级抽象编程可以解决这个问题。运用均值模糊算法体验OPENACC的用法,OPENACC通过指令把C或C++中的计算紧密代码转移到GUP中。比较了分别使用CPU、OPENACC、CUDA实现均值模湖算法的性能表现,虽然CUDA有不错的性能,但相对于低下的开发效率和陡峭的学习曲线而言,OPENACC只用十几行代码就可以取得可观的加速比,而且随着编译器和硬件技术的发展,指令制导编译有越来越大的发展空间。 Computer systems become increasingly widespread use of GPGPU devices to speed up the calculation, but, using the low-level API for accelerated calculation is cumbersome and inefficient To solve this problem,Based on the instruction of the higher level of abstraction for programming to solve this problem,in this article,Gaussian blur algorithm experience OPENACC just usage OPENACC through instruction in C or C + + in the calculation close code transfer to GUP in con- duct,we compare the respectively use the CPU, OPENACC, CLIDA achieve Gaussian mode Lake algorithm performance performance. Although CUDA have a good performance, but relative to the low efficiency of the development and the steep learning curve,OPENACC used only a dozen lines of code can achieve significant speedup,and with the development of the compiler and hardware technology,the command guidance compile getting more the larger development space.
作者 胡玉贵
出处 《软件导刊》 2013年第1期59-61,共3页 Software Guide
基金 广东省自然科学基金项目(S2011010002537)
关键词 OPENACC CUDA GPGPU 均值模糊 OPENACC CUDA GPGPU~ BOX Blur
  • 相关文献

参考文献6

  • 1R BORDAWEKAR,U BONDHUGULA,R RAO. Can CPUs Match GPUs on performance with productivity:experiences with optimizing a FLOP-intensive application on CPUs and GPU[M].2010.
  • 2C BRECHER,C GORGELS,A HARDJOSUWITO. Simulation based ToolWear Analysis in Bevel Gear Cutting[M].2010.
  • 3M B-UCKER,R BEUCKER,A RUPP. Parallel Minimum p-Norm Solution of the Neuromagnetic Inverse Problem for Realistic Signals Using Exact Hessian-Vector Products[J].SIAM Journal on Scientific Computing,2008,(06).
  • 4R DOLBEAU,S BIHAN,F BODIN. Hmpp:A Hybrid Multi-core Parallel Programming Evironment[M].2007.
  • 5CAPS ENTERPRISE,CRAY INC.NVIDIA,THE PORTLAND GROUP. The OpenACC Application Programming Interface[M].2011.
  • 6T D HAN T.S ABDELRAHMAN. hiCUDA:High-Level GPGPU Programming[A].2011.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部