期刊文献+

Cell BE环境中BF算法并行化及性能优化 被引量:1

Parallelization and Performance Optimization of BF Algorithm in Cell BE Environment
下载PDF
导出
摘要 BF(Brute Force)算法在Cell BE环境中的并行化及性能优化研究是此类算法向CellBE环境迁移的基础。根据CellBE独特的结构及算法本身的特点,采用计算-加速的编程模型实现并行化,分析评价双缓冲、Mailbox、DMA-list机制对BF算法性能的影响。结果显示,3种机制的单独应用都可以优化BF算法在CellBE上的并行处理性能,任意2种以及3种机制的综合应用都可以不同程度地进一步提升性能,其中3种机制的综合应用使性能达到最优。 The parallelization and optimization of Brute Force(BF) algorithm in Cell BE environment is the basic of migration of this kinds of algorithms to Cell BE environment. According to the architecture of Cell BE computation-acceleration model is used as the pi'ogramming model, and the mechanism of dual-buffer, Mailbox, DMA-list are evaluated as the data transfer mechanism for performance tuning of BF algorithm. Results show that each transfer mechanism can improve the performance. Different degrees of performance improvement are achieved when two or three mechanisms are used together, and the combination of all three mechanisms gets the highest performance.
出处 《计算机工程》 CAS CSCD 北大核心 2010年第6期35-38,共4页 Computer Engineering
基金 天津市科技支撑计划基金资助重点项目(09ZCKFGX00400) 天津市应用基础及前沿技术研究计划基金资助重点项目(08JCZDJC19700)
关键词 BF算法 CELL BE处理器 并行化 性能优化 Brute Force(BF) algorithm Cell BE processor parallelization performance optimization
  • 相关文献

参考文献5

  • 1张庆丹,戴正华,冯圣中,孙凝晖.基于GPU的串匹配算法研究[J].计算机应用,2006,26(7):1735-1737. 被引量:15
  • 2IBM Systems and Technology Group. Cell Broadband Engine: An Introduction[EB/OL]. (2006-12-01). http://www.ibm.com.
  • 3IBM Systems and Technology Group. Cell Broadband Engine Programming Tutorial Version2.0[EB/OL]. (2006-12-15). http://www.ibm.com.
  • 4汤灿群,李春江.异构多核处理器的编程模型和编译技术[EB/OL].(2007-01-01)http://www.ccrg.org.cn/documents/Multicore/Multi_Core_Programming_Compiler.pdf.
  • 5IBM Systems and Technology Group. Developing Code for Cell- DMA and Mailbox[EB/OL]. (2006-12-01). http://www.ibm.com.

二级参考文献10

  • 1吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量:225
  • 2陈国良,林洁,顾乃杰.分布式存储的并行串匹配算法的设计与分析[J].软件学报,2000,11(6):771-778. 被引量:10
  • 3JOWENS JD, LUEBKE D, GOVINDARAJU N, et al. A survey of general-purpose computation on graphics hardware [ A]. EUROGRAPHICS 2005[ C].2005.21 -51.
  • 4MARK WR, GLANVILLE RS, AKELEY K, et al. Cg: a system for programming graphics hardware in a C-like language[J]. ACM Transactions on Graphics, 2003, 22(3) : 896 -907.
  • 5LEFOHN A, KNISS J, OWENS J. Implementing efficient parallel data structures on GPUs[ A]. GPU gems 2: programming techniques for high performance graphics and general purpose computation[ C].Addison-Wesley, 2005. 521 -545.
  • 6HARRIS M. Mapping computational concepts to GPUs[ A]. GPUGems2: programming techniques for high performance graphics and general purpose computation[ C]. Addison-Wesley, 2006.493 -508.
  • 7THOMPSON CJ, HAHN S, OSKIN M. Using modem graphics architectures for general-purpose computing: a framework and analysis[A]. Proceedings of the 35th Annual ACM/IEEE International Symposium on Microarchitecture[C].2002. 306 -317.
  • 8BUCK I, FOLEY T, HORN D, et al. Brook for GPUs: stream computing on graphics hardware[ A]. Proceedings of the ACM SIGGRAPH 2004[C].2004.
  • 9BUCK I. GPGPU: General-purpose computation on graphics hardware-GPU computation strategies & tricks[ A]. ACM SIGGRAPH Course Notes[C].2004.
  • 10CHARRAS C, LECROQ TT. Handbook of exact string matching algorithms[ M]. London: King's College London Publications, 2004.

共引文献14

同被引文献5

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部