期刊文献+

多核平台上基于可声明并行性的程序优化框架 被引量:1

A Declarative Parallelism Based Program Parallel Optimization Framework on Multicore Platforms
下载PDF
导出
摘要 针对多核体系平台上充分、有效地发掘目标程序中各种可用并行性的需求,通过引入"层次关系"、"等价关系"和"特性权重"的支持,提出了一种扩展的TStreams模型,并在此基础上实现了一个基于可声明并行性的程序并行优化框架(FAPOF).该框架支持用户对算法的并行特性进行多角度、多粒度的描述并指定适用的各类并行优化规则.基于用户描述,框架可以编译驱动的方式评估各种优化决策的组合,以半自动化的方式对目标程序进行并行优化.由此可将优化过程中程序员原本复杂、困难的并行优化的"决策"工作转化为可用并行性的"描述"工作.实验结果表明,此方法显著地降低了并行优化的难度,提高了并行优化的效率. To satisfy the requirements for exploiting various types of available parallelism in destination programs on multicore platforms fully and efficiently,an extended TStreams model was developed by introducing the hierarchical relationship,equivalent relationship and characteristic weights as the support,so as to implement a declarative parallelism based feature aware parallel optimization framework(FAPOF) which is on the basis of the model.FAPOF supports multi-aspects and multi-grains description for the latent parallelism and appropriate optimization rules available in destination algorithms.With users' description,FAPOF can evaluate the different combinations of the optimization decisions in a compiler-driven method,thus the destination programs can be optimized in parallel semi-automatically.In this way,the programmers' complex and difficult "decision-making" jobs in parallel optimization will be transformed into an easier "description" job for available parallelism.Test results showed that the method mentioned above can reduce the difficulty in parallel optimization significantly with the optimization efficiency improved.
出处 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2011年第1期22-26,共5页 Journal of Northeastern University(Natural Science)
基金 国家高技术研究发展计划项目(2009AA011908)
关键词 TStreams模型 多核 可声明并行性 程序优化框架 自动并行化 TStreams model multicore declarative parallelism program optimization framework automatic parallelization
  • 相关文献

参考文献10

  • 1Graham R L. The MPI 2.2 standard and the emerging MPI 3 standard[M]. Berlin: Springer, 2009.
  • 2OpenMP A R B. OpenMP application program interface[EB/ OL]. (2008 05-31)[2009-05-01]. http://www. openmp, org/mp documents/spec30, pdf.
  • 3Leiseron C E. The Cilk + + concurrency platform [C]// Proceedings of the 46th Annual Design Automation Conference. San Francisco, 2009 : 522 - 527.
  • 4Yang C Y, Yuan H, Wang N. A parallelism analysis pattern for real-time vehicle recognition algorithm [ C]//Proceedings of the IEEE Intelligent Vehicles Symposium. Istanbul, 2007 : 369 - 374.
  • 5Yang C Y, Wen X Z, Yuan H. A study on data parallel optimization for real-time vehicle recognition algorithm[C]// Proceedings of the 10th International IEEE Conference on Intelligent Transportation Systems. Seattle, 2007:661 - 665.
  • 6Knobe K, Offner C. Compile to TStreams--a new model of parallel computation [ R ]. Cambridge: Hewlet Packard Labs and Cambridge Research Laboratory, 2005.
  • 7Knobe K, Blower M, Chen C, et al. Intel concurrent collection for C++ textual notation[EB/OL]. (2010 -03- 01) [2010- 05 -05 ]. http: //software. intel, com/file/ 30233.
  • 8Liu W, Wen X Z, Duan B B, et al. Rear vehicle detection and tracking for lane change assist[C]//Proceedings of the IEEE Intelligent Vehicle Symposium. Istanbul, 2007 : 252 - 257.
  • 9Chandramowlishwaran A, Knobe K, Vuduc R. Applying the concurrent collections programming model to asynchronous parallel dense linear algebra [ C]//Proceedings of the 15th ACM Symposium on Principles and Practice of Paralld Programming. New York, 2010 : 345 - 346.
  • 10Chandramowlishwaran A, Knobe K, Vuduc R. Performance evaluation of concurrent collections on high-performance muhlcore computing systems [ C ] //Proceedings of IEEE International Symposium on Parallel & Distributed Processing. Atlanta, 2010:1 - 12.

同被引文献8

  • 1Vasiliadis G, Polychronakis M, Antonatos S, et al. Regular expression matching on graphics hardware for intrusion detection[ C ]//Proceedings of the 12th International Symposium on Recent Advances in Intrusion Detection. Saint- Malo, France : Springer Verlag ,2009 :265 - 283.
  • 2Xia Gao, Liuy Bin. Accelerating network applications on X86 -64 platforms [ C ]// Proceedings of IEEE Symposium on Computers and Communications. Ricci on e, Italy:IEEE Computer and Commtmications Societies ,2010:906 - 912.
  • 3陈虎,欧彦麟,陈海波,等.面向多核处理器平台的并行HashJOIN算法设计与实现[J].计算机研究与发展,2010,47(z1):171-175.
  • 4MIT Lincoln Laboratory. MIT DARPA intrusion detection data sets [ Z/OL ]. [ 2010 - 09 - 25 ]. http ://www. 11. mit. edu/mission/ communications/ist/corpora/ideval/data/index, html.
  • 5陈卓,杨爱良,王骥.基于PLB总线的多通道SGDMA设计[J].航空电子技术,2009,40(1):12-15. 被引量:5
  • 6徐乾,鄂跃鹏,葛敬国,钱华林.深度包检测中一种高效的正则表达式压缩算法[J].软件学报,2009,20(8):2214-2226. 被引量:28
  • 7张顺利,张定华,李明君,吉军.基于SIMD技术的锥束ART算法快速并行图像重建[J].仪器仪表学报,2010,31(3):630-634. 被引量:6
  • 8杨云,宓佳,党宏社.嵌入式入侵检测系统的设计与实现[J].计算机工程与设计,2011,32(1):21-23. 被引量:8

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部