
Multi-dimensional parallelism recognition method of nested loop for many-core processors

Cited by: 3
Abstract: Existing loop parallelism recognition methods have shortcomings when applied to many-core processors: when the loop dimension selected for parallelization has few iterations, serious load imbalance can result. To address this problem, the paper proposes a multi-dimensional parallelism recognition method for many-core processors. When existing recognition methods cannot achieve good load balance, the method selects several dimensions of the nested loop for parallelization and merges the iteration spaces of those parallel dimensions before partitioning the work into tasks, which reduces the impact of load imbalance on the program's parallel efficiency. The method has been implemented in the automatic parallelization system developed by the research group, and in practical use it can effectively improve the parallel execution efficiency of some applications on many-core processors.
Authors: Li Yingying; Pang Jianmin; Li Yanbing; Zhai Shengwei (State Key Laboratory of Mathematical Engineering & Advanced Computing, Zhengzhou 450002, China; Information Engineering University, Zhengzhou 450002, China; The 27th Research Institute of China Electronics Technology Group Corporation, Zhengzhou 450047, China)
Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list, 2018, No. 11, pp. 3311-3314 (4 pages)
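Funding: National Natural Science Foundation of China, General Program (61472447); National "863" Program (2014AA01A300); National Science and Technology Major Project on Core Electronic Devices, High-end Generic Chips and Basic Software ("核高基")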
Keywords: multi-dimensional parallelism recognition; many-core processor; automatic parallelization; nested loop
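
The load-balancing idea in the abstract (merging the iteration spaces of several parallel dimensions before task partitioning) can be illustrated with a small sketch. This is not the paper's algorithm, whose details are not given in this record. It assumes a rectangular two-level loop nest with no cross-iteration dependences, uniform cost per innermost iteration, and a static contiguous block partition; all function names are hypothetical.

```python
def block_partition(n_iters, n_workers):
    """Split n_iters iterations into contiguous blocks, one per worker.

    Returns the iteration count assigned to each worker (assumed static
    block scheduling; sizes differ by at most one).
    """
    base, extra = divmod(n_iters, n_workers)
    return [base + (1 if w < extra else 0) for w in range(n_workers)]


def single_dim_loads(outer, inner, n_workers):
    """Work per worker when only the outer loop is parallelized.

    Each outer iteration carries `inner` units of work, so a worker that
    receives no outer iteration stays idle.
    """
    return [chunk * inner for chunk in block_partition(outer, n_workers)]


def collapsed_loads(outer, inner, n_workers):
    """Work per worker when both dimensions are merged into one
    iteration space of size outer * inner and then partitioned."""
    return block_partition(outer * inner, n_workers)


def delinearize(k, inner):
    """Recover the original (i, j) indices from a merged index k."""
    return divmod(k, inner)


def imbalance(loads):
    """Maximum load divided by mean load; 1.0 means perfect balance."""
    return max(loads) / (sum(loads) / len(loads))


if __name__ == "__main__":
    # Hypothetical sizes: few outer iterations, many cores.
    outer, inner, workers = 10, 1000, 64
    print(imbalance(single_dim_loads(outer, inner, workers)))
    print(imbalance(collapsed_loads(outer, inner, workers)))
```

With these hypothetical sizes, parallelizing only the outer loop leaves 54 of the 64 workers idle, while the merged iteration space gives every worker 156 or 157 iterations. In compiled code, a comparable effect is commonly obtained with the OpenMP `collapse` clause.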