期刊文献+

分布内存系统中节点间软流水优化技术

Exploiting Inter-node Pipelining Parallelism in Distributed Memory Systems
下载PDF
导出
摘要 Maximize parallelism and minimize communication overheads are important issues for distributed memory systems. Communication and data redistribution cannot be avoided even when considering global optimization of data distribution and computation decomposition. A new approach based on loop fusion is presented exploiting pipelining parallelism, thus communication overhead can be hidden and data redistribution can be avoided. This technique exploits pipelining from complex loop structures, which distinguishes itself from traditional pipelining techniques. Ex-periments show that the technique is superior to other optimizations. Maximize parallelism and minimize communication overheads are important issues for distributed memory systems. Communication and data redistribution cannot be avoided even when considering global optimization of data distribution and computation decomposition. A new approach based on loop fusion is presented exploiting pipelining parallelism, thus communication overhead can be hidden and data redistribution can be avoided. This technique exploits pipelining from complex loop structures, which distinguishes itself from traditional pipelining techniques. Experiments show that the technique is superior to other optimizations.
出处 《计算机科学》 CSCD 北大核心 2002年第11期24-28,共5页 Computer Science
基金 国家高性能计算基金 国家自然科学基金(69933020)
关键词 分布内存系统 软流水优化技术 数据存储 计算机 数据重分布 Parallel compiling, Data redistribution, Software pipeline,Parallel loop,Loop fusion
  • 相关文献

参考文献11

  • 1Hiranandani S, et al. Evaluation of Compiler Optimization for FortranD on MIMD. In: 1992 Intl. Conf. on Supercomputing, Washington, D. C, July 1992
  • 2Banerjee P,et al. An Overview of the PARADIGM Compiler for Distributed Memory Message-Passing Multicomputers. IEEE Computer, Oct. 1995. 37~47
  • 3Lowenthal D K, James M. Run-Time Selection of Block Size in Pipelined Parallel Programs. Second Merged IPPS/SPDP Symposium, 1999. 82~87
  • 4Van der Wijngaart R F, et al. Analysis and Optimization of Software Pipeline Performance on MIMD Parallel Computers. Journal of Parallel and Distributed Computing, 1996
  • 5Wolfe M. High Performance Compilers for Parallel Computing. Addison-Wesley Reading, MA, 1996
  • 6Callahan C D. A Global Approach to the Detection of Parallelism: [PhD thesis]. Dept. of Computer Science, Rice Univ., Mar.1987
  • 7McKinley K S,Carr S,Tseng C-W. Improving Data Locality With Loop Transformations. ACM Transactions on Programming Languages and Systems, 1996,18(4)
  • 8Singhai S K,McKinley K S. An Algorithm for Improving Parallelism and Cache Locality. The Computer Journal, 1997,40(7)
  • 9Manjikian N,Abdelrahman T S. Fusion of Loops for Parallelism and Locality. IEEE Transaction on Parallel and Distributed Systems, 1997,8(2)
  • 10Abdelrahman T, et al. Locality Enhancement for Large-Scale Shared-Memory Multiprocessor. Languages, Compilers, and Run-Time Systems for Scalable Computers 4th International Workshop, LCR'98,LNCS 1511 ,Springer-Verlag, 1998

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部