期刊文献+

分布存储系统中优化通信的冗余计算分割 被引量:1

Redundant Computation Partitioning in Distributed-Memory Systems
下载PDF
导出
摘要 针对并行循环套序列 ,提出一种冗余计算分割的通信优化方法 .根据数据流分析 ,文中给出用以确定每个循环套的冗余计算量的一般方法 ,并在此基础上提出冗余计算分割的实现和判定 .针对规则依赖的程序 ,该文还提出了一个高效的冗余计算分割的实现方法 .该技术已经在一个并行编译器中实现 ,试验结果表明 ,它比传统的通信优化技术有明显的优越性 . Maximize parallelism and minimize communication overheads are important issues in distributed-memory systems. Using ordinary computation partitioning methods, inter-loop communications cannot be avoided usually. This paper presents a communication optimization technique, named communication-free redundant computation partitioning, for parallel loop sequences. Based on dataflow analysis, a generic method is given to select computation redundancy properly for each partitioned loop nest, so as to eliminate inter-loop communications. Upon this, the implementation and criterion of communication-free redundant computation partitioning are given. An efficient algorithm is also provided for a certain class of programs, called regularly dependent programs. The technique spends very little redundant computation and space to get lesser communication. When the parallel loop sequence is enclosed by sequential loops, the technique can eliminate some data redistribution points for the program, simplifying the global deduction of data and computation partitioning. The technique has been realized in Autopar3.0, which is a parallelizing compiling system targeting at Dawning 3000 parallel machines. According to the results of experiments, the technique is superior to some well-known communication optimization.
出处 《计算机学报》 EI CSCD 北大核心 2003年第2期180-187,共8页 Chinese Journal of Computers
基金 国家高性能计算基金 国家自然科学基金项目 ( 6 99330 2 0 )资助 .
关键词 分布存储系统 冗余计算分割 并行编译 冗余计算 计算分割 通信优化 Data communication systems Optimization Parallel processing systems Program compilers
  • 相关文献

参考文献13

  • 1Shih K-P, Sheu J-P, Huang C-H. Statement-level communication-free partitioning techniques for parallelizing compilers. In: Proceedings of the 9th Workshop on Languages and Compilers for Parallel Computing, 1997. 389~403
  • 2Huang C-H, Sadayappan P. Communication-free hyperplane partitioning of nested loops. Journal of Parallel and Distributed Computing, 1993, 19(2): 90~102
  • 3Anderson J. Automatic computation and data decomposition for multiprocessors[Ph D dissertation]. Department of EE and CS,Stanford University, Stanford, CA, 1997
  • 4Bau D, Koduklula I et al. Solving alignment using elementary linear algebra. In: Proceedings of the 8th International Workshop on Languages and Compilers for Parallel Computing, 1995.46~60
  • 5Lee P Z. Efficient algorithms for data distribution on distributed memory parallel computers. IEEE Transactions on Parallel and Distributed Systems, 1997, 8(8): 825~839
  • 6Lim A W, Cheong G I, Lam M S. An affine partitioning algorithm to maximize parallelism and minimize communication. In: Proceedings of the 13th ACM SIGARCH International Conference on Supercomputing, Rhodes, Greece, 1999.228~237
  • 7Olav Beckmann, Paul H J Kelly. A linear algebra formulation for optimizing replication in data parallel programs. In: Proceedings of the 12th International Workshop on Languages and Compilers for Parallel Computing, Yorktown Heights, NY, USA,2000. 100~116
  • 8Ulrich Kremer. Automatic data layout with read-only replication and memory constraints. In: Proceedings of the 10th International Workshop on Language and Compilers for Parallel Computing, Chapel Hill, NC, USA, 1998. 419~422
  • 9Hiranandani S, Kennedy K, Tseng C W. Evaluating compiler optimizations for Fortran D. Journal of Parallel and Distributed Computing, 1994, 21(1): 27~45
  • 10Peng Tu, David Padua. Automatic array privatization. In: Proceedings of the 6th International Workshop on Languages and Compilers for Parallel Computing, Ithaca, NY,USA, 1994. 500~521

同被引文献47

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部