期刊文献+

一种基于线性代数的计算和数据自动分解算法 被引量:2

An Automatic Computation and Data Decomposition Algorithm Based on Linear Algebra
下载PDF
导出
摘要 在针对分布内存体系结构的并行识别技术中,如何对计算和数据进行合理分解,以增加数据引用的本地化、减少处理器间的通信是提高并行程序性能的关键。本文通过对Anderson-lam分解算法完整性的补充,给出了一种可实现无通信的计算划分和数据分布算法,并阐述了对该算法在工程实践中的一些优化考虑。 Increasing the locality of data references and minimizing communication of processors by computation and data decomposition are the key optimization for achieving high performance on large-scale parallel machines. A compiler algorithm that automatically finding computation and data decomposition with no communication based on Andersonlain algorithm is described, and some improvement on engineering implement is also mentioned in this paper.
出处 《计算机科学》 CSCD 北大核心 2007年第1期278-280,293,共4页 Computer Science
基金 国防重点科研项目资助 河南省杰出人才创新基金(0521000200)
关键词 并行编译 计算划分 数据分布 Parallel compilation , Data distribution, Computation partition
  • 相关文献

参考文献10

  • 1Knobe K,Lukas J D,Steele G L.Data optimization:Allocation of arrays to reduce communication on SIMD machies.Journal of Parallel and Distributed Computing,1990,8:102~118
  • 2Lam M S,Wolf M E.Compilation techniques to achieve parallelism and locality.In:Proceedings of the DARPA Software Technology Conference,April 1992.150~158
  • 3Huang C H,Sadayappan P.Communication-free hyperplane positioning of nested loops.In:Banerjee U,Gelernter D,Nicolau A,et al,eds.Languages and Compilers for Parallel Computing.Berlin,Germany,Springer-Verlag,1992.186~200
  • 4Kulkarni D,Kumar K G,Basu A,et al.Loop partitioning for distributed memory multiprocessors as unimodular transformations.In:Proceedings of the 1991 ACM International Conference on Supercomputing,June 1991.206~215
  • 5Kumar K G,Kulkarni D,Basu A.Deriving good transformations for mapping nested loops on hierarchical parallel machines in polynomial time.In:Proceedings of the 1992 ACM International Conference on Supercomputing,July 1992.82~91
  • 6Kennedy K,Kremer U.Automatic Data Layout for High Performance Fortra[A].Proc Supercomputer[C].San Diego,Calif,1995
  • 7Wolf M E.Improving Locality and Parallelism in Nested Loops:[PhD Thesis].Stanford University,August 1992
  • 8Anderson J M,Lam M S.Global optimizations for parallelism and locality on scalable parallel machines.In:Proceedings of the SIGPLAN'93 Conference on Programming Language Design and Implementation,Albuquerque,NM,June 1993.112~135
  • 9Anderson J M.automatic computation and data decomposition for multiprocessors:[PhD Thesis].Stanford University,March 1997
  • 10Guo Minyi.Efficient Techniques for Data Distribution and Redistribution in Parallelizing Compilers:[PhD Thesis].Tsukuba University,1998

同被引文献18

  • 1张平,李清宝,赵荣彩.消息传递并行程序的自动生成[J].计算机工程与应用,2007,43(8):74-77. 被引量:2
  • 2杜澎,赵荣彩,董春丽.MPI通信代码自动生成算法[J].计算机应用,2007,27(3):759-761. 被引量:5
  • 3沈志宇,胡子昂.并行编译方法[M].北京:国防工业出版社,2000:12-38.
  • 4Ferner C S. The Paraguin Compiler: Message-passing Code Generation Using SUIF[ C ]//Proceedings of the IEEE Southeast Con 2002. Columbia SC, USA : IEEE, 2002 : 1 - 6.
  • 5University of Houston Computer Science Department High Performance Computing Tools Group. Overview of the Open64 Compiler Infrastructure [ EB/OL ]. ( 2002-11-12 ) [ 2011-09-22 ]. http://www2, cs. uhedu/ - dragon/Documents/open64-doc, pdf.
  • 6SCI. WHIRL Intermediate Language Specification [ EB/OL]. (2000-05-10 ) [ 2011-09-22 ]. http: // open64, sourceforge, net. whirl, pdf.
  • 7Dick G Henri E B, Ceriel J H,et al. Modern Compiler Design [M]. UK:JohnWiley& SonsLtd, 2010:58 -63.
  • 8Randy A, Ken K. Optimizing Compilers for Modern Architectures:A Dependence-based Approach [ M ]. US: Morgan Kaufmann Pubiishers, 2001:201 - 205.
  • 9Hall M, Anderson J, Amarasinghe S, et al. Maximizing Multiprocessor Performance with the SUIF Compiler[ J]. IEEE Computer, 1996, 29(12) :84 -89.
  • 10Chapman B, Hernandez O, Huang L, et al. Dragon: An Open64-based Interactive Program Analysis Tool for Large Applications[ C ]//Proceedings of the Fourth International Conference on Parallel and Distributed Computing, Applications and Technologies. Chengdu, China : IEEE,2003:46 - 47.

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部