期刊文献+

基于MPI的并行分布式高斯消元算法设计和评估 被引量:4

MPI-based Parallel Algorithm Design and Evaluation for Gaussian Elimination
下载PDF
导出
摘要 为了满足电磁仿真数值计算日益增高的速度和精度的需求,针对单机内存需求和计算负荷需求都比较大的矛盾,提出基于分布式并行机群环境的并行计算划分和并行存储划分的算法设计思想,并且给出了基于行列循环数据划分的并行计算算法描述。在此基础上进行了实验验证,用MPI+FORTRAN和MPI+C编程实现了对大矩阵求逆的分布式高斯消元,并进行了性能评估和实验验证,在国内外超级计算中心平台上的实验结果表明所完成的工作对于系统的电磁仿真计算具有应用价值,该算法和代码实现可应用于电磁仿真计算的矩量法MOM(Method of Mom)中。 To address the problem of the conflicting requirements between memory need and computation payload of one single computer used for increasingly fast and precise EM numerical computation simulation, a novel parallel data distribution and computation partition based on matrix's row/column-cycle array division was proposed. The new data distribution scheme could be used in distribution cluster environment and the Computation partition algorithm was proposed and described. First, a novel scheme was proposed based on MPI (Message Passing Interface) distributed parallel computation environment because MPI-based computation platform could be used to implement the computation and storage partition at the same time for the computation of the inverse of large dense matrix used in MOM (Method of Moment) electromagnetic simulation. Then the implementation and performance evaluation of the algorithm using MPI+FORTARN and MPI+ C programming were given out. Finally the effectiveness (correctness and speedup) of the proposed scheme was tested and verified using both TACC system in university of Texas at Austin in USA and HPC system in NPU China. The codes and results can be applied to EM data simulation in subsurface detection.
出处 《系统仿真学报》 CAS CSCD 北大核心 2009年第20期6429-6431,6435,共4页 Journal of System Simulation
基金 西北工业大学第四批英才培养计划(05XE0124) 2008届毕业设计重点扶持项目(W00220)
关键词 分布式环境下的并行算法 数据和计算划分 基于消息传递接口的代码设计 能评估 MPI+C parallel algorithm for distributed environment storage and computation partition Message Passing Interface (MPI)-based code design performance evaluation MPI+C.
  • 相关文献

参考文献5

  • 1S F McGinn, R E Shaw. Parallel Gaussian Elimination Using OpenMP and MPI [C]// Proceedings of the 16th Annual International Symposium on High Performance Computing Systems and Applications (HPCS'02). Moncton, NB, Canada, 2002: 169-176.
  • 2Jeff Howe, Shawn Bratcher, Parallel Guassian Elimination [DB/OL]. (2006-9)[2008-1] http://www.cse.ucsd.edu/classes/ fa98/cse164b/ Proj ects/PastProj ects/LU/.
  • 3张为华,王鹏,臧斌宇,朱传琪.一种基于代表元的划分算法[J].计算机学报,2008,31(3):400-410. 被引量:4
  • 4Wei Bing Lu, Tie Jun Cui, Zhi Guo Qian, et al. Accurate Analysis of Large-Scale Periodic Structures Using an Efficient Sub-Entire- Domain Basis Function Method [J]. IEEE Transactions on Antennas and Propagation (S0018-926x), 2005, 53(1): 3078-3085.
  • 5Luis M Camacho, Saibun Tjuatja. FDTD Simulation of Microwave Scattering from a Lung Tumor [C]// Antennas and Propagation Society International Symposium, 2005 IEEE, 3-8 July 2005. USA: IEEE, 2005, 3A: 815-818.

二级参考文献20

  • 1Wolf M E. Improving locality and parallelism in nested loops [Ph. D. dissertation]. Stanford University, CSL_ TR-92- 538, 1992
  • 2Anderson J M, Amarasinghe S P, Lam M S. Data and computation transformations for multiprocessors//Proceedings of the 5th ACM/SIGPLAN Symposium on Principles and Practice of Parallel Programming. Santa Barbara, California, USA, 1995:166-178
  • 3Anderson J. Automatic computation and data decomposition for multiprocessors[Ph. D. dissertation]. Stanford University, Standford, CA, 1997
  • 4Bau D, Kidukula Iet al. Solving alignment using elementary linear algebra//Proceedings of the 7th Workshop on Languages and Compilers for Parallel Computing. Ithaca, NY, USA, 1994:46-60
  • 5Ramanujam J, Sadayappan P. Compile-time techniques for data distribution in distributed memory machines. IEEE Transactions on Parallel and Distributed Systems, 1991, 2 (4) : 472-482
  • 6Huang C H, Sadayappan P. Communication-free hyperplane partitioning of nested loops. Journal of Parallel and Distribu ted Computing, 1993, 19(2): 90-102
  • 7Shih K-P, Sheu J P, Huang C H. Statement level communication free partitioning techniques for parallelizing compilers//Proceedings of the 9th Workshop on Languages and Compilers for Parallel Computing. Minneapolis, Minnesota, USA, 1997:389-403
  • 8Shih K P, Sheu J P, Huang C H. Statement-level communi cation-free partitioning technique for parallelizing compilers. The Journal of Supercomputing, 2000, 15(3) : 243-269
  • 9Lim A W, Lam M S. Maximizing parallelism and minimizing synchronization with affine transforms//Proceedings of the Conference Record of the 24th ACM SIGPLAN/SIGACT Symposium on Principles of Programming Languages. Paris, France, 1997:201-214
  • 10Lim A W, Lam M S. Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Computing, 1998, 24(3-4): 445 475

共引文献3

同被引文献18

  • 1黄炜,张建秋.构造准循环LDPC码生成矩阵的块高斯消元法[J].复旦学报(自然科学版),2008,47(6):691-696. 被引量:3
  • 2姚俊,文传军.关于n元线性方程组求解的探讨[J].常州工学院学报,2004,17(4):25-29. 被引量:3
  • 3北京大学数学系几何与代数小组教研室.高等代数[M].2版,北京:高等教育出版社,1981.
  • 4小林崇史.视点检测器和视点检测方法及其应用.专利号:95168095.2.
  • 5Bai Lianfa, Zhang Yi, Gu Guohua, et al. Analysis and fusion methods on low light level image and ultra-violet image. I'I, 2007,36(1) :113-117.
  • 6华健,韩学山,王锦旗,陈芳,李超.改进高斯消元算法在电力系统拓扑结构分析中的应用[J].电网技术,2007,31(23):57-61. 被引量:25
  • 7G Gerardi,L Abbene,A La Manna. Digtal Filtering and Analysis for a Semiconductor X-Ray Detector Data Acquisition[J].Nuclear Instruments and Methods in Physics Research Section A:Accelerators Spectrometers Detectors and Associated Equipment (S0168-9002),2007,(1-2):378-380.
  • 8T Noulis,C Deradonis,S Siskos. Particle Detector Tunable Monolithic Semi-Gaussian Shaping Filter Based on Trans-Conductance Amplifiers[J].Nuclear Instuments and Methods in Physics Research Section A:Accelerators Spectrometers Detectors and Associated Equipment (S0168-9002),2008,(02):330-337.
  • 9龚沛曾;陆慰民;杨志强.Visual Basic程序设计简明教程[M]北京:高等教育出版社,2003.
  • 10蔡和熙,陈沐天.对高斯消元法的改进以及在工程上的应用[J].计算机辅助工程,1997,6(4):61-66. 被引量:5

引证文献4

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部