Abstract
A distributed shared memory system uses shared memory within each node and distributed memory across nodes. To make better use of this hierarchical architecture, this paper discusses the performance of the MPI+OpenMP hybrid programming model and ways of implementing it. We design a multi-granularity MPI+OpenMP hybrid parallel algorithm for solving large-scale tridiagonal linear systems and compare its performance with a pure MPI algorithm on the high-performance computing cluster of Shanghai University. The results indicate that the hybrid algorithm achieves better speedup and scalability.
Source
《微电子学与计算机》
CSCD
Peking University Core Journals (北大核心)
2011, Issue 8, pp. 158-161 (4 pages)
Microelectronics & Computer
Funding
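Key Project of the Science and Technology Commission of Shanghai Municipality (10510500600)
Specialized Research Fund for the Doctoral Program of Higher Education, Ministry of Education, 2008 (200802800007)
Shanghai Leading Academic Discipline Project (J50103)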