Asynchronous Approach to Memory Management in Sparse Multifrontal Methods on Multiprocessors 被引量：1

Asynchronous Approach to Memory Management in Sparse Multifrontal Methods on Multiprocessors

下载PDF

导出

摘要 This research covers the Intel? Direct Sparse Solver for Clusters, the software that implements a direct method for solving the Ax = b equation with sparse symmetric matrix A on a cluster. This method, researched by Intel, is based on Cholesky decomposition and could be considered as extension of functionality PARDISO from Intel??MKL. To achieve an efficient work balance on a large number of processes, the so-called “multifrontal” approach to Cholesky decomposition is implemented. This software implements parallelization that is based on nodes of the dependency tree and uses MPI, as well as parallelization inside a node of the tree that uses OpenMP directives. The article provides a high-level description of the algorithm to distribute the work between both computational nodes and cores within a single node, and between different computational nodes. A series of experiments shows that this implementation causes no growth of the computational time and decreases the amount of memory needed for the computations. This research covers the Intel? Direct Sparse Solver for Clusters, the software that implements a direct method for solving the Ax = b equation with sparse symmetric matrix A on a cluster. This method, researched by Intel, is based on Cholesky decomposition and could be considered as extension of functionality PARDISO from Intel??MKL. To achieve an efficient work balance on a large number of processes, the so-called “multifrontal” approach to Cholesky decomposition is implemented. This software implements parallelization that is based on nodes of the dependency tree and uses MPI, as well as parallelization inside a node of the tree that uses OpenMP directives. The article provides a high-level description of the algorithm to distribute the work between both computational nodes and cores within a single node, and between different computational nodes. A series of experiments shows that this implementation causes no growth of the computational time and decreases the amount of memory needed for the computations.

作者 Alexander Kalinkin Konstantin Arturov

机构地区 ZAO Intel/AO

出处《Applied Mathematics》 2013年第12期33-39,共7页 应用数学（英文）

关键词 Direct SOLVER Distributed Data OPENMP and MPI Direct Solver Distributed Data OpenMP and MPI

分类号 TP39 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

引证文献1

1Alexander Kalinkin,Anton Anders,Roman Anders.Schur Complement Computations in Intel^(■) Math Kernel Library PARDISO[J].Applied Mathematics,2015,6(2):304-311. 被引量：2

二级引证文献2

1任政勇,陈超健,汤井田,周峰,陈煌,邱乐稳,胡双贵.一种新的三维大地电磁积分方程正演方法[J].地球物理学报,2017,60(11):4506-4515. 被引量：30
2Zhou Feng,Tang Jing-Tian,Ren Zheng-Yong,Zhang Zhi-Yong,Chen Huang,Huang Xiang-Yu,Zhong Yi-Yuan.A hybrid finite-element and integral-equation method for forward modeling of 3D controlled-source electromagnetic induction[J].Applied Geophysics,2018,15(3):536-544. 被引量：6

1Wlodek M. Zuberek.Timed Petri Net Models of Shared-Memory Bus-Based Multiprocessors[J].Journal of Computer and Communications,2018,6(10):1-14. 被引量：1
2Kyumann Im,Woonchul Ham.Analysis and Programming of Kernel for Embedded Systems[J].Journal of Software Engineering and Applications,2014,7(1):14-26.
3Anthony Y. Aidoo,Kwasi Baah Gyamfi,Joseph Ackora-Prah,Francis T. Oduro.Solvability of Inverse Eigenvalue Problem for Dense Singular Symmetric Matrices[J].Advances in Pure Mathematics,2013,3(1):14-19. 被引量：1
4Alexander Kalinkin,Anton Anders,Roman Anders.Schur Complement Computations in Intel^(■) Math Kernel Library PARDISO[J].Applied Mathematics,2015,6(2):304-311. 被引量：2
5苏锦钿,欧阳志凡,余珊珊.基于依存树及距离注意力的句子属性情感分类[J].计算机研究与发展,2019,56(8):1731-1745. 被引量：12
6Mitsuhiro Kashiwagi.Derivative of a Determinant with Respect to an Eigenvalue in the Modified Cholesky Decomposition of a Symmetric Matrix, with Applications to Nonlinear Analysis[J].American Journal of Computational Mathematics,2014,4(2):93-103.
7Alexander Kalinkin,Anton Anders,Roman Anders.Intel^(■) Math Kernel Library PARDISO* forIntel^(■) Xeon Phi^(TM) Manycore Coprocessor[J].Applied Mathematics,2015,6(8):1276-1281.
8Achiya Dax.Low-Rank Positive Approximants of Symmetric Matrices[J].Advances in Linear Algebra & Matrix Theory,2014,4(3):172-185.
9Rahul Shrimali,Hemal Shah,Riya Chauhan.Proposed Caching Scheme for Optimizing Trade-off between Freshness and Energy Consumption in Name Data Networking Based IoT[J].Advances in Internet of Things,2017,7(2):11-24. 被引量：1
10D. Ganeshwar Rao,C. Patvardhan,Ranjit Singh.Intelligent Tool Management Strategies for Automated Manufacturing Systems[J].Intelligent Control and Automation,2011,2(4):405-412. 被引量：3

Applied Mathematics

2013年第12期

浏览历史

内容加载中请稍等...

Asynchronous Approach to Memory Management in Sparse Multifrontal Methods on Multiprocessors 被引量：1

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史