Towards an accurate performance modeling of parallel sparse factorization

被引:0
作者
Laura Grigori
Xiaoye S. Li
机构
[1] Parc Club Orsay Universite,INRIA Futurs
[2] Lawrence Berkeley National Laboratory,undefined
来源
Applicable Algebra in Engineering, Communication and Computing | 2007年 / 18卷
关键词
Parallel sparse factorization; Performance modeling; Distributed parallel machine;
D O I
暂无
中图分类号
学科分类号
摘要
We present a simulation-based performance model to analyze a parallel sparse LU factorization algorithm on modern cached-based, high-end parallel architectures. We consider supernodal right-looking parallel factorization on a bi-dimensional grid of processors, that uses static pivoting. Our model characterizes the algorithmic behavior by taking into account the underlying processor speed, memory system performance, as well as the interconnect speed. The model is validated using the implementation in the SuperLU_DIST linear system solver, the sparse matrices from real application, and an IBM POWER3 parallel machine. Our modeling methodology can be adapted to study performance of other types of sparse factorizations, such as Cholesky or QR, and on different parallel machines.
引用
收藏
页码:241 / 261
页数:20
相关论文
共 19 条
  • [1] Agarwal R.C.(1994)Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms IBM J. Res. Develop. 38 563-576
  • [2] Gustavson F.G.(2000)A portable programming interface for performance evaluation on modern processors Int. J. High Perfor. Comput. Appl. 14 189-204
  • [3] Zubair M.(1997)Highly Scalable Parallel Algorithms for Sparse Matrix Factorization IEEE Trans. Parallel Distrib. Syst. 8 502-520
  • [4] Browne S.(1998)GEMM-Based Level 3 BLAS: model implementations and performance evaluation benchmark ACM Trans. Math. Softw. 24 268-302
  • [5] Dongarra J.(1998)GEMM-based level 3 BLAS: portability and optimization issues ACM Trans. Math. Softw. 24 303-316
  • [6] Garner N.(2003)SuperLU_DIST: a scalable distributed-memory sparse direct solver for unsymmetric linear systems ACM Trans. Math. Softw. 29 110-140
  • [7] Ho G.(undefined)undefined undefined undefined undefined-undefined
  • [8] Mucci P.(undefined)undefined undefined undefined undefined-undefined
  • [9] Gupta A.(undefined)undefined undefined undefined undefined-undefined
  • [10] Karypis G.(undefined)undefined undefined undefined undefined-undefined