Adapting a parallel sparse direct solver to architectures with clusters of SMPs

被引:12
作者
Amestoy, PR
Duff, IS
Pralet, S
Vömel, C
机构
[1] CERFACS, F-31057 Toulouse 01, France
[2] CERFACS, Toulouse & Atlas Ctr, RAL, Didcot OX11 0QX, Oxon, England
[3] ENSEEIHT, F-31071 Toulouse 7, France
基金
英国工程与自然科学研究理事会;
关键词
sparse linear systems; MUMPS; distributed memory algorithms; task scheduling; dynamic scheduling;
D O I
10.1016/j.parco.2003.05.010
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the direct solution of general sparse linear systems baseds on a multifrontal method. The approach combines partial static scheduling of the task dependency graph during the symbolic factorization and distributed dynamic scheduling during the numerical factorization to balance the work among the processes of a distributed memory computer. We show that to address clusters of Symmetric Multi-Processor (SMP) architectures, and more generally non-uniform memory access multiprocessors, our algorithms for both the static and the dynamic scheduling need to be revisited to take account of the non-uniform cost of communication. The performance analysis on an IBM SP3 with 16 processors per SMP node and up to 128 processors shows that we can significantly reduce both the amount of inter-node communication and the solution time. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1645 / 1668
页数:24
相关论文
共 29 条
[1]   An approximate minimum degree ordering algorithm [J].
Amestoy, PR ;
Davis, TA ;
Duff, IS .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1996, 17 (04) :886-905
[2]   Multifrontal parallel distributed symmetric and unsymmetric solvers [J].
Amestoy, PR ;
Duff, IS ;
L'Excellent, JY .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2000, 184 (2-4) :501-520
[3]   Analysis and comparison of two general sparse solvers for distributed memory computers [J].
Amestoy, PR ;
Duff, IS ;
L'Excellent, JY ;
Li, XS .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2001, 27 (04) :388-421
[4]   A fully asynchronous multifrontal solver using distributed dynamic scheduling [J].
Amestoy, PR ;
Duff, IS ;
L'Excellent, JY ;
Koster, J .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2001, 23 (01) :15-41
[5]  
AMESTOY PR, 2002, UNPUB SIAM J MATRIX
[6]  
[Anonymous], NUMERICAL LINEAR ALG
[7]   ScaLAPACK: A portable linear algebra library for distributed memory computers - Design issues and performance [J].
Choi, J ;
Demmel, J ;
Dhillon, I ;
Dongarra, J ;
Ostrouchov, S ;
Petitet, A ;
Stanley, K ;
Walker, D ;
Whaley, RC .
COMPUTER PHYSICS COMMUNICATIONS, 1996, 97 (1-2) :1-15
[8]  
DAVIS T, 2002, U FLORIDA SPARSE MAT
[9]   A supernodal approach to sparse partial pivoting [J].
Demmel, JW ;
Eisenstat, SC ;
Gilbert, JR ;
Li, XYS ;
Liu, JWH .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1999, 20 (03) :720-755
[10]   A SET OF LEVEL 3 BASIC LINEAR ALGEBRA SUBPROGRAMS - MODEL IMPLEMENTATION AND TEST PROGRAMS [J].
DONGARRA, JJ ;
DUCROZ, J ;
HAMMARLING, S ;
DUFF, I .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1990, 16 (01) :18-28