A parallel out-of-core multifrontal method: Storage of factors on disk and analysis of models for an out-of-core active memory

Cited by: 13
Authors
Agullo, Emmanuel [1]
Guermouche, Abdou [2]
L'Excellent, Jean-Yves [1]
Affiliations
[1] UCBL, INRIA, ENS Lyon, CNRS, UMR, Lab Informat Parallelisme, F-69364 Lyon, France
[2] UMR 5800, Lab Bordelais Rech Informat, F-33405 Talence, France
Keywords
sparse direct solvers; parallel multifrontal method; out-of-core
DOI
10.1016/j.parco.2008.03.007
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The memory usage of sparse direct solvers can be the bottleneck when solving large sparse systems of linear equations of the form Ax = b. In order to solve large problems, we have designed a robust out-of-core solver, in which computed factors are stored on disk. We use large real-life problems (up to several million equations and several hundred million nonzeros) to show that we can significantly reduce the core memory usage in parallel (on up to 128 processors), with a time performance comparable to that of a parallel in-core solver. A careful study shows how the low-level I/O mechanisms impact the performance. We describe a low-level I/O layer that avoids the perturbations introduced by system buffers and allows consistently good performance results. To go significantly further in the memory reduction, it is worthwhile to also store the intermediate working memory on disk. In this paper we describe algorithmic models to address this issue, and study their potential in terms of both memory requirements and I/O volume. The out-of-core solver discussed in this paper is publicly available and already used by several academic and industrial groups. The results of the algorithmic modelling will be the basis for designing a new version of this solver; this work may also be a useful reference for other developers of sparse out-of-core solvers. © 2008 Elsevier B.V. All rights reserved.
Pages: 296-317
Number of pages: 22