Analysis and comparison of two general sparse solvers for distributed memory computers

被引:41
|
作者
Amestoy, PR
Duff, IS
L'Excellent, JY
Li, XS
机构
[1] ENSEEIHT IRIT, F-31071 Toulouse, France
[2] CERFACS, F-31527 Toulouse 1, France
[3] Ecole Normale Super Lyon, LIP, F-69364 Lyon 07, France
[4] Univ Calif Berkeley, Lawrence Berkeley Lab, NERSC, Berkeley, CA 94720 USA
[5] Rutherford Appleton Lab, F-31527 Toulouse 1, France
来源
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE | 2001年 / 27卷 / 04期
关键词
algorithms; performance; sparse direct solvers; parallelism; distributed-memory computers; multifrontal and supernodal factorizations;
D O I
10.1145/504210.504212
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper provides a comprehensive study and comparison of two state-of-the-art direct solvers for large sparse sets of linear equations on large-scale distributed. memory computers. One is a multifrontal solver called MUMPS, the other is a supernodal solver called SuperLU. We describe the main algorithmic features of the two solvers and compare their performance characteristics with respect to uniprocessor speed, interprocessor communication, and memory requirements. For both solvers, preorderings for numerical stability and sparsity play an important role in achieving high parallel efficiency. We analyse the results with various ordering algorithms. Our performance analysis is based on data obtained from runs on a 512-processor Cray T3E using a set of matrices from real applications. We also use regular 3D grid problems to study the scalability of the two solvers.
引用
收藏
页码:388 / 421
页数:34
相关论文
共 50 条
  • [21] Distributed-Memory DMRG via Sparse and Dense Parallel Tensor Contractions
    Levy, Ryan
    Solomonik, Edgar
    Clark, Bryan K.
    PROCEEDINGS OF SC20: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC20), 2020,
  • [22] A DISTRIBUTED-MEMORY RANDOMIZED STRUCTURED MULTIFRONTAL METHOD FOR SPARSE DIRECT SOLUTIONS
    Xin, Zixing
    Xia, Jianlin
    de Hoop, Maarten V.
    Cauley, Stephen
    Balakrishnan, Venkataramanan
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (04) : C292 - C318
  • [23] The Scalable Modeling System: directive-based code parallelization for distributed and shared memory computers
    Govett, M
    Hart, L
    Henderson, T
    Middlecoff, J
    Schaffer, D
    PARALLEL COMPUTING, 2003, 29 (08) : 995 - 1020
  • [24] Task-Based Sparse Hybrid Linear Solver for Distributed Memory Heterogeneous Architectures
    Agullo, Emmanuel
    Giraud, Luc
    Nakov, Stojce
    EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS, 2017, 10104 : 83 - 95
  • [25] GPU-resident sparse direct linear solvers for alternating current optimal power flow analysis
    Swirydowicz, Kasia
    Koukpaizan, Nicholson
    Ribizel, Tobias
    Goebel, Fritz
    Abhyankar, Shrirang
    Anzt, Hartwig
    Peles, Slaven
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 155
  • [26] FPGA structures with concentrated vs distributed memory for images comparison
    Geninatti, Sergio
    Gennai, Gerardo
    Roatta, Santiago
    Boemo, Eduardo
    2014 IX SOUTHERN CONFERENCE ON PROGRAMMABLE LOGIC (SPL 2014), 2014,
  • [27] A General Framework for the Design and Analysis of Sparse FIR Linear Equalizers
    Al-Abbasi, Abubakr O.
    Hamila, Ridha
    Bajwa, Waheed U.
    Al-Dhahir, Naofal
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 834 - 838
  • [28] A general class of explicit pseudo two-step RKN methods on parallel computers
    Cong, NH
    Strehmel, K
    Weiner, R
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1999, 38 (5-6) : 17 - 30
  • [29] A general approach for supporting nonblocking data structures on distributed-memory systems
    Thanh-Dang Diep
    Phuong Hoai Ha
    Fuerlinger, Karl
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 173 : 48 - 60
  • [30] A Memory Sparse Proportionate Affine Projection Algorithm for Echo Cancellation: Analysis and Simulations
    Boopalan, Senthil Murugan
    Alagala, Swarnalatha
    Ramalingam, Avudaiammal
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (03) : 3367 - 3381