Analysis and comparison of two general sparse solvers for distributed memory computers

被引:41
|
作者
Amestoy, PR
Duff, IS
L'Excellent, JY
Li, XS
机构
[1] ENSEEIHT IRIT, F-31071 Toulouse, France
[2] CERFACS, F-31527 Toulouse 1, France
[3] Ecole Normale Super Lyon, LIP, F-69364 Lyon 07, France
[4] Univ Calif Berkeley, Lawrence Berkeley Lab, NERSC, Berkeley, CA 94720 USA
[5] Rutherford Appleton Lab, F-31527 Toulouse 1, France
来源
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE | 2001年 / 27卷 / 04期
关键词
algorithms; performance; sparse direct solvers; parallelism; distributed-memory computers; multifrontal and supernodal factorizations;
D O I
10.1145/504210.504212
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper provides a comprehensive study and comparison of two state-of-the-art direct solvers for large sparse sets of linear equations on large-scale distributed. memory computers. One is a multifrontal solver called MUMPS, the other is a supernodal solver called SuperLU. We describe the main algorithmic features of the two solvers and compare their performance characteristics with respect to uniprocessor speed, interprocessor communication, and memory requirements. For both solvers, preorderings for numerical stability and sparsity play an important role in achieving high parallel efficiency. We analyse the results with various ordering algorithms. Our performance analysis is based on data obtained from runs on a 512-processor Cray T3E using a set of matrices from real applications. We also use regular 3D grid problems to study the scalability of the two solvers.
引用
收藏
页码:388 / 421
页数:34
相关论文
共 50 条
  • [31] Scalability Analysis of Memory Consistency Models in NoC-based Distributed Shared Memory SoCs
    Naeem, Abdul
    Jantsch, Axel
    Lu, Zhonghai
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2013, 32 (05) : 760 - 773
  • [32] Distributed-memory tensor completion for generalized loss functions in python']python using new sparse tensor kernels
    Singh, Navjot
    Zhang, Zecheng
    Wu, Xiaoxiao
    Zhang, Naijing
    Zhang, Siyuan
    Solomonik, Edgar
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 169 : 269 - 285
  • [33] Comparison of distributed memory algorithms for X-ray wave propagation in inhomogeneous media
    Ali, Sajid
    Du, Ming
    Adams, Mark F.
    Smith, Barry
    Jacobsen, Chris
    OPTICS EXPRESS, 2020, 28 (20): : 29590 - 29618
  • [34] A comparison of the shared-memory parallel programming models OpenMP, OpenACC and Kokkos in the context of implicit solvers for high-order FEM
    Eichstadt, Jan
    Vymazal, Martin
    Moxey, David
    Peiro, Joaquim
    COMPUTER PHYSICS COMMUNICATIONS, 2020, 255
  • [35] Qualitative and quantitative analysis and comparison of Java']Java distributed architectures
    Rozman, Ivan
    Juric, Matjaz B.
    Golob, Izidor
    Hericko, Marjan
    SOFTWARE-PRACTICE & EXPERIENCE, 2006, 36 (14) : 1543 - 1562
  • [36] A distributed-memory parallel technique for two-dimensional mesh generation for arbitrary domains
    Freitas, Markos O.
    Wawrzynek, Paul A.
    Cavalcante-Neto, Joaquim B.
    Vidal, Creto A.
    Martha, Luiz F.
    Ingraffea, Anthony R.
    ADVANCES IN ENGINEERING SOFTWARE, 2013, 59 : 38 - 52
  • [37] Comparison of Two Hyperparameter-Free Sparse Signal Processing Methods for Direction-of-Arrival Tracking in the HF97 Ocean Acoustic Experiment
    Das, Anup
    Zachariah, Dave
    Stoica, Petre
    IEEE JOURNAL OF OCEANIC ENGINEERING, 2018, 43 (03) : 725 - 734
  • [38] Two-dimensional indexing to provide one-integrated-memory view of distributed memory for a massively-parallel search engine
    Yun, Tae-Seob
    Whang, Kyu-Young
    Kwon, Hyuk-Yoon
    Kim, Jun-Sung
    Song, Il-Yeol
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (06): : 2437 - 2467
  • [39] RUN-TIME LOAD BALANCING SUPPORT FOR A PARALLEL MULTIBLOCK EULER NAVIER-STOKES CODE WITH ADAPTIVE REFINEMENT ON DISTRIBUTED-MEMORY COMPUTERS
    DEKEYSER, J
    LUST, K
    ROOSE, D
    PARALLEL COMPUTING, 1994, 20 (08) : 1069 - 1088
  • [40] Computerised working memory training in healthy adults: A comparison of two different training schedules
    Penner, Iris-Katharina
    Vogt, Annamarie
    Stoecklin, Markus
    Gschwind, Leo
    Opwis, Klaus
    Calabrese, Pasquale
    NEUROPSYCHOLOGICAL REHABILITATION, 2012, 22 (05) : 716 - 733