A MULTILEVEL PARALLEL SOLVER FOR BLOCK TRIDIAGONAL AND BANDED LINEAR-SYSTEMS

被引:12
|
作者
HAJJ, IN
SKELBOE, S
机构
[1] UNIV ILLINOIS,DEPT ELECT & COMP ENGN,URBANA,IL 61801
[2] UNIV COPENHAGEN,DEPT COMP SCI,DK-2100 COPENHAGEN,DENMARK
关键词
Complexity analysis; Execution times; Intel iPSC; Linear algebra; Message passing multiprocessor; Multilevel LU-factorization; Speed-up;
D O I
10.1016/0167-8191(90)90029-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes an efficient algorithm for the parallel solution of systems of linear equations with a block tridiagonal coefficient matrix. The algorithm comprises a multilevel LU-factorization based on block cyclic reduction and a corresponding solution algorithm. The paper includes a general presentation of the parallel multilevel LU-factorization and solution algorithms, but the main emphasis is on implementation principles for a message passing computer with hypercube topology. Problem partitioning, processor allocation and communication requirement are discussed for the general block tridiagonal algorithm. Band matrices can be cast into block tridiagonal form, and this special but important problem is dealt with in detail. It is demonstrated how the efficiency of the general block tridiagonal multilevel algorithm can be improved by introducing the equivalent of two-way Gaussian elimination for the first and the last partitioning and by carefully balancing the load of the processors. The presentation of the multilevel band solver is accompanied by detailed complexity analyses. The properties of the parallel band solver were evaluated by implementing the algorithm on an Intel iPSC hypercube parallel computer and solving a larger number of banded linear equations using 2 to 32 processors. The results of the evaluation include speed-up over a sequential processor, and the measure values are in good agreement with the theoretical values resulting from complexity analysis. It is found that the maximum asymptotic speed-up of the multilevel LU-factorization using p processors and load balancing is approximated well by the expression (p +6)/4. Finally, the multilevel parallel solver is compared with solvers based on row and column interleaved organization. © 1990.
引用
收藏
页码:21 / 45
页数:25
相关论文
共 29 条
  • [21] GOLIATH - A SOFTWARE SYSTEM FOR THE EXACT ANALYSIS OF RECTANGULAR RANK-DEFICIENT SPARSE RATIONAL LINEAR-SYSTEMS
    ALFELD, P
    EYRE, DJ
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1991, 17 (04): : 519 - 532
  • [22] A Parallel Strategy for Solving Sparse Linear Systems over Finite Fields
    Rivera-Zamarripa, Luis
    Adj, Gora
    Aguilar-Ibanez, Carlos
    Cruz-Cortes, Nareli
    Rodriguez-Henriquez, Francisco
    COMPUTACION Y SISTEMAS, 2022, 26 (01): : 493 - 504
  • [24] Scalable preconditioning of block-structured linear algebra systems using ADMM
    Rodriguez, Jose S.
    Laird, Carl D.
    Zavala, Victor M.
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 133
  • [25] Aggregation of clans to speed-up solving linear systems on parallel architectures
    Zaitsev, Dmitry A.
    Shmeleva, Tatiana R.
    Luszczek, Piotr
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2022, 37 (02) : 198 - 219
  • [26] A hybrid GMRES/LS-Arnoldi method to accelerate the parallel solution of linear systems
    He, Haiwu
    Bergere, G.
    Petiton, S.
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2006, 51 (11) : 1647 - 1662
  • [27] Parallel Sub-Structuring Methods for solving Sparse Linear Systems on a cluster of GPU
    Ahamed, Abal-Kassim Cheik
    Magoules, Frederic
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 121 - 128
  • [28] An Efficient Task-Based Execution Model for Stochastic Linear Solver on Multi-Core and Many-Core Systems
    Ye, Fan
    Calvin, Christophe
    Petiton, Serge
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2015, : 200 - 207
  • [29] A Distributed and Parallel Asynchronous Unite and Conquer Method to Solve Large Scale Non-Hermitian Linear Systems
    Wu, Xinzhe
    Petiton, Serge G.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2018), 2018, : 36 - 46