A MULTILEVEL PARALLEL SOLVER FOR BLOCK TRIDIAGONAL AND BANDED LINEAR-SYSTEMS

被引：12

作者：

HAJJ, IN

SKELBOE, S

机构：

[1] UNIV ILLINOIS,DEPT ELECT & COMP ENGN,URBANA,IL 61801

[2] UNIV COPENHAGEN,DEPT COMP SCI,DK-2100 COPENHAGEN,DENMARK

来源：

PARALLEL COMPUTING | 1990年 / 15卷 / 1-3期

关键词：

Complexity analysis; Execution times; Intel iPSC; Linear algebra; Message passing multiprocessor; Multilevel LU-factorization; Speed-up;

D O I：

10.1016/0167-8191(90)90029-9

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper describes an efficient algorithm for the parallel solution of systems of linear equations with a block tridiagonal coefficient matrix. The algorithm comprises a multilevel LU-factorization based on block cyclic reduction and a corresponding solution algorithm. The paper includes a general presentation of the parallel multilevel LU-factorization and solution algorithms, but the main emphasis is on implementation principles for a message passing computer with hypercube topology. Problem partitioning, processor allocation and communication requirement are discussed for the general block tridiagonal algorithm. Band matrices can be cast into block tridiagonal form, and this special but important problem is dealt with in detail. It is demonstrated how the efficiency of the general block tridiagonal multilevel algorithm can be improved by introducing the equivalent of two-way Gaussian elimination for the first and the last partitioning and by carefully balancing the load of the processors. The presentation of the multilevel band solver is accompanied by detailed complexity analyses. The properties of the parallel band solver were evaluated by implementing the algorithm on an Intel iPSC hypercube parallel computer and solving a larger number of banded linear equations using 2 to 32 processors. The results of the evaluation include speed-up over a sequential processor, and the measure values are in good agreement with the theoretical values resulting from complexity analysis. It is found that the maximum asymptotic speed-up of the multilevel LU-factorization using p processors and load balancing is approximated well by the expression (p +6)/4. Finally, the multilevel parallel solver is compared with solvers based on row and column interleaved organization. © 1990.

引用

页码：21 / 45

页数：25

共 29 条

[21] GOLIATH - A SOFTWARE SYSTEM FOR THE EXACT ANALYSIS OF RECTANGULAR RANK-DEFICIENT SPARSE RATIONAL LINEAR-SYSTEMS
ALFELD, P
EYRE, DJ
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1991, 17 (04): : 519 - 532
[22] A Parallel Strategy for Solving Sparse Linear Systems over Finite Fields
Rivera-Zamarripa, Luis
Adj, Gora
Aguilar-Ibanez, Carlos
Cruz-Cortes, Nareli
Rodriguez-Henriquez, Francisco
COMPUTACION Y SISTEMAS, 2022, 26 (01): : 493 - 504
[23] Parallel Iterative Schemes of Linear Algebra with Application to the Stability Analysis of Solutions of Systems of Linear Differential Equations
Ya. E. Romm
Cybernetics and Systems Analysis, 2004, 40 (4) : 565 - 586
[24] Scalable preconditioning of block-structured linear algebra systems using ADMM
Rodriguez, Jose S.
Laird, Carl D.
Zavala, Victor M.
COMPUTERS & CHEMICAL ENGINEERING, 2020, 133
[25] Aggregation of clans to speed-up solving linear systems on parallel architectures
Zaitsev, Dmitry A.
Shmeleva, Tatiana R.
Luszczek, Piotr
INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2022, 37 (02) : 198 - 219
[26] A hybrid GMRES/LS-Arnoldi method to accelerate the parallel solution of linear systems
He, Haiwu
Bergere, G.
Petiton, S.
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2006, 51 (11) : 1647 - 1662
[27] Parallel Sub-Structuring Methods for solving Sparse Linear Systems on a cluster of GPU
Ahamed, Abal-Kassim Cheik
Magoules, Frederic
2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 121 - 128
[28] An Efficient Task-Based Execution Model for Stochastic Linear Solver on Multi-Core and Many-Core Systems
Ye, Fan
Calvin, Christophe
Petiton, Serge
2015 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2015, : 200 - 207
[29] A Distributed and Parallel Asynchronous Unite and Conquer Method to Solve Large Scale Non-Hermitian Linear Systems
Wu, Xinzhe
Petiton, Serge G.
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2018), 2018, : 36 - 46

← 1 2 3 →