Efficient implementation of the improved quasi-minimal residual method on massively distributed memory computers

被引:0
作者
Yang, TR [1 ]
Lin, HX
机构
[1] Linkoping Univ, Dept Comp Sci, S-58183 Linkoping, Sweden
[2] Delft Univ Technol, Dept Tech Math & Comp Sci, NL-2600 GA Delft, Netherlands
来源
SOLVING IRREGULARLY STRUCTURED PROBLEMS IN PARALLEL | 1997年 / 1253卷
关键词
ALGORITHM; QMR;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
For the solutions of linear systems of equations with unsymmetric coefficient matrices, we has proposed an improved version of the quasi-minimal residual (IQMR) method by using the Lanczos process as a major component combining elements of numerical stability and parallel algorithm design. For Lanczos process, stability is obtained by a couple two-term procedure that generates Lanczos vectors scaled to unit length. The algorithm is derived such that all inner products and matrix-vector multiplications of a single iteration step are independent and communication time required for inner product can be overlapped efficiently with computation time. Therefore, the cost of global communication on parallel distributed memory computers can be significantly reduced. In this paper, we describe an efficient implementation of this method which is particularly well suited to problems with irregular sparsity pattern. The corresponding communication cost is independent of the sparsity pattern with several performance improvement techniques such as overlapping computation and communication, balancing the computational load. The performance is demonstrated by numerical experimental results carried out on massively parallel distributed memory computer Parsytec GC/PowerPlus.
引用
收藏
页码:80 / 92
页数:13
相关论文
共 15 条
[1]  
[Anonymous], KFAZAMIB9606 CENTR I
[2]  
BUCKER HM, 1996, KFAZAMIB9604 CTR I A
[3]  
BUCKER HM, 1996, P WORKSH APPL PAR CO
[4]  
de STURLER E., 1991, P 13 IMACS WORLD C C
[5]  
DESTURLER E, 1994, 832 U UTRECHT MATH I
[6]  
Dongarra J. J., 1991, Solving Linear Systems on Vector and Shared Memory Computers
[7]   AN IMPLEMENTATION OF THE LOOK-AHEAD LANCZOS-ALGORITHM FOR NON-HERMITIAN MATRICES [J].
FREUND, RW ;
GUTKNECHT, MH ;
NACHTIGAL, NM .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1993, 14 (01) :137-158
[8]   AN IMPLEMENTATION OF THE QMR METHOD BASED ON COUPLED 2-TERM RECURRENCES [J].
FREUND, RW ;
NACHTIGAL, NM .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1994, 15 (02) :313-337
[9]   QMR - A QUASI-MINIMAL RESIDUAL METHOD FOR NON-HERMITIAN LINEAR-SYSTEMS [J].
FREUND, RW ;
NACHTIGAL, NM .
NUMERISCHE MATHEMATIK, 1991, 60 (03) :315-339
[10]   AN EFFICIENT PARALLEL ALGORITHM FOR MATRIX-VECTOR MULTIPLICATION [J].
HENDRICKSON, B ;
LELAND, R ;
PLIMPTON, S .
INTERNATIONAL JOURNAL OF HIGH SPEED COMPUTING, 1995, 7 (01) :73-88