Parallel and fully recursive multifrontal sparse Cholesky

被引:21
作者
Irony, D [1 ]
Shklarski, G [1 ]
Toledo, S [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2004年 / 20卷 / 03期
关键词
sparse Cholesky factorization; parallel Cholesky factorization; multifrontal factorizations; Cilk; recursive factorizations; block layouts; recursive layouts;
D O I
10.1016/j.future.2003.07.007
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We describe the design, implementation, and performance of a new parallel sparse Cholesky factorization code. The code uses a multifrontal factorization strategy. Operations on small dense submatrices are performed using new dense matrix subroutines that are part of the code, although the code can also use the BLAs and LAPACK. The new code is recursive at both the sparse and the dense levels, it uses a novel recursive data layout for dense submatrices, and it is parallelized using Cilk, an extension of C specifically designed to parallelize recursive codes. We demonstrate that the new code performs well and scales well on SMPs. In particular, on up to 16 processors, the code outperforms two state-of-the-art message-passing codes. The scalability and high performance that the code achieves imply that recursive schedules, blocked data layouts, and dynamic scheduling are effective in the implementation of sparse factorization codes. (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:425 / 440
页数:16
相关论文
共 40 条
[1]   EXPLOITING FUNCTIONAL PARALLELISM OF POWER2 TO DESIGN HIGH-PERFORMANCE NUMERICAL ALGORITHMS [J].
AGARWAL, RC ;
GUSTAVSON, FG ;
ZUBAIR, M .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1994, 38 (05) :563-576
[2]   IMPROVING PERFORMANCE OF LINEAR ALGEBRA ALGORITHMS FOR DENSE MATRICES, USING ALGORITHMIC PREFETCH [J].
AGARWAL, RC ;
GUSTAVSON, FG ;
ZUBAIR, M .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1994, 38 (03) :265-275
[3]   A fully asynchronous multifrontal solver using distributed dynamic scheduling [J].
Amestoy, PR ;
Duff, IS ;
L'Excellent, JY ;
Koster, J .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2001, 23 (01) :15-41
[4]  
AMESTOY PR, 2000, MULTIFRONTAL MASSIVE
[5]   A recursive formulation of Cholesky factorization of a matrix in packed storage [J].
Andersen, BS ;
Wasniewski, J ;
Gustavson, FG .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2001, 27 (02) :214-244
[6]  
[Anonymous], 1997, INT C SUP
[7]  
[Anonymous], AUTOMATICALLY TUNED
[8]   THE INFLUENCE OF RELAXED SUPERNODE PARTITIONS ON THE MULTIFRONTAL METHOD [J].
ASHCRAFT, C ;
GRIMES, R .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1989, 15 (04) :291-309
[9]  
Blackford L. S., 1997, ScaLAPACK user's guide
[10]   Scheduling multithreaded computations by work stealing [J].
Blumofe, RD ;
Leiserson, CE .
JOURNAL OF THE ACM, 1999, 46 (05) :720-748