Deciding Non-Compressible Blocks in Sparse Direct Solvers using Incomplete Factorization

被引:0
作者
Korkmaz, Esragul [1 ]
Faverge, Mathieu [1 ]
Ramet, Pierre [1 ]
Pichon, Gregoire [2 ]
机构
[1] Univ Bordeaux, UMR 5800, CNRS, Bordeaux INP,Inria, Talence, France
[2] Univ Lyon, EnsL, UCBL, CNRS,Inria,LIP, Lyon, France
来源
2021 IEEE 28TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2021) | 2021年
关键词
sparse direct solvers; low-rank compression; ILU factorization; PERFORMANCE; MATRICES;
D O I
10.1109/HiPC53243.2021.00024
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Low-rank compression techniques are very promising for reducing memory footprint and execution time on a large spectrum of linear solvers. Sparse direct supernodal approaches are one of these techniques. However, despite providing a very good scalability and reducing the memory footprint, they suffer from an important flops overhead in their unstructured low-rank updates. As a consequence, the execution time is not improved as expected. In this paper, we study a solution to improve low-rank compression techniques in sparse supernodal solvers. The proposed method tackles the overprice of the low-rank updates by identifying the blocks that have poor compression rates. We show that the fill-in levels of the graph based block incomplete LU factorization can be used in a new context to identify most of these non-compressible blocks at low cost. This identification enables to postpone the low-rank compression step to trade small extra memory consumption for a better time to solution. The solution is validated within the PASTIX library with a large set of application matrices. It demonstrates sequential and multi-threaded speedup up to 8.5x, for small memory overhead of less than 1.49x with respect to the original version.
引用
收藏
页码:101 / 110
页数:10
相关论文
共 28 条
  • [1] ON THE COMPLEXITY OF THE BLOCK LOW-RANK MULTIFRONTAL FACTORIZATION
    Amestoy, Patrick
    Buttari, Alfredo
    L'Excellent, Jean-Yves
    Mary, Theo
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (04) : A1710 - A1740
  • [2] IMPROVING MULTIFRONTAL METHODS BY MEANS OF BLOCK LOW-RANK REPRESENTATIONS
    Amestoy, Patrick
    Ashcraft, Cleve
    Boiteau, Olivier
    Buttari, Alfredo
    L'Excellent, Jean-Yves
    Weisbecker, Clement
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2015, 37 (03) : A1451 - A1474
  • [3] BRIDGING THE GAP BETWEEN FLAT AND HIERARCHICAL LOW-RANK MATRIX FORMATS: THE MULTILEVEL BLOCK LOW-RANK FORMAT
    Amestoy, Patrick R.
    Buttari, Alfredo
    L'Excellent, Jean-Yves
    Mary, Theo A.
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2019, 41 (03) : A1414 - A1442
  • [4] A fast, memory efficient and robust sparse preconditioner based on a multifrontal approach with applications to finite-element matrices
    Aminfar, AmirHossein
    Darve, Eric
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2016, 107 (06) : 520 - 540
  • [5] ParILUT - A Parallel Threshold ILU for GPUs
    Anzt, Hartwig
    Ribizel, Tobias
    Flegar, Goran
    Chow, Edmond
    Dongarra, Jack
    [J]. 2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 231 - 241
  • [6] Bollhofer M., APPL NUMER MATH, V162, P2021
  • [7] Chadwick J. N., 2015, ABS150705593 CORR
  • [8] An unsymmetric-pattern multifrontal method for sparse LU factorization
    Davis, TA
    Duff, IS
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1997, 18 (01) : 140 - 158
  • [9] Davis Timothy A., ACM T MATH SOFTWARE, V38
  • [10] DONGARRA JJ, 1990, ACM T MATH SOFTWARE, V16, P1, DOI 10.1145/77626.79170