Multithreaded Multifrontal Sparse Cholesky Factorization Using Threading Building Blocks

被引:1
|
作者
Povelikin, Rostislav [1 ]
Lebedev, Sergey [1 ]
Meyerov, Iosif [1 ]
机构
[1] Lobachevsky State Univ Nizhni Novgorod, Nizhnii Novgorod, Russia
来源
SUPERCOMPUTING (RUSCDAYS 2019) | 2019年 / 1129卷
关键词
Sparse direct methods; Multifrontal method; Parallel computing; High performance computing; Threading building blocks; SOLVER;
D O I
10.1007/978-3-030-36592-9_7
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The multifrontal method is a well-established approach to parallel sparse direct solvers of linear algebraic equations systems with sparse symmetric positive-definite matrices. This paper discusses the approaches and challenges of scalable parallel implementation of the numerical phase of the multifrontal method for shared memory systems based on high-end server CPUs with dozens of cores. The commonly used parallelization schemes are often guided by an elimination tree, containing information about dependencies between logical tasks in a computational loop of the method. We consider a dynamic two-level scheme for the organization of parallel computations. This scheme employs the task-based model with dynamic switching between solving relatively small tasks in parallel and using parallel functions of BLAS for relatively large tasks. There are several problems with the implementation of this scheme, including time-consuming synchronizations and the need for smart memory management. We found a way to improve performance and scaling efficiency using the model of parallelism and memory management tools from the Threading Building Blocks library. Experiments on large symmetric matrices from the SuiteSparse Matrix Collection show that our implementation is competitive with the commercial direct sparse solver Intel MKL PARDISO.
引用
收藏
页码:75 / 86
页数:12
相关论文
共 27 条
  • [21] Finding optimal ordering of sparse matrices for column-oriented parallel Cholesky factorization
    Lin, WY
    JOURNAL OF SUPERCOMPUTING, 2003, 24 (03): : 259 - 277
  • [22] FSCHOL: An OpenCL-based HPC Framework for Accelerating Sparse Cholesky Factorization on FPGAs
    Tavakoli, Erfan Bank
    Riera, Michael
    Quraishi, Masudul Hassan
    Ren, Fengbo
    2021 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2021), 2021, : 209 - 220
  • [23] Finding Optimal Ordering of Sparse Matrices for Column-Oriented Parallel Cholesky Factorization
    Wen-Yang Lin
    The Journal of Supercomputing, 2003, 24 : 259 - 277
  • [24] A Task Tree Executor Architecture Based on Intel Threading Building Blocks
    Popovic, Miroslav
    Djukic, Miodrag
    Marinkovic, Vladimir
    Vranic, Nikola
    2012 IEEE 19TH INTERNATIONAL CONFERENCE AND WORKSHOPS ON ENGINEERING OF COMPUTER BASED SYSTEMS (ECBS), 2012, : 201 - 209
  • [25] Geostatistical Modeling and Prediction Using Mixed-Precision Tile Cholesky Factorization
    Abdulah, Sameh
    Ltaief, Hatem
    Sun, Ying
    Genton, Marc G.
    Keyes, David E.
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 152 - 162
  • [26] Tall-and-Skinny QR Factorization for Clusters of GPUs Using High-Performance Building Blocks
    Tomas, Andres E.
    Quintana-Orti, Enrique S.
    EURO-PAR 2023: PARALLEL PROCESSING WORKSHOPS, PT I, EURO-PAR 2023, 2024, 14351 : 306 - 317
  • [27] Sparse factorization of finite element matrices using overlapped localizing solution modes
    Choi, J. -S.
    Adams, R. J.
    Canning, F. X.
    MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 2008, 50 (04) : 1050 - 1054