Multithreaded Multifrontal Sparse Cholesky Factorization Using Threading Building Blocks

Cited by: 1
Authors
Povelikin, Rostislav [1]
Lebedev, Sergey [1]
Meyerov, Iosif [1]
Affiliations
[1] Lobachevsky State Univ Nizhni Novgorod, Nizhnii Novgorod, Russia
Source
SUPERCOMPUTING (RUSCDAYS 2019) | 2019 / Vol. 1129
Keywords
Sparse direct methods; Multifrontal method; Parallel computing; High performance computing; Threading building blocks; SOLVER;
DOI
10.1007/978-3-030-36592-9_7
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
The multifrontal method is a well-established approach to building parallel sparse direct solvers for systems of linear algebraic equations with sparse symmetric positive-definite matrices. This paper discusses the approaches to, and challenges of, a scalable parallel implementation of the numerical phase of the multifrontal method on shared-memory systems based on high-end server CPUs with dozens of cores. Commonly used parallelization schemes are often guided by an elimination tree, which encodes the dependencies between logical tasks in the computational loop of the method. We consider a dynamic two-level scheme for organizing parallel computations. This scheme uses a task-based model that switches dynamically between solving relatively small tasks in parallel and calling parallel BLAS functions for relatively large tasks. Implementing this scheme raises several problems, including time-consuming synchronization and the need for careful memory management. We found a way to improve performance and scaling efficiency using the parallelism model and memory management tools of the Threading Building Blocks library. Experiments on large symmetric matrices from the SuiteSparse Matrix Collection show that our implementation is competitive with the commercial direct sparse solver Intel MKL PARDISO.
Pages: 75-86
Page count: 12
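The two-level scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation and does not use Threading Building Blocks: it is a simplified, wave-by-wave stand-in written with Python's standard thread pool. The function name `schedule_fronts`, the `front_size` list, the `threshold` parameter, and the placeholder `factor_front` are all hypothetical names introduced here for illustration. The sketch assumes the elimination tree is given as a parent array (root marked with -1) and that a front counts as "large" once its size reaches the threshold.

```python
from concurrent.futures import ThreadPoolExecutor


def schedule_fronts(parent, front_size, threshold, workers=4):
    """Process an elimination tree bottom-up with a two-level policy.

    parent[v]     -- parent of node v in the elimination tree, -1 for the root
    front_size[v] -- hypothetical cost measure of the frontal matrix at node v
    threshold     -- fronts at or above this size are treated as "large"
    Returns the processing order and the mode chosen for each node.
    """
    n = len(parent)
    # pending[v] counts the children of v that are not yet factorized.
    pending = [0] * n
    for p in parent:
        if p >= 0:
            pending[p] += 1
    ready = [v for v in range(n) if pending[v] == 0]  # leaves first
    order, mode = [], {}

    def factor_front(v, threads):
        # Placeholder: a real solver would assemble the frontal matrix of v
        # and run a dense partial Cholesky here, using `threads` threads.
        return v

    with ThreadPoolExecutor(max_workers=workers) as pool:
        while ready:
            small = [v for v in ready if front_size[v] < threshold]
            large = [v for v in ready if front_size[v] >= threshold]
            # Level 1: independent small fronts run as parallel tasks,
            # each one using a single thread.
            done = list(pool.map(lambda v: factor_front(v, 1), small))
            for v in small:
                mode[v] = "task"
            # Level 2: large fronts run one at a time, with all threads
            # assumed to be handed to a parallel BLAS call.
            for v in large:
                factor_front(v, workers)
                mode[v] = "blas"
                done.append(v)
            order.extend(done)
            # A parent becomes ready once all of its children are done.
            ready = []
            for v in done:
                p = parent[v]
                if p >= 0:
                    pending[p] -= 1
                    if pending[p] == 0:
                        ready.append(p)
    return order, mode


# Small example tree: nodes 0,1 feed node 2; nodes 3,4 feed node 5;
# nodes 2,5 feed the root 6, which has the only large front.
parent = [2, 2, 6, 5, 5, 6, -1]
front_size = [4, 4, 8, 4, 4, 8, 64]
order, mode = schedule_fronts(parent, front_size, threshold=16)
```

The sketch synchronizes after every wave of ready nodes, which is exactly the kind of synchronization cost the abstract identifies as a problem; a dynamic task scheduler would instead release each parent as soon as its own children finish.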