A FACTORED SPARSE APPROXIMATE INVERSE PRECONDITIONED CONJUGATE GRADIENT SOLVER ON GRAPHICS PROCESSING UNITS

被引:16
作者
Bernaschi, Massimo [1 ]
Bisson, Mauro [1 ]
Fantozzi, Carlo [2 ]
Janna, Carlo [3 ]
机构
[1] CNR, Inst Comp Applicat, Rome, Italy
[2] Univ Padua, Dept Informat Engn, Via Gradenigo 6-b, I-35131 Padua, Italy
[3] Univ Padua, Dept ICEA, Via Trieste 63, I-35121 Padua, Italy
关键词
preconditioning; approximate inverses; parallel computing; iterative methods; MATRIX-VECTOR MULTIPLICATION; GPU; ALGORITHM; PATTERNS; FORMAT;
D O I
10.1137/15M1027826
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Graphics Processing Units (GPUs) exhibit significantly higher peak performance than conventional CPUs. However, in general only highly parallel algorithms can exploit their potential. In this scenario, the iterative solution to sparse linear systems of equations could be carried out quite efficiently on a GPU as it requires only matrix-by-vector products, dot products, and vector updates. However, to be really effective, any iterative solver needs to be properly preconditioned and this represents a major bottleneck for a successful GPU implementation. Due to its inherent parallelism, the factored sparse approximate inverse (FSAI) preconditioner represents an optimal candidate for the conjugate gradient-like solution of sparse linear systems. However, its GPU implementation requires a nontrivial recasting of multiple computational steps. We present our GPU version of the FSAI preconditioner along with a set of results that show how a noticeable speedup with respect to a highly tuned CPU counterpart is obtained.
引用
收藏
页码:C53 / C72
页数:20
相关论文
共 41 条
  • [1] Anderson M. J., 2012, P 26 INT PAR DISTR P
  • [2] [Anonymous], 2015, CUSPARSE LIB DU 0670
  • [3] [Anonymous], 2008, NVIDIA Technical Report NVR-2008-004
  • [4] [Anonymous], 2014, IFIP INT C NETWORK P
  • [5] [Anonymous], 2015, CUDA C PROGRAMMING G
  • [6] MINIMIZING COMMUNICATION IN NUMERICAL LINEAR ALGEBRA
    Ballard, Grey
    Demmel, James
    Holtz, Olga
    Schwartz, Oded
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2011, 32 (03) : 866 - 901
  • [7] A comparative study of sparse approximate inverse preconditioners
    Benzi, M
    Tuma, M
    [J]. APPLIED NUMERICAL MATHEMATICS, 1999, 30 (2-3) : 305 - 340
  • [8] A sparse approximate inverse preconditioner for the conjugate gradient method
    Benzi, M
    Meyer, CD
    Tuma, M
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1996, 17 (05) : 1135 - 1149
  • [9] Approximate Inverse Preconditioners for Krylov Methods on Heterogeneous Parallel Computers
    Bertaccini, Daniele
    Filippone, Salvatore
    [J]. PARALLEL COMPUTING: ACCELERATING COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, 25 : 183 - 192
  • [10] Sparse matrix solvers on the GPU:: Conjugate gradients and multigrid
    Bolz, J
    Farmer, I
    Grinspun, E
    Schröder, P
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03): : 917 - 924