Exploiting mesh structure to improve multigrid performance for saddle-point problems

被引:0
作者
Spies, Lukas [1 ]
Olson, Luke [1 ]
MacLachlan, Scott [2 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] Mem Univ Newfoundland, St John, NF, Canada
基金
加拿大自然科学与工程研究理事会; 美国国家科学基金会;
关键词
Monolithic multigrid; GPU performance; relaxation scheme; Vanka; Braess-Sarazin; Schur-Uzawa; block-triangular preconditioners; STOKES; SMOOTHERS;
D O I
10.1177/10943420241261989
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, solvers for finite-element discretizations of linear or linearized saddle-point problems, like the Stokes and Oseen equations, have become well established. There are two main classes of preconditioners for such systems: those based on a block-factorization approach and those based on monolithic multigrid. Both classes of preconditioners have several critical choices to be made in their composition, such as the selection of a suitable relaxation scheme for monolithic multigrid. From existing studies, some insight can be gained as to what options are preferable in low-performance computing settings, but there are very few fair comparisons of these approaches in the literature, particularly for modern architectures, such as GPUs. In this paper, we perform a comparison between a Block-Triangular preconditioner and monolithic multigrid methods with the three most common choices of relaxation scheme - Braess-Sarazin, Vanka, and Schur-Uzawa. We develop a performant Vanka relaxation algorithm for structured-grid discretizations, which takes advantage of memory efficiencies in this setting. We detail the behavior of the various CUDA kernels for the multigrid relaxation schemes and evaluate their individual arithmetic intensity, performance, and runtime. Running a preconditioned FGMRES solver for the Stokes equations with these preconditioners allows us to compare their efficiency in a practical setting. We show that monolithic multigrid can outperform Block-Triangular preconditioning, and that using Vanka or Braess-Sarazin relaxation is most efficient. Even though multigrid with Vanka relaxation exhibits reduced performance on the CPU (up to 100% slower than Braess-Sarazin), it is able to outperform Braess-Sarazin by more than 20% on the GPU, making it a competitive algorithm, especially given the high amount of algorithmic tuning needed for effective Braess-Sarazin relaxation.
引用
收藏
页码:211 / 229
页数:19
相关论文
共 36 条
  • [1] MONOLITHIC MULTIGRID FOR A REDUCED-QUADRATURE DISCRETIZATION OF POROELASTICITY
    Adler, James H.
    He, Yunhui
    Hu, Xiaozhe
    MacLachlan, Scott
    Ohm, Peter
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2023, 45 (03) : S54 - S81
  • [2] Preconditioning a mass-conserving discontinuous Galerkin discretization of the Stokes equations
    Adler, James H.
    Benson, Thomas R.
    MacLachlan, Scott P.
    [J]. NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2017, 24 (03)
  • [3] MONOLITHIC MULTIGRID METHODS FOR TWO-DIMENSIONAL RESISTIVE MAGNETOHYDRODYNAMICS
    Adler, James H.
    Benson, Thomas R.
    Cyr, Eric C.
    Maclachlan, Scott P.
    Tuminaro, Raymond S.
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2016, 38 (01) : B1 - B24
  • [4] [Anonymous], 2014, Numerical Mathematics and Scientific Computation
  • [5] Anzt H., 2014, IMPLEMENTING SPARSE
  • [6] A Simple Preconditioner for a Discontinuous Galerkin Method for the Stokes Problem
    Ayuso de Dios, Blanca
    Brezzi, Franco
    Marini, L. Donatella
    Xu, Jinchao
    Zikatanov, Ludmil
    [J]. JOURNAL OF SCIENTIFIC COMPUTING, 2014, 58 (03) : 517 - 547
  • [7] Benzi M, 2005, ACTA NUMER, V14, P1, DOI 10.1017/S0962492904000212
  • [8] Reducing communication in algebraic multigrid with multi-step node aware communication
    Bienz, Amanda
    Gropp, William D.
    Olson, Luke N.
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 34 (05) : 547 - 561
  • [9] REDUCING PARALLEL COMMUNICATION IN ALGEBRAIC MULTIGRID THROUGH SPARSIFICATION
    Bienz, Amanda
    Falgout, Robert D.
    Gropp, William
    Olson, Luke N.
    Schroder, Jacob B.
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2016, 38 (05) : S332 - S357
  • [10] An efficient smoother for the Stokes problem
    Braess, D
    Sarazin, R
    [J]. APPLIED NUMERICAL MATHEMATICS, 1997, 23 (01) : 3 - 19