A General Design for a Scalable MPI-GPU Multi-Resolution 2D Numerical Solver

Cited by: 8
Authors
Turchetto, Massimiliano [1]
Dal Palu, Alessandro [2]
Vacondio, Renato [1]
Affiliations
[1] Univ Parma, Engn & Architecture Dept, I-43121 Parma, Italy
[2] Univ Parma, Math Phys Comp Sci Dept, I-43121 Parma, Italy
Keywords
CUDA; multi-GPU; MPI; dynamic load balancing; Hilbert space filling curves; multi-resolution grid; shallow water equations (SWE); AMR; ADAPTIVE MESH REFINEMENT; PARALLEL; IMPLEMENTATION; CODE
DOI
10.1109/TPDS.2019.2961909
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This article presents a multi-GPU implementation of a finite-volume solver on a multi-resolution grid. The implementation offloads the computation entirely to the GPUs, and communication between GPUs is implemented through the Message Passing Interface (MPI) API. Several domain decomposition techniques were considered, and the one based on Hilbert Space Filling Curves (HSFC) showed optimal scalability. Several optimizations are introduced: one-to-one MPI communications among MPI ranks are completely masked by GPU computations on internal cells, and a novel dynamic load balancing algorithm minimizes the waiting times at global MPI synchronization barriers. This algorithm adapts the computational load of the ranks in response to dynamic changes in the execution time of blocks and in network performance; its capability to converge to a balanced computation has been shown empirically by numerical experiments. Tests exploit up to 64 GPUs and 83M cells and achieve an efficiency of 90 percent in weak scalability and 85 percent in strong scalability. The framework is general, and the results of the article can be ported to a wide range of explicit 2D partial differential equation solvers.
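The decomposition idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not describe their load balancing algorithm in detail. The sketch assumes a square grid of blocks whose side is a power of two, orders the blocks along a Hilbert curve using the standard bit-manipulation mapping, and then cuts that ordering into contiguous chunks of roughly equal cost. The function names (`hilbert_index`, `hilbert_order`, `partition_by_cost`) and the per-block cost values are hypothetical and chosen only for illustration.

```python
def hilbert_index(n, x, y):
    """Position of block (x, y) along a Hilbert curve on an n-by-n grid.

    Assumes n is a power of two and 0 <= x, y < n.
    """
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if x & s else 0
        ry = 1 if y & s else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/reflect the quadrant so the sub-curve has canonical orientation.
        if ry == 0:
            if rx == 1:
                x, y = n - 1 - x, n - 1 - y
            x, y = y, x
        s //= 2
    return d


def hilbert_order(n):
    """All blocks of an n-by-n grid, sorted by their Hilbert index."""
    blocks = [(x, y) for y in range(n) for x in range(n)]
    return sorted(blocks, key=lambda b: hilbert_index(n, b[0], b[1]))


def partition_by_cost(costs, n_ranks):
    """Assign blocks (already in Hilbert order) to ranks in contiguous chunks.

    costs[i] is an assumed per-block cost, e.g. a measured execution time;
    the total cost must be positive. Each block goes to the rank whose share
    of the total cost contains the midpoint of that block's cost interval,
    so the returned owner list is non-decreasing.
    """
    total = float(sum(costs))
    owners = []
    acc = 0.0
    for c in costs:
        mid = acc + 0.5 * c
        owners.append(min(n_ranks - 1, int(mid * n_ranks / total)))
        acc += c
    return owners


if __name__ == "__main__":
    order = hilbert_order(4)
    # Hypothetical costs: the second half of the curve is three times slower.
    costs = [1.0] * 8 + [3.0] * 8
    for block, rank in zip(order, partition_by_cost(costs, 2)):
        print(block, "-> rank", rank)
```

Because consecutive blocks along the curve are spatial neighbours, each rank receives a compact region of the grid, and re-running `partition_by_cost` with updated timings shifts the cut points without reordering the blocks. This is one simple way to adapt the load to changing block execution times, offered here only as an assumption-laden illustration.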
Pages: 1036 - 1047
Page count: 12
Related papers
9 in total
  • [1] Reducing Communication Overhead in Multi-GPU Hybrid Solver for 2D Laplace's Equation
    Czapinski, Michal
    Thompson, Chris
    Barnes, Stuart
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2014, 42 (06) : 1032 - 1047
  • [2] Reducing Communication Overhead in Multi-GPU Hybrid Solver for 2D Laplace’s Equation
    Michał Czapiński
    Chris Thompson
    Stuart Barnes
    International Journal of Parallel Programming, 2014, 42 : 1032 - 1047
  • [3] TRITON: A Multi-GPU open source 2D hydrodynamic flood model
    Morales-Hernandez, M.
    Sharif, Md B.
    Kalyanapu, A.
    Ghafoor, S. K.
    Dullo, T. T.
    Gangrade, S.
    Kao, S. -C.
    Norman, M. R.
    Evans, K. J.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2021, 141
  • [4] 2D GPU-Accelerated High Resolution Numerical Scheme for Solving Diffusive Wave Equations
    Park, Seonryang
    Kim, Boram
    Kim, Dae-Hong
    WATER, 2019, 11 (07)
  • [5] A GPU Numerical Implementation of a 2D Simplified Wildfire Spreading Model
    San Martin, Daniel
    Torres, Claudio E.
    HIGH PERFORMANCE COMPUTING, CARLA 2023, 2024, 1887 : 131 - 145
  • [6] 2D shallow water GPU parallelized scheme for high resolution real-field flood simulations
    Vacondio, R.
    Aureli, F.
    Mignosa, P.
    Dal Palu, A.
    RIVER FLOW 2014, 2014 : 623 - 630
  • [7] Swendsen-Wang Multi-Cluster Algorithm for the 2D/3D Ising Model on Xeon Phi and GPU
    Wende, Florian
    Steinke, Thomas
    2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013
  • [8] Acceleration of a 2D/3D finite-discrete element code for geomechanical simulations using General Purpose GPU computing
    Lisjak, A.
    Mahabadi, O. K.
    He, L.
    Tatone, B. S. A.
    Kaifosh, P.
    Haque, S. A.
    Grasselli, G.
    COMPUTERS AND GEOTECHNICS, 2018, 100 : 84 - 96
  • [9] Parallelization and implementation of multi-spin Monte Carlo simulation of 2D square Ising model using MPI and C++
    Hassani, Dariush
    Rafibakhsh, Shahnoosh
    JOURNAL OF THEORETICAL AND APPLIED PHYSICS, 2018, 12 (03) : 199 - 208