Multi-GPU implementation of a time-explicit finite volume solver using CUDA and a CUDA-Aware version of OpenMPI with application to shallow water flows

被引:14
作者
Delmas, Vincent [1 ,2 ]
Soulaimani, Azzedine [1 ]
机构
[1] Ecole Technol Super ETS, Dept Mech Engn, 1100 Notre Dame Ouest, Montreal, PQ H3C 1K3, Canada
[2] Ecole Natl Super Elect Informat Telecommun Math &, Dept Math & Mecan, 1 Ave Dr Albert Schweitzer, F-33402 Talence, France
关键词
Flood simulations; Shallow-water equations; Multi-GPU; CUDA; MPI; HPC; SIMULATION; SCHEME; EQUATIONS;
D O I
10.1016/j.cpc.2021.108190
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper shows the development of a multi-GPU version of a time-explicit finite volume solver for the Shallow-Water Equations (SWE) on a multi-GPU architecture. MPI is combined with CUDA-Fortran in order to use as many GPUs as needed and the METIS library is leveraged to perform a domain decomposition on the 2D unstructured triangular meshes of interest. A CUDA-Aware version of OpenMPI is adopted to speed up the messages between the MPI processes. A study of both speed-up and efficiency is conducted; first, for a classic dam-break flow in a canal, and then for two real domains with complex bathymetries. In both cases, meshes with up to 12 million cells are used. Using 24 to 28 GPUs on these meshes leads to an efficiency of 80% and more. Finally, the multi-GPU version is compared to the pure MPI multi-CPU version, and it is concluded that in this particular case, about 100 CPU cores would be needed to achieve the same performance as one GPU. The developed methodology is applicable for general time-explicit Riemann solvers for conservation laws. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:22
相关论文
共 54 条
  • [1] [Anonymous], 2009, METIS UNSTRUCTURED G
  • [2] A Weighted Average Flux (WAF) scheme applied to shallow water equations for real-life applications
    Ata, Riadh
    Pavan, Sara
    Khelladi, Sofiane
    Toro, Eleuterio F.
    [J]. ADVANCES IN WATER RESOURCES, 2013, 62 : 155 - 172
  • [3] A well-balanced positivity preserving "second-order" scheme for shallow water flows on unstructured meshes
    Audusse, E
    Bristeau, MO
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2005, 206 (01) : 311 - 333
  • [4] A fast and stable well-balanced scheme with hydrostatic reconstruction for shallow water flows
    Audusse, E
    Bouchut, F
    Bristeau, MO
    Klein, R
    Perthame, B
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2004, 25 (06) : 2050 - 2065
  • [5] Ayyad M., 2020, COMPUT PHYS COMMUN, DOI [10.1016/j.cpc.2020.107495, DOI 10.1016/J.CPC.2020.107495]
  • [6] Heterogeneous CPU/GPU co-execution of CFD simulations on the POWER9 architecture: Application to airplane aerodynamics
    Borrell, R.
    Dosimont, D.
    Garcia-Gasulla, M.
    Houzeaux, G.
    Lehmkuhl, O.
    Mehta, V
    Owen, H.
    Vazquez, M.
    Oyarzun, G.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 107 : 31 - 48
  • [7] Finite-volume model for shallow-water flooding of arbitrary topography
    Bradford, SF
    Sanders, BF
    [J]. JOURNAL OF HYDRAULIC ENGINEERING, 2002, 128 (03) : 289 - 298
  • [8] Efficient shallow water simulations on GPUs: Implementation, visualization, verification, and validation
    Brodtkorb, Andre R.
    Saetra, Martin L.
    Altinakar, Mustafa
    [J]. COMPUTERS & FLUIDS, 2012, 55 : 1 - 12
  • [9] Simulation and visualization of the Saint-Venant system using GPUs
    Brodtkorb, Andre R.
    Hagen, Trond R.
    Lie, Knut-Andreas
    Natvig, Jostein R.
    [J]. COMPUTING AND VISUALIZATION IN SCIENCE, 2010, 13 (07) : 341 - 353
  • [10] Zero mass error using unsteady wetting-drying conditions in shallow flows over dry irregular topography
    Brufau, P
    García-Navarro, P
    Vázquez-Cendón, ME
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2004, 45 (10) : 1047 - 1082