High-performance Shallow Water Model for Use on Massively Parallel and Heterogeneous Computing Systems

被引:0
|
作者
Chaplygin A.V. [1 ]
Gusev A.V. [1 ,2 ,3 ]
Diansky N.A. [1 ,3 ,4 ]
机构
[1] Marchuk Institute of Numerical Mathematics of the Russian Academy of Sciences, Moscow
[2] P.P. Shirshov Institute of Oceanology of the Russian Academy of Sciences, Moscow
[3] N.N. Zubov State Oceanographic Institute, Moscow
[4] Lomonosov Moscow State University, Moscow
基金
俄罗斯基础研究基金会;
关键词
Cuda; Heterogeneous computing systems; Mpi; Openmp; Shallow water; Supercomputer modeling;
D O I
10.14529/JSFI210407
中图分类号
学科分类号
摘要
This paper presents the shallow water model, formulated from the ocean general circulation sigma model INMOM (Institute of Numerical Mathematics Ocean Model). The shallow water model is based on software architecture, which separates the physics-related code from parallel implementation features, thereby simplifying the model’s support and development. As an improvement of the two-dimensional domain decomposition method, we present the blocked-based decomposition proposing load-balanced and cache-friendly calculations on CPUs. We propose various hybrid parallel programming patterns in the shallow water model for effective calculation on massively parallel and heterogeneous computing systems and evaluate their scaling performances on the Lomonosov-2 supercomputer. We demonstrate that performance per a single grid point on GPUs dramatically decreases for small grid sizes starting from 219 points per node, while performance on CPUs scales up to 217 well. Although, calculations on GPUs outperform calculations on CPUs by a factor of 4.7 at 30 nodes using 60 GPUs and 360 CPU cores at 6100 × 4460 grid size. We demonstrate that overlapping kernel execution with data transfers on GPUs increases performance by 28%. Furthermore, we demonstrate the advantage of using the load-balancing method in the Azov Sea model on CPUs and GPUs. © The Authors 2021. This paper is published with open access at SuperFri.org
引用
收藏
页码:74 / 93
页数:19
相关论文
共 50 条
  • [1] A Heterogeneous Supercomputer Model for High-Performance Parallel Computing Pedagogy
    Wolfer, James
    PROCEEDINGS OF 2015 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2015, : 799 - 805
  • [2] Parallel Soft Computing Techniques in High-Performance Computing Systems
    Dorronsoro, Bernabe
    Nesmachnow, Sergio
    COMPUTER JOURNAL, 2016, 59 (06): : 775 - 776
  • [4] MERLIN - MASSIVELY PARALLEL HETEROGENEOUS COMPUTING
    WITTIE, L
    MAPLES, C
    PROCEEDINGS OF THE 1989 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, VOL 1: ARCHITECTURE, 1989, : I142 - I150
  • [5] Parallel computing in bioinformatics: a view from high-performance, heterogeneous, and cloud computing
    Vega-Rodriguez, Miguel A.
    Santander-Jimenez, Sergio
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (07): : 3369 - 3373
  • [6] Parallel computing in bioinformatics: a view from high-performance, heterogeneous, and cloud computing
    Miguel A. Vega-Rodríguez
    Sergio Santander-Jiménez
    The Journal of Supercomputing, 2019, 75 : 3369 - 3373
  • [7] Model for simulation of heterogeneous high-performance computing environments
    de Mello, Rodrigo Fernandes
    Senger, Luciano Jose
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2006, 2007, 4395 : 107 - +
  • [8] A language and programming environment for high-performance parallel computing on heterogeneous networks
    Lastovetsky, AL
    Kalinov, AY
    Ledovskikh, IN
    Arapov, DM
    Posypkin, MA
    PROGRAMMING AND COMPUTER SOFTWARE, 2000, 26 (04) : 216 - 236
  • [9] A language and programming environment for high-performance parallel computing on heterogeneous networks
    A. L. Lastovetsky
    A. Ya. Kalinov
    I. N. Ledovskikh
    D. M. Arapov
    M. A. Posypkin
    Programming and Computer Software, 2000, 26 : 216 - 236
  • [10] A new massively parallel version of CRYSTAL for large systems on high performance computing architectures
    Orlando, Roberto
    Delle Piane, Massimo
    Bush, Ian J.
    Ugliengo, Piero
    Ferrabone, Matteo
    Dovesi, Roberto
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2012, 33 (28) : 2276 - 2284