Parallelization of 2D MPDATA EULAG algorithm on hybrid architectures with GPU accelerators

被引:22
作者
Wyrzykowski, Roman [1 ]
Szustak, Lukasz [1 ]
Rojek, Krzysztof [1 ]
机构
[1] Czestochowa Tech Univ, Inst Comp & Informat Sci, Czestochowa, Poland
关键词
MPDATA advection algorithm; Stencil computation; GPU accelerators; Hybrid CPU-GPU architectures; Hierarchical decomposition; Autotuning; ADVECTION TRANSPORT ALGORITHM; PERFORMANCE; MULTI; IMPLEMENTATION; SIMULATION;
D O I
10.1016/j.parco.2014.04.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
EULAG (Eulerian/semi-Lagrangian fluid solver) is an established computational model developed for simulating thermo-fluid flows across a wide range of scales and physical scenarios. The dynamic core of EULAG includes the multidimensional positive definite advection transport algorithm (MPDATA) and elliptic solver. In this work we investigate aspects of an optimal parallel version of the 2D MPDATA algorithm on modern hybrid architectures with GPU accelerators, where computations are distributed across both GPU and CPU components. Using the hybrid OpenMP-OpenCL model of parallel programming opens the way to harness the power of CPU-GPU platforms in a portable way. In order to better utilize features of such computing platforms, comprehensive adaptations of MPDATA computations to hybrid architectures are proposed. These adaptations are based on efficient strategies for memory and computing resource management, which allow us to ease memory and communication bounds, and better exploit the theoretical floating point efficiency of CPU-GPU platforms. The main contributions of the paper are: method for the decomposition of the 2D MPDATA algorithm as a tool to adapt MPDATA computations to hybrid architectures with GPU accelerators by minimizing communication and synchronization between CPU and GPU components at the cost of additional computations; method for the adaptation of 2D MPDATA computations to multicore CPU platforms, based on space and temporal blocking techniques; method for the adaptation of the 2D MPDATA algorithm to GPU architectures, based on a hierarchical decomposition strategy across data and computation domains, with support provided by the developed GPU task scheduler allowing for the flexible management of available resources; approach to the parametric optimization of 2D MPDATA computations on GPUs using the autotuning technique, which allows us to provide a portable implementation methodology across a variety of GPUs. Hybrid platforms tested in this study contain different numbers of CPUs and GPUs from solutions consisting of a single CPU and a single GPU to the most elaborate configuration containing two CPUs and two GPUs. Processors of different vendors are employed in these systems - both Intel and AMD CPUs, as well as GPUs from NVIDIA and AMD. For all the grid sizes and for all the tested platforms, the hybrid version with computations spread across CPU and GPU components allows us to achieve the highest performance. In particular, for the largest MPDATA grids used in our experiments, the speedups of the hybrid versions over GPU and CPU versions vary from 1.30 to 1.69, and from 1.95 to 2.25, respectively. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:425 / 447
页数:23
相关论文
共 50 条
  • [41] Hybrid 2D and 3D frameworks based on ε-Keggin polyoxometallates:: Experiment and simulation
    Dolbecq, A
    Mellot-Draznieks, C
    Mialane, P
    Marrot, J
    Férey, G
    Sécheresse, F
    EUROPEAN JOURNAL OF INORGANIC CHEMISTRY, 2005, (15) : 3009 - 3018
  • [42] 2D Hybrid Halide Perovskites: Structure, Properties, and Applications in Solar Cells
    Wu, Guangbao
    Liang, Rui
    Zhang, Zhipeng
    Ge, Mingzheng
    Xing, Guichuan
    Sun, Guoxing
    SMALL, 2021, 17 (43)
  • [43] A new type of zinc ion hybrid supercapacitor based on 2D materials
    Zhang, Dan
    Li, Le
    Deng, Jianping
    Guo, Shaobo
    Pang, Haixia
    Lu, Jiufu
    Xia, Donghui
    Ji, Xiaohui
    NANOSCALE, 2021, 13 (25) : 11004 - 11016
  • [44] Hybrid TBETI domain decomposition for huge 2D scalar variational inequalities
    Dostal, Zdenek
    Sadowska, Marie
    Horak, David
    Kruzik, Jakub
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2024, 125 (24)
  • [45] 2D shallow water GPU parallelized scheme for high resolution real-field flood simulations
    Vacondio, R.
    Aureli, F.
    Mignosa, P.
    Dal Palu, A.
    RIVER FLOW 2014, 2014, : 623 - 630
  • [46] Multi-functional 2D hybrid aerogels for gas absorption applications
    Androulidakis, Charalampos
    Kotsidi, Maria
    Gorgolis, George
    Pavlou, Christos
    Sygellou, Labrini
    Paterakis, George
    Koutroumanis, Nick
    Galiotis, Costas
    SCIENTIFIC REPORTS, 2021, 11 (01) : 13548
  • [47] Hybrid statistics of ideal 2D MHD turbulence in a mean magnetic field
    Shebalin, John V.
    RADIATION EFFECTS AND DEFECTS IN SOLIDS, 2011, 166 (10): : 774 - 788
  • [48] Web-Based GPU-Accelerated Application for Multiplanar Reconstructions from Conventional 2D Ultrasound
    Borgbjerg, Jens
    Horlyck, Arne
    ULTRASCHALL IN DER MEDIZIN, 2021, 42 (02): : 194 - 201
  • [49] Acceleration for 2D time-domain elastic full waveform inversion using a single GPU card
    Jiang, Jinpeng
    Zhu, Peimin
    JOURNAL OF APPLIED GEOPHYSICS, 2018, 152 : 173 - 187
  • [50] On Numerical 2D P Colonies Modelling the Grey Wolf Optimization Algorithm
    Valenta, Daniel
    Langer, Miroslav
    PROCESSES, 2021, 9 (02) : 1 - 24