Reducing Communication Overhead in Multi-GPU Hybrid Solver for 2D Laplace's Equation

被引:1
|
作者
Czapinski, Michal [1 ]
Thompson, Chris [1 ]
Barnes, Stuart [1 ]
机构
[1] Cranfield Univ, Appl Math & Comp Grp, Cranfield MK43 0AL, Beds, England
关键词
Hybrid parallelism; Multiple GPUs; Heterogeneous architectures; Non-blocking communication; Laplace solver; CUDA; CONJUGATE GRADIENTS; GRAPHICS; IMPLEMENTATION; COMPUTATION; OVERLAP;
D O I
10.1007/s10766-013-0293-2
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The possibility of porting algorithms to graphics processing units (GPUs) raises significant interest among researchers. The natural next step is to employ multiple GPUs, but communication overhead may limit further performance improvement. In this paper, we investigate techniques reducing overhead on hybrid CPU-GPU platforms, including careful data layout and usage of GPU memory spaces, and use of non-blocking communication. In addition, we propose an accurate automatic load balancing technique for heterogeneous environments. We validate our approach on a hybrid Jacobi solver for 2D Laplace's Equation. Experiments carried out using various graphics hardware and types of connectivity have confirmed that the proposed data layout allows our fastest CUDA kernels to reach the analytical limit for memory bandwidth (up to 106 GB/s on NVidia GTX 480), and that the non-blocking communication significantly reduces overhead, allowing for almost linear speed-up, even when communication is carried out over relatively slow networks.
引用
收藏
页码:1032 / 1047
页数:16
相关论文
共 16 条
  • [1] Reducing Communication Overhead in Multi-GPU Hybrid Solver for 2D Laplace’s Equation
    Michał Czapiński
    Chris Thompson
    Stuart Barnes
    International Journal of Parallel Programming, 2014, 42 : 1032 - 1047
  • [2] Hybrid Multi-GPU Solver Based on Schur Complement Method
    Kopysov, Sergey
    Kuzmin, Igor
    Nedozhogin, Nikita
    Novikov, Alexander
    Sagdeeva, Yulia
    PARALLEL COMPUTING TECHNOLOGIES (PACT 2013), 2013, 7979 : 65 - 79
  • [3] TRITON: A Multi-GPU open source 2D hydrodynamic flood model
    Morales-Hernandez, M.
    Sharif, Md B.
    Kalyanapu, A.
    Ghafoor, S. K.
    Dullo, T. T.
    Gangrade, S.
    Kao, S. -C.
    Norman, M. R.
    Evans, K. J.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2021, 141
  • [4] Multi-GPU implementation of a hybrid thermal lattice Boltzmann solver using the TheLMA framework
    Obrecht, Christian
    Kuznik, Frederic
    Tourancheau, Bernard
    Roux, Jean-Jacques
    COMPUTERS & FLUIDS, 2013, 80 : 269 - 275
  • [5] Multi-GPU accelerated multi-spin Monte Carlo simulations of the 2D Ising model
    Block, Benjamin
    Virnau, Peter
    Preis, Tobias
    COMPUTER PHYSICS COMMUNICATIONS, 2010, 181 (09) : 1549 - 1556
  • [6] A General Design for a Scalable MPI-GPU Multi-Resolution 2D Numerical Solver
    Turchetto, Massimiliano
    Palu, Alessandro Dal
    Vacondio, Renato
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (05) : 1036 - 1047
  • [7] Qubit Placement to Minimize Communication Overhead in 2D Quantum Architectures
    Shafaei, Alireza
    Saeedi, Mehdi
    Pedram, Massoud
    2014 19TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2014, : 495 - 500
  • [8] Parallelization of 2D MPDATA EULAG algorithm on hybrid architectures with GPU accelerators
    Wyrzykowski, Roman
    Szustak, Lukasz
    Rojek, Krzysztof
    PARALLEL COMPUTING, 2014, 40 (08) : 425 - 447
  • [9] A parallel hybrid implementation of the 2D acoustic wave equation
    Altybay, Arshyn
    Ruzhansky, Michael
    Tokmagambetov, Niyaz
    INTERNATIONAL JOURNAL OF NONLINEAR SCIENCES AND NUMERICAL SIMULATION, 2020, 21 (7-8) : 821 - 827
  • [10] GPU-accelerated solutions of the nonlinear Schrodinger equation for simulating 2D spinor BECs
    Smith, Benjamin D.
    Cooke, Logan W.
    LeBlanc, Lindsay J.
    COMPUTER PHYSICS COMMUNICATIONS, 2022, 275