Massively parallel lattice-Boltzmann codes on large GPU clusters

被引：51

作者：

Calore, E. ^{[1
,2
]}

Gabbana, A. ^{[1
]}

Kraus, J. ^{[3
]}

Pellegrini, E. ^{[1
]}

Schifano, S. F. ^{[1
,2
]}

Tripiccione, R. ^{[1
,2
]}

机构：

[1] Univ Ferrara, Via Saragat 1, I-44122 Ferrara, Italy

[2] INFN Ferrara, Via Saragat 1, I-44122 Ferrara, Italy

[3] NVIDIA GmbH, Adenauerstr 20 A4, D-52146 Wurselen, Germany

来源：

PARALLEL COMPUTING | 2016年 / 58卷

关键词：

Lattice-Boltzmann; GPU accelerators; Massively parallel programming; Heterogeneous systems; PERFORMANCE; PORTABILITY;

D O I：

10.1016/j.parco.2016.08.005

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper describes a massively parallel code for a state -of-the art thermal lattice-Boltzmann method. Our code has been carefully optimized for performance on one GPU and to have a good scaling behavior extending to a large number of GPUs. Versions of this code have been already used for large-scale studies of convective turbulence. GPUs are becoming increasingly popular in HPC applications, as they are able to deliver higher performance than traditional processors. Writing efficient programs for large clusters is not an easy task as codes must adapt to increasingly parallel architectures, and the overheads of node-to-node communications must be properly handled. We describe the structure of our code, discussing several key design choices that were guided by theoretical models of performance and experimental benchmarks. We present an extensive set of performance measurements and identify the corresponding main bottlenecks; finally we compare the results of our GPU code with those measured on other currently available high performance processors. Our results are a production-grade code able to deliver a sustained performance of several tens of Tflops as well as a design and optimization methodology that can be used for the development of other high performance applications for computational physics. (C) 2016 Elsevier B.V. All rights reserved.

引用

页码：1 / 24

页数：24

共 39 条

[1]

[Anonymous], 2012, P 26 ACM INT C SUPER, DOI [DOI 10.1145/2304576.2304619, 10.1145/2304576.2304619]

[2]

[Anonymous], 2013, INTRO CUDA AWARE MPI

[3]

[Anonymous], 2014, BENCHMARKING GPUDIRE

[4] A flexible high-performance Lattice Boltzmann GPU code for the simulations of fluid flows in complex geometries [J].

Bernaschi, Massimo ;

Fatica, Massimiliano ;

Melchionna, Simone ;

Succi, Sauro ;

Kaxiras, Efthimios .

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2010, 22 (01) :1-14

[5] Second-order closure in stratified turbulence: Simulations and modeling of bulk and entrainment regions [J].

Biferale, L. ;

Mantovani, F. ;

Sbragaglia, M. ;

Scagliarini, A. ;

Toschi, F. ;

Tripiccione, R. .

PHYSICAL REVIEW E, 2011, 84 (01)

[6] Reactive Rayleigh-Taylor systems: Front propagation and non-stationarity [J].

Biferale, L. ;

Mantovani, F. ;

Sbragaglia, M. ;

Scagliarini, A. ;

Toschi, F. ;

Tripiccione, R. .

EPL, 2011, 94 (05)

[7] Lattice Boltzmann fluid-dynamics on the QPACE supercomputer [J].

Biferale, L. ;

Mantovani, F. ;

Pivanti, M. ;

Sbragaglia, M. ;

Scagliarini, A. ;

Schifano, S. F. ;

Toschi, F. ;

Tripiccione, R. .

ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01) :1069-1076

[8] An optimized D2Q37 Lattice Boltzmann code on GP-GPUs [J].

Biferale, Luca ;

Mantovani, Filippo ;

Pivanti, Marcello ;

Pozzati, Fabio ;

Sbragaglia, Mauro ;

Scagliarini, Andrea ;

Schifano, Sebastiano Fabio ;

Toschi, Federico ;

Tripiccione, Raffaele .

COMPUTERS & FLUIDS, 2013, 80 :55-62

[9]

Biferale L, 2012, LECT NOTES COMPUT SC, V7203, P640, DOI 10.1007/978-3-642-31464-3_65

[10] Optimization of Multi-Phase Compressible Lattice Boltzmann Codes on Massively Parallel Multi-Core Systems [J].

Biferale, Luca ;

Mantovani, Filippo ;

Pivanti, Marcello ;

Pozzati, Fabio ;

Sbragaglia, Mauro ;

Scagliarini, Andrea ;

Schifano, Sebastiano Fabio ;

Toschi, Federico ;

Tripiccione, Raffaele .

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 :994-1003

← 1 2 3 4 →