The TheLMA project: Multi-GPU implementation of the lattice Boltzmann method

被引:11
作者
Obrecht, Christian [1 ,2 ,3 ,6 ]
Kuznik, Frederic [2 ,3 ,6 ]
Tourancheau, Bernard [2 ,4 ,5 ]
Roux, Jean-Jacques [2 ,3 ,6 ]
机构
[1] EDF R&D, Dept EnerBAT, F-77818 Moret Sur Loing, France
[2] Univ Lyon, Lyon 07, France
[3] INSA Lyon, CETHIL, UMR5008, Villeurbanne, France
[4] INSA Lyon, CITI, INRIA, Villeurbanne, France
[5] Univ Lyon 1, F-69622 Villeurbanne, France
[6] INSA Lyon, CETHIL Lab, Villeurbanne, France
关键词
computational fluid dynamics; CUDA; GPU computing; lattice Boltzmann method;
D O I
10.1177/1094342011414745
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we describe the implementation of a multi-graphical processing unit (GPU) fluid flow solver based on the lattice Boltzmann method (LBM). The LBM is a novel approach in computational fluid dynamics, with numerous interesting features from a computational, numerical, and physical standpoint. Our program is based on CUDA and uses POSIX threads to manage multiple computation devices. Using recently released hardware, our solver may therefore run eight GPUs in parallel, which allows us to perform simulations at a rather large scale. Performance and scalability are excellent, the speedup over sequential implementations being at least of two orders of magnitude. In addition, we discuss tiling and communication issues for present and forthcoming implementations.
引用
收藏
页码:295 / 303
页数:9
相关论文
共 17 条
[1]   Numerical linear algebra on emerging architectures: the PLASMA and MAGMA projects [J].
Agullo, Emmanuel ;
Demmel, Jim ;
Dongarra, Jack ;
Hadri, Bilel ;
Kurzak, Jakub ;
Langou, Julien ;
Ltaief, Hatem ;
Luszczek, Piotr ;
Tomov, Stanimire .
SCIDAC 2009: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2009, 180
[2]  
[Anonymous], 2004, Proceedings of the 2004 ACM/IEEE conference on Supercomputing, page, DOI DOI 10.1109/SC.2004.26
[3]   hwloc: a Generic Framework for Managing Hardware Affinities in HPC Applications [J].
Broquedis, Francois ;
Clet-Ortega, Jerome ;
Moreaud, Stephanie ;
Furmento, Nathalie ;
Goglin, Brice ;
Mercier, Guillaume ;
Thibault, Samuel ;
Namyst, Raymond .
PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, :180-186
[4]   Multiple-relaxation-time lattice Boltzmann models in three dimensions [J].
d'Humières, D ;
Ginzburg, I ;
Krafczyk, M ;
Lallemand, P ;
Luo, LS .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2002, 360 (1792) :437-451
[5]  
DHUMIERES D, 1994, P 18 INT S RAR GAS D
[6]   Exploring New Architectures in Accelerating CFD for Air Force Applications [J].
Dongarra, Jack ;
Peterson, Gregory ;
Tomov, Stanimire ;
Allred, Jeff ;
Natoli, Vincent ;
Richie, David .
PROCEEDINGS OF THE HPCMP USERS GROUP CONFERENCE 2008, 2008, :472-+
[7]   LATTICE-GAS AUTOMATA FOR THE NAVIER-STOKES EQUATION [J].
FRISCH, U ;
HASSLACHER, B ;
POMEAU, Y .
PHYSICAL REVIEW LETTERS, 1986, 56 (14) :1505-1508
[8]  
KUZNIK F, 2009, COMPUTERS MATH APPL
[9]   USE OF THE BOLTZMANN-EQUATION TO SIMULATE LATTICE-GAS AUTOMATA [J].
MCNAMARA, GR ;
ZANETTI, G .
PHYSICAL REVIEW LETTERS, 1988, 61 (20) :2332-2335
[10]  
NVIDIA, 2010, COMP UN DEV ARCH PRO