Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units

被引：0

作者：

Obrecht, Christian ^{[1
]}

Kuznik, Frederic ^{[1
]}

Tourancheau, Bernard ^{[2
]}

Roux, Jean-Jacques ^{[1
]}

机构：

[1] Univ Lyon, INSA Lyon, CNRS,Ctr Thermique Lyon, UMR 5008, Bat Sadi Carnot,9 Rue Phys, F-69621 Villeurbanne, France

[2] UCB Lyon 1, UMR 5668,ENS Lyon, INRIA, Lab Informat Parallelisme,CNRS, Lyon 63964, France

来源：

HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2010 | 2011年 / 6449卷

关键词：

GPU computing; CUDA; lattice Boltzmann method; CFD;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this work, we investigate the global memory access mechanism on recent GPUs. For the purpose of this study, we created specific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error.

引用

页码：151 / +

页数：2

共 50 条

[31] The efficient implementation of correction procedure via reconstruction with graphics processing unit computing
Zimmerman, Ben J.
Wang, Z. J.
COMPUTERS & FLUIDS, 2014, 101 : 263 - 272
[32] A cache-efficient implementation of the lattice Boltzmann method for the two-dimensional diffusion equation
Velivelli, AC
Bryden, KM
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2004, 16 (14) : 1415 - 1432
[33] Implementation of the replica-exchange Wang-Landau sampling on Graphics Processing Units (GPUs)
Boer, A.
COMPUTER PHYSICS COMMUNICATIONS, 2018, 233 : 62 - 66
[34] AN APPROACH TO EFFICIENT FEM SIMULATIONS ON GRAPHICS PROCESSING UNITS USING CUDA
Nutti, Bjorn
Marinkovic, Dragan
FACTA UNIVERSITATIS-SERIES MECHANICAL ENGINEERING, 2014, 12 (01) : 15 - 25
[35] Numerical implementation of thermal boundary conditions in the lattice Boltzmann method
Kuo, Long-Sheng
Chen, Ping-Hei
INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2009, 52 (1-2) : 529 - 532
[36] Lattice Boltzmann Method Implementation on Multiple Devices using OpenCL
Tekic, Jelena B.
Tekic, Predrag M.
Rackovic, Milos
ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2018, 18 (03) : 3 - 8
[37] Mathematical Modelling of Volatile Gas Using Lattice Boltzmann Method
Dhaundiyal, Alok
Singh, Suraj Bhan
ENVIRONMENTAL AND CLIMATE TECHNOLOGIES, 2020, 24 (01) : 483 - 500
[38] Comparisons of different implementations of turbulence modelling in lattice Boltzmann method
Xia, Zhenhua
Shi, Yipeng
Chen, Yu
Wang, Moran
Chen, Shiyi
JOURNAL OF TURBULENCE, 2015, 16 (01): : 67 - 80
[39] Simple lattice Boltzmann method algorithm with low memory usage
Namvar, Morteza
Leclaire, Sebastien
JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 62
[40] Accelerating the Fourier split operator method via graphics processing units
Bauke, Heiko
Keitel, Christoph H.
COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (12) : 2454 - 2463

← 1 2 3 4 5 →