Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units

被引:0
|
作者
Obrecht, Christian [1 ]
Kuznik, Frederic [1 ]
Tourancheau, Bernard [2 ]
Roux, Jean-Jacques [1 ]
机构
[1] Univ Lyon, INSA Lyon, CNRS,Ctr Thermique Lyon, UMR 5008, Bat Sadi Carnot,9 Rue Phys, F-69621 Villeurbanne, France
[2] UCB Lyon 1, UMR 5668,ENS Lyon, INRIA, Lab Informat Parallelisme,CNRS, Lyon 63964, France
来源
HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2010 | 2011年 / 6449卷
关键词
GPU computing; CUDA; lattice Boltzmann method; CFD;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this work, we investigate the global memory access mechanism on recent GPUs. For the purpose of this study, we created specific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error.
引用
收藏
页码:151 / +
页数:2
相关论文
共 50 条
  • [31] The efficient implementation of correction procedure via reconstruction with graphics processing unit computing
    Zimmerman, Ben J.
    Wang, Z. J.
    COMPUTERS & FLUIDS, 2014, 101 : 263 - 272
  • [32] A cache-efficient implementation of the lattice Boltzmann method for the two-dimensional diffusion equation
    Velivelli, AC
    Bryden, KM
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2004, 16 (14) : 1415 - 1432
  • [33] Implementation of the replica-exchange Wang-Landau sampling on Graphics Processing Units (GPUs)
    Boer, A.
    COMPUTER PHYSICS COMMUNICATIONS, 2018, 233 : 62 - 66
  • [34] AN APPROACH TO EFFICIENT FEM SIMULATIONS ON GRAPHICS PROCESSING UNITS USING CUDA
    Nutti, Bjorn
    Marinkovic, Dragan
    FACTA UNIVERSITATIS-SERIES MECHANICAL ENGINEERING, 2014, 12 (01) : 15 - 25
  • [35] Numerical implementation of thermal boundary conditions in the lattice Boltzmann method
    Kuo, Long-Sheng
    Chen, Ping-Hei
    INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2009, 52 (1-2) : 529 - 532
  • [36] Lattice Boltzmann Method Implementation on Multiple Devices using OpenCL
    Tekic, Jelena B.
    Tekic, Predrag M.
    Rackovic, Milos
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2018, 18 (03) : 3 - 8
  • [37] Mathematical Modelling of Volatile Gas Using Lattice Boltzmann Method
    Dhaundiyal, Alok
    Singh, Suraj Bhan
    ENVIRONMENTAL AND CLIMATE TECHNOLOGIES, 2020, 24 (01) : 483 - 500
  • [38] Comparisons of different implementations of turbulence modelling in lattice Boltzmann method
    Xia, Zhenhua
    Shi, Yipeng
    Chen, Yu
    Wang, Moran
    Chen, Shiyi
    JOURNAL OF TURBULENCE, 2015, 16 (01): : 67 - 80
  • [39] Simple lattice Boltzmann method algorithm with low memory usage
    Namvar, Morteza
    Leclaire, Sebastien
    JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 62
  • [40] Accelerating the Fourier split operator method via graphics processing units
    Bauke, Heiko
    Keitel, Christoph H.
    COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (12) : 2454 - 2463