Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units

被引:0
|
作者
Obrecht, Christian [1 ]
Kuznik, Frederic [1 ]
Tourancheau, Bernard [2 ]
Roux, Jean-Jacques [1 ]
机构
[1] Univ Lyon, INSA Lyon, CNRS,Ctr Thermique Lyon, UMR 5008, Bat Sadi Carnot,9 Rue Phys, F-69621 Villeurbanne, France
[2] UCB Lyon 1, UMR 5668,ENS Lyon, INRIA, Lab Informat Parallelisme,CNRS, Lyon 63964, France
来源
HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2010 | 2011年 / 6449卷
关键词
GPU computing; CUDA; lattice Boltzmann method; CFD;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this work, we investigate the global memory access mechanism on recent GPUs. For the purpose of this study, we created specific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error.
引用
收藏
页码:151 / +
页数:2
相关论文
共 50 条
  • [21] Sparse Geometries Handling in Lattice Boltzmann Method Implementation for Graphic Processors
    Tomczak, Tadeusz
    Szafran, Roman G.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (08) : 1865 - 1878
  • [22] Implementation of a Lattice Boltzmann Method for Large Eddy Simulation on Multiple GPUs
    Li, Qinjian
    Zhong, Chengwen
    Li, Kai
    Zhang, Guangyong
    Lu, Xiaowei
    Zhang, Qing
    Zhao, Kaiyong
    Chu, Xiaowen
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 818 - 823
  • [23] Implementation of Multi-GPU Based Lattice Boltzmann Method for Flow Through Porous Media
    Huang, Changsheng
    Shi, Baochang
    He, Nanzhong
    Chai, Zhenhua
    ADVANCES IN APPLIED MATHEMATICS AND MECHANICS, 2015, 7 (01) : 1 - 12
  • [24] Modelling Particle Capture Efficiency with Lattice Boltzmann Method
    Fan, Jianhua
    Lomine, Franck
    Hellou, Mustapha
    COMMUNICATIONS IN COMPUTATIONAL PHYSICS, 2018, 23 (04) : 932 - 950
  • [25] Modelling of permeability of textile reinforcements: lattice Boltzmann method
    Belov, EB
    Lomov, SV
    Verpoest, I
    Peters, T
    Roose, D
    Parnas, RS
    Hoes, K
    Sol, H
    COMPOSITES SCIENCE AND TECHNOLOGY, 2004, 64 (7-8) : 1069 - 1080
  • [26] Modelling of Polymer Flows by Lattice-Boltzmann Method
    Vikhansky, Alexander
    NOVEL TRENDS IN RHEOLOGY IV, 2011, 1375
  • [27] Performance Optimization of Implementation of Lattice Boltzmann Method in ARUZ
    Jablonski, Grzegorz
    Kupis, Joanna
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEM (MIXDES 2018), 2018, : 188 - 191
  • [28] Gravitational tree-code on graphics processing units: implementation in CUDA
    Gaburov, Evghenii
    Bedorf, Jeroen
    Zwart, Simon Portegies
    ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 1113 - 1121
  • [29] Performance Estimation of Lattice Boltzmann Method Implementation in ARUZ
    Jablonski, Grzegorz
    Kupis, Joanna
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS - MIXDES 2017, 2017, : 308 - 313
  • [30] High Performance Implementation of ARIA Encryption Algorithm on Graphics Processing Units
    Xiao, Limin
    Li, Yongnan
    Ruan, Li
    Yao, Guangchao
    Li, Deguo
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 504 - 510