DUGKS-GPU: An efficient parallel GPU code for 3D turbulent flow simulations using Discrete Unified Gas Kinetic Scheme

被引:4
|
作者
Karzhaubayev, Kairzhan [1 ]
Wang, Lian-Ping [1 ]
Zhakebayev, Dauren [2 ]
机构
[1] Southern Univ Sci & Technol, Guangdong Hong Kong Macao Joint Lab Data Driven, Shenzhen, Guangdong, Peoples R China
[2] Int Engn Technol Univ, Natl Engn Acad Republ Kazakhstan, Alma Ata, Kazakhstan
基金
中国国家自然科学基金;
关键词
Gas-kinetic scheme; DUGKS; CUDA; GPU; Taylor-Green; Turbulent channel flow; DIRECT NUMERICAL-SIMULATION; LATTICE BOLTZMANN METHOD; IMMERSED BOUNDARY; GRAPHICS;
D O I
10.1016/j.cpc.2024.109216
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a parallel implementation of the Discrete Unified Gas Kinetic Scheme (DUGKS) on the GPU system using the CUDA Fortran and CUDA C++ programming languages. Firstly, we conducted an extensive revision of our original CPU-based code, resulting in a threefold decrease in memory usage. This new implementation is also paired with a novel approach to compute cell face flux using trilinear interpolation. It is shown analytically that the interpolation-based approach to flux calculation is more accurate compared to the one used in the original DUGKS. The initial simulation results using this new approach suggest that trilinear interpolation can reduce numerical errors on a coarse mesh. For example, in the case of the decaying TaylorGreen vortex flow at a 128(3) mesh resolution, the relative numerical error in the energy dissipation rate at t* = 2 , using the spectral simulation result as the benchmark, is approximately 30% lower than that of the original implementation. The improved GPU DUGKS method is applied to laminar and turbulent flows in periodic and wall-bounded boundary configurations. A performance comparison of the GPU implementation is also presented and compared to the previous CPU implementation. A maximum speedup of 7.64x was achieved on a desktoplevel GPU compared to a 32-core CPU. The strong scaling test, conducted on an eight-GPU node, demonstrated the efficient utilization of available multiple GPU resources by the code. Program summary Program Title: DUGKS-GPU CPC Library link to program files: https://doi .org /10 .17632 /yykv5s9g2n.1 Developer's repository link: https://github .com /kairzhan /DUGKS-GPU Code Ocean capsule: https://codeocean .com /capsule /3b1e4f74-cdd3-4781-8923-8514d5923dfb/ Licensing provisions: GNU General Public License 3 Programming language: CUDA C++, CUDA Fortran Supplementary material: Nature of problem: DUGKS-GPU is an advanced numerical code designed to accelerate the simulation of turbulent fluid flows through the application of the kinetic method known as the Discrete Unified Gas Kinetic Scheme (DUGKS). This computational tool comprises a CUDA Fortran version, tailored for a single GPU, as well as a CUDA C++ version developed for multi-GPU platforms. Solution method: The code employs a second-order finite-volume discretization of the discrete velocity Boltzmann equation with the BGK collision model. It can be used to simulate small to medium-scale problems on modern multi-GPU platforms with CUDA architecture.
引用
收藏
页数:13
相关论文
共 22 条
  • [1] GPU implementation of the discrete unified gas kinetic scheme for low-speed isothermal flows
    Liu, Peiyao
    Huang, Changsheng
    Guo, Zhaoli
    COMPUTER PHYSICS COMMUNICATIONS, 2024, 294
  • [2] GPU accelerated simulations of 3D deterministic particle transport using discrete ordinates method
    Gong, Chunye
    Liu, Jie
    Chi, Lihua
    Huang, Haowei
    Fang, Jingyue
    Gong, Zhenghu
    JOURNAL OF COMPUTATIONAL PHYSICS, 2011, 230 (15) : 6010 - 6022
  • [3] A GPU parallel scheme for accelerating 2D and 3D peridynamics models
    Wang, Xiaoming
    Wang, Qihang
    An, Boyang
    He, Qing
    Wang, Ping
    Wu, Jun
    THEORETICAL AND APPLIED FRACTURE MECHANICS, 2022, 121
  • [4] An efficient parallel spectral code for 3D periodic flow simulations
    Karzhaubayev, Kairzhan
    Wang, Lian-Ping
    Zhakebayev, Dauren
    SOFTWAREX, 2022, 20
  • [5] Numerical study on slip flow using the discrete unified gas-kinetic scheme
    Guo, Wenqiang
    Hou, Guoxiang
    Guan, Yin
    Liu, Senyun
    INTERNATIONAL JOURNAL OF NUMERICAL METHODS FOR HEAT & FLUID FLOW, 2022, 32 (07) : 2476 - 2505
  • [6] Memory-Efficient Parallelization of 3D Lattice Boltzmann Flow Solver on a GPU
    Nhat-Phuong Tran
    Lee, Myungho
    Choi, Dong Hoon
    2015 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2015, : 315 - 324
  • [7] Parallel Optimization of 3D Cardiac Electrophysiological Model Using GPU
    Xia, Yong
    Wang, Kuanquan
    Zhang, Henggui
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [8] Parallel computations on GPU in 3D using the vortex particle method
    Kosior, Andrzej
    Kudela, Henryk
    COMPUTERS & FLUIDS, 2013, 80 : 423 - 428
  • [9] Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA
    Mrozek, Dariusz
    Brozek, Milosz
    Malysiak-Mrozek, Bozena
    JOURNAL OF MOLECULAR MODELING, 2014, 20 (02)
  • [10] Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA
    Dariusz Mrozek
    Miłosz Brożek
    Bożena Małysiak-Mrozek
    Journal of Molecular Modeling, 2014, 20