DUGKS-GPU: An efficient parallel GPU code for 3D turbulent flow simulations using Discrete Unified Gas Kinetic Scheme

Cited by: 4
Authors
Karzhaubayev, Kairzhan [1]
Wang, Lian-Ping [1]
Zhakebayev, Dauren [2]
Affiliations
[1] Southern Univ Sci & Technol, Guangdong Hong Kong Macao Joint Lab Data Driven, Shenzhen, Guangdong, Peoples R China
[2] Int Engn Technol Univ, Natl Engn Acad Republ Kazakhstan, Alma Ata, Kazakhstan
Funding
National Natural Science Foundation of China;
Keywords
Gas-kinetic scheme; DUGKS; CUDA; GPU; Taylor-Green; Turbulent channel flow; DIRECT NUMERICAL-SIMULATION; LATTICE BOLTZMANN METHOD; IMMERSED BOUNDARY; GRAPHICS;
DOI
10.1016/j.cpc.2024.109216
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
This paper presents a parallel implementation of the Discrete Unified Gas Kinetic Scheme (DUGKS) on GPU systems using the CUDA Fortran and CUDA C++ programming languages. First, we extensively revised our original CPU-based code, which reduced memory usage threefold. The new implementation is paired with a novel approach that computes the cell-face flux using trilinear interpolation. It is shown analytically that this interpolation-based flux calculation is more accurate than the one used in the original DUGKS. Initial simulation results with the new approach suggest that trilinear interpolation can reduce numerical errors on a coarse mesh. For example, for the decaying Taylor-Green vortex flow at a 128^3 mesh resolution, the relative numerical error in the energy dissipation rate at t* = 2, using the spectral simulation result as the benchmark, is approximately 30% lower than that of the original implementation. The improved GPU DUGKS method is applied to laminar and turbulent flows in periodic and wall-bounded configurations. The performance of the GPU implementation is also compared with that of the previous CPU implementation. A maximum speedup of 7.64x was achieved on a desktop-level GPU compared to a 32-core CPU. A strong scaling test conducted on an eight-GPU node demonstrated that the code uses multiple GPUs efficiently.
Program summary
Program Title: DUGKS-GPU
CPC Library link to program files: https://doi.org/10.17632/yykv5s9g2n.1
Developer's repository link: https://github.com/kairzhan/DUGKS-GPU
Code Ocean capsule: https://codeocean.com/capsule/3b1e4f74-cdd3-4781-8923-8514d5923dfb/
Licensing provisions: GNU General Public License 3
Programming language: CUDA C++, CUDA Fortran
Supplementary material:
Nature of problem: DUGKS-GPU is an advanced numerical code designed to accelerate the simulation of turbulent fluid flows through the application of the kinetic method known as the Discrete Unified Gas Kinetic Scheme (DUGKS). This computational tool comprises a CUDA Fortran version, tailored for a single GPU, as well as a CUDA C++ version developed for multi-GPU platforms.
Solution method: The code employs a second-order finite-volume discretization of the discrete velocity Boltzmann equation with the BGK collision model. It can be used to simulate small to medium-scale problems on modern multi-GPU platforms with CUDA architecture.
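For reference, the discrete velocity Boltzmann equation with the BGK collision model named under the solution method is commonly written in the form below. The notation is the standard one and is assumed here rather than taken from this record: f_α is the distribution function for the discrete particle velocity ξ_α, f_α^eq is the corresponding equilibrium distribution, and τ is the relaxation time.

```latex
\frac{\partial f_\alpha}{\partial t}
  + \boldsymbol{\xi}_\alpha \cdot \nabla f_\alpha
  = -\frac{f_\alpha - f_\alpha^{\mathrm{eq}}}{\tau}
```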
Pages: 13