共 50 条
A highly-efficient locally encoded boundary scheme for lattice Boltzmann method on GPU
被引:1
|作者:
Zhang, Zehua
[1
]
Peng, Cheng
[2
]
Li, Chengxiang
[1
,3
]
Zhang, Hua
[1
,4
]
Xian, Tao
[1
]
Wang, Lian-Ping
[1
]
机构:
[1] Southern Univ Sci & Technol, Dept Mech & Aerosp Engn, Ctr Complex Flows & Soft Matter Res, Guangdong Prov Key Lab Turbulence Res & Applicat, Shenzhen 518055, Peoples R China
[2] Shandong Univ, Minist Educ, Sch Mech Engn, Key Lab High Efficiency & Clean Mech Manufacture, Jinan 250061, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Mech & Aerosp Engn, Hong Kong, Hong Kong, Peoples R China
[4] Natl Univ Singapore, Dept Mech Engn, 10 Kent Ridge Crescent, Singapore 119260, Singapore
基金:
中国国家自然科学基金;
关键词:
Lattice Boltzmann method;
Graphics processing unit;
CUDA;
Boundary scheme;
PARTICULATE SUSPENSIONS;
NUMERICAL SIMULATIONS;
IMPLEMENTATION;
EQUATION;
FLUID;
D O I:
10.1016/j.cpc.2024.109119
中图分类号:
TP39 [计算机的应用];
学科分类号:
081203 ;
0835 ;
摘要:
The lattice Boltzmann method (LBM) is an algorithm to simulate fluid flows with the advantage of locality and simplicity, which is suitable for GPU acceleration and simulation of complex flows. However, LBM simulations involving complex solid boundaries require each boundary node to be aware of the types of all its neighbor nodes, i.e., fluid or solid, during the execution of boundary conditions, which involves tremendous data transfer between global and local memory on GPU. Such data transfer operations constitute a large portion of consumed time and can significantly affect simulation efficiency. This article proposes a novel boundary processing scheme that encodes the neighbor nodes' information into a single integer and stores it on the local node. We choose two- and three-dimensional porous -medium flows to test the performance of the proposed scheme on complex boundary geometries and compare it with the usual schemes that retrieve information redundantly from neighbors. The comparison shows that our proposed scheme can improve the overall computing efficiency by up to 40% for 3D flow simulations through porous media. Such improvement is achieved by reducing time consumption on data transfer.
引用
收藏
页数:11
相关论文