Accelerating CFD simulation with high order finite difference method on curvilinear coordinates for modern GPU clusters

被引:18
作者
Ye, Chuang-Chao [1 ]
Zhang, Peng-Jun-Yi [1 ]
Wan, Zhen-Hua [1 ]
Yan, Rui [1 ]
Sun, De-Jun [1 ]
机构
[1] Univ Sci & Technol China, Dept Modern Mech, Hefei, Peoples R China
关键词
Hardware-aware; High order; Finite difference methods; Curvilinear coordinates; GPU; SCHEMES;
D O I
10.1186/s42774-021-00098-3
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
A high fidelity flow simulation for complex geometries for high Reynolds number (Re) flow is still very challenging, requiring a more powerful HPC system. However, the development of HPC with traditional CPU architecture suffers bottlenecks due to its high power consumption and technical difficulties. Heterogeneous architecture computation is raised to be a promising solution to the challenges of HPC development. GPU accelerating technology has been utilized in low order scheme CFD solvers on the structured grid and high order scheme solvers on unstructured meshes. The high-order finite difference methods on structured grids possess many advantages, e.g., high efficiency, robustness, and low storage. However, the strong dependence among points for a high-order finite difference scheme still limits its application on the GPU platform. In the present work, we propose a set of hardware-aware technology to optimize data transfer efficiency between CPU and GPU, as well as communication efficiency among GPUs. An in-house multi-block structured CFD solver with high order finite difference methods on curvilinear coordinates is ported onto the GPU platform and obtains satisfying performance with a speedup maximum of around 2000x over a single CPU core. This work provides an efficient solution to apply GPU computing in CFD simulation with specific high order finite difference methods on current GPU heterogeneous computers. The test shows that significant accelerating effects can be achieved for different GPUs.
引用
收藏
页数:32
相关论文
共 20 条
[1]   Numerical study of screech generation in a planar supersonic jet [J].
Berland, Julien ;
Bogey, Christophe ;
Bailly, Christophe .
PHYSICS OF FLUIDS, 2007, 19 (07)
[2]   COMPUTATIONAL AERODYNAMICS DEVELOPMENT AND OUTLOOK [J].
CHAPMAN, DR .
AIAA JOURNAL, 1979, 17 (12) :1293-1313
[3]   Grid-point requirements for large eddy simulation: Chapman's estimates revisited [J].
Choi, Haecheon ;
Moin, Parviz .
PHYSICS OF FLUIDS, 2012, 24 (01)
[4]   A parallel direct cut algorithm for high-order overset methods with application to a spinning golf ball [J].
Crabill, J. ;
Witherden, F. D. ;
Jameson, A. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2018, 374 :692-723
[5]   Large calculation of the flow over a hypersonic vehicle using a GPU [J].
Elsen, Erich ;
LeGresley, Patrick ;
Darve, Eric .
JOURNAL OF COMPUTATIONAL PHYSICS, 2008, 227 (24) :10148-10161
[6]   Assessment of WENO schemes for numerical simulation of some hyperbolic equations using GPU [J].
Esfahanian, Vahid ;
Darian, Hossein Mahmoodi ;
Gohari, S. M. Iman .
COMPUTERS & FLUIDS, 2013, 80 :260-268
[7]   Efficient implementation of weighted ENO schemes [J].
Jiang, GS ;
Shu, CW .
JOURNAL OF COMPUTATIONAL PHYSICS, 1996, 126 (01) :202-228
[8]   AN ALTERNATIVE FORMULATION OF FINITE DIFFERENCE WEIGHTED ENO SCHEMES WITH LAX-WENDROFF TIME DISCRETIZATION FOR CONSERVATION LAWS [J].
Jiang, Yan ;
Shu, Chi-Wang ;
Zhang, Mengping .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2013, 35 (02) :A1137-A1160
[9]   High order accurate simulation of compressible flows on GPU clusters over Software Distributed Shared Memory [J].
Karantasis, Konstantinos I. ;
Polychronopoulos, Eleftherios D. ;
Ekaterinaris, John A. .
COMPUTERS & FLUIDS, 2014, 93 :18-29
[10]   A Multi-GPU Parallel Algorithm in Hypersonic Flow Computations [J].
Lai, Jianqi ;
Li, Hua ;
Tian, Zhengyu ;
Zhang, Ye .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019