A high-order cross-platform incompressible Navier-Stokes solver via artificial compressibility with application to a turbulent jet

被引:35
作者
Loppi, N. A. [1 ]
Witherden, F. D. [2 ]
Jameson, A. [2 ]
Vincent, P. E. [1 ]
机构
[1] Imperial Coll London, Dept Aeronaut, London SW7 2AZ, England
[2] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
基金
英国工程与自然科学研究理事会;
关键词
Incompressible flows; Artificial compressibility; Flux reconstruction; Modern hardware; Parallel algorithms; Turbulence; FLUX RECONSTRUCTION SCHEMES; SPECTRAL DIFFERENCE METHOD; FINITE-ELEMENT-METHOD; UNSTRUCTURED GRIDS; CONSERVATION-LAWS; EQUATIONS; SIMULATION; FRAMEWORK; PYFR;
D O I
10.1016/j.cpc.2018.06.016
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Modern hardware architectures such as GPUs and manycore processors are characterised by an abundance of compute capability relative to memory bandwidth. This makes them well-suited to solving temporally explicit and spatially compact discretisations of hyperbolic conservation laws. However, classical pressure-projection-based incompressible Navier-Stokes formulations do not fall into this category. One attractive formulation for solving incompressible problems on modern hardware is the method of artificial compressibility. When combined with explicit dual time stepping and a high-order Flux Reconstruction discretisation, the majority of operations can be cast as compute bound matrix-matrix multiplications that are well-suited for GPU acceleration and manycore processing. In this work, we develop a high-order cross-platform incompressible Navier-Stokes solver, via artificial compressibility and dual time stepping, in the PyFR framework. The solver runs on a range of computer architectures, from laptops to the largest supercomputers, via a platform-unified templating approach that can generate/compile CUDA, OpenCL and C/OpenMP code at runtime. The extensibility of the cross-platform templating framework defined within PyFR is clearly demonstrated, as is the utility of P-multigrid for convergence acceleration. The platform independence of the solver is verified on Nvidia Tesla P100 GPUs and Intel Xeon Phi 7210 KNL manycore processors with a 3D Taylor-Green vortex test case. Additionally, the solver is applied to a 3D turbulent jet test case at Re = 10,000, and strong scaling is reported up to 144 GPUs. The new software constitutes the first high-order accurate cross-platform implementation of an incompressible Navier-Stokes solver via artificial compressibility and P-multigrid accelerated dual time stepping to be published in the literature. The technology has applications in a range of sectors, including the maritime and automotive industries. Moreover, due to its cross-platform nature, the technology is well placed to remain relevant in an era of rapidly evolving hardware architectures. Program summary Program Title: PyFR v1.7.5 Program Files doi : http://dx. doi .org/10.17632/65m665nt9c.1 Licensing provisions: BSD 3-clause Programming language: Python, CUDA, OpenCL and C Supplementary material: Configuration and mesh files for the Taylor-Green Vortex and Turbulent jet test cases Journal reference of previous version: Comput. Phys. Commun. 185 (2014) 3028-3040 Does the new version supersede the previous version?: Yes Reasons for the new version: Adding support for incompressible flows Summary of revisions: Introducing a new high-order cross-platform incompressible flow solver via artificial compressibility and P-multigrid accelerated dual time stepping. Nature of problem: Incompressible Euler and Navier-Stokes equations for solving unsteady turbulent flows. Solution method: Artificial compressibility formulation discretised with a high-order Flux Reconstruction approach in space and P-multigrid accelerated dual time stepping in time. Additional comments including restrictions and unusual features: The algorithm targets modern massively parallel hardware platforms. Cross-platform capability is achieved via runtime code generation. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:193 / 205
页数:13
相关论文
共 34 条
[1]  
[Anonymous], 2008, NODAL DISCONTINUOUS
[2]   An artificial compressibility flux for the discontinuous Galerkin solution of the incompressible Navier-Stokes equations [J].
Bassi, F. ;
Crivellini, A. ;
Di Pietro, D. A. ;
Rebay, S. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2006, 218 (02) :794-815
[3]   Numerical simulation of the noise generated by a low Mach number, low Reynolds number jet [J].
Boersma, BJ .
FLUID DYNAMICS RESEARCH, 2004, 35 (06) :425-447
[4]   Turbulence and energy budget in a self-preserving round jet: direct evaluation using large eddy simulation [J].
Bogey, C. ;
Bailly, C. .
JOURNAL OF FLUID MECHANICS, 2009, 627 :129-160
[5]   A numerical method for solving incompressible viscous flow problems (Reprinted from the Journal of Computational Physics, vol 2, pg 12-26, 1997) [J].
Chorin, AJ .
JOURNAL OF COMPUTATIONAL PHYSICS, 1997, 135 (02) :118-125
[6]   TVB RUNGE-KUTTA LOCAL PROJECTION DISCONTINUOUS GALERKIN FINITE-ELEMENT METHOD FOR CONSERVATION-LAWS .2. GENERAL FRAMEWORK [J].
COCKBURN, B ;
SHU, CW .
MATHEMATICS OF COMPUTATION, 1989, 52 (186) :411-435
[7]   THE RUNGE-KUTTA LOCAL PROJECTION DISCONTINUOUS GALERKIN FINITE-ELEMENT METHOD FOR CONSERVATION-LAWS .4. THE MULTIDIMENSIONAL CASE [J].
COCKBURN, B ;
HOU, SC ;
SHU, CW .
MATHEMATICS OF COMPUTATION, 1990, 54 (190) :545-581
[8]   A high-order solver for unsteady incompressible Navier-Stokes equations using the flux reconstruction method on unstructured grids with implicit dual time stepping [J].
Cox, Christopher ;
Liang, Chunlei ;
Plesniak, Michael W. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 314 :414-435
[9]  
Elsworth D., 1992, NASA STI RECON TECH, V94
[10]   p-Multigrid solution of high-order discontinuous Galerkin discretizations of the compressible Navier-Stokes equations [J].
Fidkowski, KJ ;
Oliver, TA ;
Lu, J ;
Darmofal, DL .
JOURNAL OF COMPUTATIONAL PHYSICS, 2005, 207 (01) :92-113