OpenACC acceleration of an unstructured CFD solver based on a reconstructed discontinuous Galerkin method for compressible flows

被引:23
作者
Xia, Yidong [1 ]
Lou, Jialin [1 ]
Luo, Hong [1 ]
Edwards, Jack [1 ]
Mueller, Frank [2 ]
机构
[1] N Carolina State Univ, Dept Mech & Aerosp Engn, Raleigh, NC 27695 USA
[2] N Carolina State Univ, Dept Comp Sci, Raleigh, NC 27695 USA
关键词
GPU computing; OpenACC; CUDA; discontinuous Galerkin; compressible flow; Navier-Stokes equations; NAVIER-STOKES; CODE;
D O I
10.1002/fld.4009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
An OpenACC directive-based graphics processing unit (GPU) parallel scheme is presented for solving the compressible Navier-Stokes equations on 3D hybrid unstructured grids with a third-order reconstructed discontinuous Galerkin method. The developed scheme requires the minimum code intrusion and algorithm alteration for upgrading a legacy solver with the GPU computing capability at very little extra effort in programming, which leads to a unified and portable code development strategy. A face coloring algorithm is adopted to eliminate the memory contention because of the threading of internal and boundary face integrals. A number of flow problems are presented to verify the implementation of the developed scheme. Timing measurements were obtained by running the resulting GPU code on one Nvidia Tesla K20c GPU card (Nvidia Corporation, Santa Clara, CA, USA) and compared with those obtained by running the equivalent Message Passing Interface (MPI) parallel CPU code on a compute node (consisting of two AMD Opteron 6128 eight-core CPUs (Advanced Micro Devices, Inc., Sunnyvale, CA, USA)). Speedup factors of up to 24x and 1.6x for the GPU code were achieved with respect to one and 16 CPU cores, respectively. The numerical results indicate that this OpenACC-based parallel scheme is an effective and extensible approach to port unstructured high-order CFD solvers to GPU computing. Copyright (c) 2015John Wiley & Sons, Ltd.
引用
收藏
页码:123 / 139
页数:17
相关论文
共 42 条
[1]  
[Anonymous], 2009, P 47 AIAA AER SCI M
[2]   Unsteady CFD computations using vertex-centered finite volumes for unstructured grids on Graphics Processing Units [J].
Asouti, V. G. ;
Trompoukis, X. S. ;
Kampolis, I. C. ;
Giannakoglou, K. C. .
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2011, 67 (02) :232-246
[3]   Discontinuous Galerkin solution of the Reynolds-averaged Navier-Stokes and k-ω turbulence model equations [J].
Bassi, F ;
Crivellini, A ;
Rebay, S ;
Savini, M .
COMPUTERS & FLUIDS, 2005, 34 (4-5) :507-540
[4]   Average-state Jacobians and implicit methods for compressible viscous and turbulent flows [J].
Batten, P ;
Leschziner, MA ;
Goldberg, UC .
JOURNAL OF COMPUTATIONAL PHYSICS, 1997, 137 (01) :38-78
[5]   Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware [J].
Brandvik, T. ;
Pullan, G. .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2007, 221 (12) :1745-1748
[6]  
Brandvik T., 2008, AIAA PAPER, P2008
[7]   The Runge-Kutta discontinuous Galerkin method for conservation laws V - Multidimensional systems [J].
Cockburn, B ;
Shu, CW .
JOURNAL OF COMPUTATIONAL PHYSICS, 1998, 141 (02) :199-224
[8]  
Cockburns B, 1990, J MATH PHYS, V55, P545
[9]  
Cohen J., 2009, PARALLEL COMPUTATION, P414
[10]   Semi-automatic porting of a large-scale Fortran CFD code to GPUs [J].
Corrigan, Andrew ;
Camelli, Fernando ;
Loehner, Rainald ;
Mut, Fernando .
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2012, 69 (02) :314-331