Performance of a three-dimensional unstructured mesh compressible flow solver on NVIDIA Fermi-class graphics processing unit hardware

被引:10
作者
Waltz, Jacob [1 ]
机构
[1] Los Alamos Natl Lab, Computat Phys Div, Los Alamos, NM 87545 USA
关键词
Eulerian; finite element; partial differential equations; compressible flow; parallelization; explicit; GRIDS;
D O I
10.1002/fld.3744
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We describe the performance of Chicoma, a 3D unstructured mesh compressible flow solver, on graphics processing unit (GPU) hardware. The approach used to deploy the solver on GPU architectures derives from the threaded multicore execution model used in Chicoma, and attempts to improve memory performance via the application of graph theory techniques. The result is a scheme that can be deployed on the GPU with high-level programming constructs, for example, compiler directives, rather than low-level programming extensions. With an NVIDIA Fermi-class GPU (NVIDIA Corp., Sta. Clara, CA, USA) and double precision floating point arithmetic, we observe performance gains of 45xon problem sizes of 106 107 tetrahedra. We also compare GPU performance to threaded multicore performance with OpenMP and demonstrate hybrid multicore-GPU calculations with adaptive mesh refinement. Published 2012. This article is a US Government work and is in the public domain in the USA.
引用
收藏
页码:259 / 268
页数:10
相关论文
共 20 条
[1]  
Amdahl G.M, 1967, AFIPS67 SPRING P AFI, P483, DOI DOI 10.1145/1465482.1465560
[2]  
[Anonymous], NVIDIA CUDA C PROGR
[3]  
Castonguay P, 2011, 20 AIAA COMP FLUID D
[4]   Running unstructured grid-based CFD solvers on modern graphics hardware [J].
Corrigan, Andrew ;
Camelli, Fernando F. ;
Loehner, Rainald ;
Wallin, John .
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2011, 66 (02) :221-229
[5]  
Gaster B., 2011, Heterogeneous Computing with OpenCL, V1st
[6]  
Hirsch C., 1991, NUMERICAL COMPUTATIO
[7]  
Jarloff HJ, 1987, J ALGORITHMS, V8, P39
[8]  
KLOCKNER A, 2009, ARXIVORG09011024
[9]  
Lai YC, 1996, INT J NUMER METH ENG, V39, P1137, DOI 10.1002/(SICI)1097-0207(19960415)39:7<1137::AID-NME897>3.0.CO
[10]  
2-R