A massively parallel GPU-accelerated model for analysis of fully nonlinear free surface waves

Cited by: 44
Authors
Engsig-Karup, A. P. [1]
Madsen, Morten G. [1]
Glimberg, Stefan L. [1]
Affiliations
[1] Tech Univ Denmark, Dept Informat & Math Modeling, DK-2800 Lyngby, Denmark
Keywords
nonlinear water waves; coastal and offshore engineering; finite difference method; potential flow; time domain; scientific GPU computations; high-performance computing; WATER-WAVES;
DOI
10.1002/fld.2675
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
We implement and evaluate a massively parallel and scalable algorithm based on a multigrid preconditioned defect correction method for the simulation of fully nonlinear free surface flows. The simulations are based on a potential model that describes wave propagation over uneven bottoms in three space dimensions and is useful for fast analysis and prediction purposes in coastal and offshore engineering. A dedicated numerical model based on the proposed algorithm is executed in parallel on an affordable, modern, special-purpose graphics processing unit (GPU). The model is based on a low-storage, flexible-order accurate finite difference method that is known to be efficient and scalable on a CPU core (single thread). To achieve parallel performance of the relatively complex numerical model, we investigate a new trend in high-performance computing where many-core GPUs are utilized as high-throughput co-processors to the CPU. We describe and demonstrate how this approach makes it possible to do fast desktop computations for large nonlinear wave problems in numerical wave tanks (NWTs) with close to 50/100 million total grid points in double/single precision with 4 GB of global device memory available. A new code base has been developed in C++ and Compute Unified Device Architecture (CUDA) C and is found to improve the runtime by more than an order of magnitude in double precision arithmetic for the same accuracy over an existing CPU (single thread) Fortran 90 code when executed on a single modern GPU. These significant improvements are achieved by carefully implementing the algorithm to minimize data transfer and take advantage of the massive multi-threading capability of the GPU device. Copyright (c) 2011 John Wiley & Sons, Ltd.
Pages: 20-36
Page count: 17