A massively parallel GPU-accelerated model for analysis of fully nonlinear free surface waves

被引:44
|
作者
Engsig-Karup, A. P. [1 ]
Madsen, Morten G. [1 ]
Glimberg, Stefan L. [1 ]
机构
[1] Tech Univ Denmark, Dept Informat & Math Modeling, DK-2800 Lyngby, Denmark
关键词
nonlinear water waves; coastal and offshore engineering; finite difference method; potential flow; time domain; scientific GPU computations; high-performance computing; WATER-WAVES;
D O I
10.1002/fld.2675
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We implement and evaluate a massively parallel and scalable algorithm based on a multigrid preconditioned Defect Correction method for the simulation of fully nonlinear free surface flows. The simulations are based on a potential model that describes wave propagation over uneven bottoms in three space dimensions and is useful for fast analysis and prediction purposes in coastal and offshore engineering. A dedicated numerical model based on the proposed algorithm is executed in parallel by utilizing affordable modern special purpose graphics processing unit (GPU). The model is based on a low-storage flexible-order accurate finite difference method that is known to be efficient and scalable on a CPU core (single thread). To achieve parallel performance of the relatively complex numerical model, we investigate a new trend in high-performance computing where many-core GPUs are utilized as high-throughput co-processors to the CPU. We describe and demonstrate how this approach makes it possible to do fast desktop computations for large nonlinear wave problems in numerical wave tanks (NWTs) with close to 50/100 million total grid points in double/single precision with 4?GB global device memory available. A new code base has been developed in C++ and compute unified device architecture C and is found to improve the runtime more than an order in magnitude in double precision arithmetic for the same accuracy over an existing CPU (single thread) Fortran 90 code when executed on a single modern GPU. These significant improvements are achieved by carefully implementing the algorithm to minimize data-transfer and take advantage of the massive multi-threading capability of the GPU device. Copyright (c) 2011 John Wiley & Sons, Ltd.
引用
收藏
页码:20 / 36
页数:17
相关论文
共 50 条
  • [41] Towards a real-time modeling of global ocean waves by the fully GPU-accelerated spectral wave model WAM6-GPU v1.0
    Yuan, Ye
    Yu, Fujiang
    Chen, Zhi
    Li, Xueding
    Hou, Fang
    Gao, Yuanyong
    Gao, Zhiyi
    Pang, Renbo
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2024, 17 (16) : 6123 - 6136
  • [42] Parallel GPU-accelerated Recursion-based Generators of Pseudorandom Numbers
    Stpiczynski, Przemyslaw
    Szalkowski, Dominik
    Potiopa, Joanna
    2012 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2012, : 571 - 578
  • [43] Adapting a Message-Driven Parallel Application to GPU-Accelerated Clusters
    Phillips, James C.
    Stone, John E.
    Schultent, Klaus
    INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2008, : 444 - +
  • [44] Tuned and GPU-Accelerated Parallel Data Mining from Comparable Corpora
    Wolk, Krzysztof
    Marasek, Krzysztof
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 32 - 40
  • [45] Distributed Multi-GPU Accelerated Hybrid Parallel Rendering for Massively Parallel Environment
    Cao, Yi
    Wang, Huawei
    Ai, Zhiwei
    2014 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV2014), 2014, : 30 - 36
  • [46] GPU-accelerated surface denoising and morphing with lattice Boltzmann scheme
    Zhao, Ye
    IEEE INTERNATIONAL CONFERENCE ON SHAPE MODELING AND APPLICATIONS 2008, PROCEEDINGS, 2008, : 19 - 28
  • [47] Physics-guided neural network and GPU-accelerated nonlinear model predictive control for quadcopter
    Seong Hyeon Hong
    Junlin Ou
    Yi Wang
    Neural Computing and Applications, 2023, 35 : 393 - 413
  • [48] GPU-accelerated Path-based Timing Analysis
    Guo, Guannan
    Huang, Tsung-Wei
    Lin, Yibo
    Wong, Martin
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 721 - 726
  • [49] Physics-guided neural network and GPU-accelerated nonlinear model predictive control for quadcopter
    Hong, Seong Hyeon
    Ou, Junlin
    Wang, Yi
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01): : 393 - 413
  • [50] ddcMD: A fully GPU-accelerated molecular dynamics program for the Martini force field
    Zhang, Xiaohua
    Sundram, Shiv
    Oppelstrup, Tomas
    Kokkila-Schumacher, Sara I. L.
    Carpenter, Timothy S.
    Ingolfsson, Helgi I.
    Streitz, Frederick H.
    Lightstone, Felice C.
    Glosli, James N.
    JOURNAL OF CHEMICAL PHYSICS, 2020, 153 (04):