Multi-GPU Accelerated Finite-difference Time-domain Solver in Open Computing Language

被引:0
|
作者
Stefanski, T. P. [1 ,2 ]
Chavannes, N. [3 ]
Kuster, N. [2 ]
机构
[1] ETH, Integrated Syst Lab, Gloriastr 35, CH-8092 Zurich, Switzerland
[2] ITIS Fdn, CH-8004 Zurich, Switzerland
[3] SPEAG Software R&D, CH-8004 Zurich, Switzerland
来源
PIERS 2011 MARRAKESH: PROGRESS IN ELECTROMAGNETICS RESEARCH SYMPOSIUM | 2011年
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents results of an evaluation of a novel multi-GPU OpenCL FDTD solver. Multi-GPU implementations are necessary, not only to speed up computations but also to aggregate relatively small GPU memories. The advantage of this solver is its portability between hardware manufactured by different vendors and also between highly specialized and parallel computing architectures available on the market, i.e., GPUs and multi-core CPUs. In our implementation, the computational domain is decomposed along the slowest direction, and electromagnetic field boundary data is shared between neighboring subdomains allocated on different GPUs. The communication overhead between GPUs is proportional to the area of the boundary and represents the rate-limiting step of the method. Portability and efficiency of the developed code were tested on a double CPU machine supported by four GPUs. For the utilized hardware devices, the communication overhead can be hidden by computations for sufficiently large simulation domains, giving scaling efficiency higher than 90%. Porting the OpenCL code dedicated to GPUs directly to multi-core CPUs gives unsatisfactory performance due to the application of architecture specific features in the GPU code. OpenCL kernels of the related FDTD code were therefore optimized to improve performance on multi-core CPUs. Despite this, OpenCL FDTD implementation on CPUs was up to 1.5 times slower than on a commercially available FDTD solver developed in the OpenMP standard. Subsequently, this paper presents an application of the developed multi-GPU OpenCL FDTD code in solving a real-life problem of computational electromagnetics in order to demonstrate its performance and applicability.
引用
收藏
页码:1850 / 1853
页数:4
相关论文
共 50 条
  • [1] PARALLEL 3D FINITE-DIFFERENCE TIME-DOMAIN METHOD ON MULTI-GPU SYSTEMS
    Du, Liu-Ge
    Li, Kang
    Kong, Fan-Min
    Hu, Yuan
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2011, 22 (02): : 107 - 121
  • [2] GPU-accelerated finite-difference time-domain method for dielectric media based on CUDA
    Wang, Ximin
    Liu, Song
    Li, Xuan
    Zhong, Shuangying
    INTERNATIONAL JOURNAL OF RF AND MICROWAVE COMPUTER-AIDED ENGINEERING, 2016, 26 (06) : 512 - 518
  • [3] A GPU-Accelerated Finite-Difference Time-Domain Scheme for Electromagnetic Wave Interaction With Plasma
    Cannon, Patrick D.
    Honary, Farideh
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2015, 63 (07) : 3042 - 3054
  • [4] A New Parallel Frequency-Domain Finite-Difference Algorithm Using Multi-GPU
    Wang, Yijing
    He, Xinbo
    Wei, Bin
    IEEE MICROWAVE AND WIRELESS TECHNOLOGY LETTERS, 2024, 34 (08): : 971 - 974
  • [5] Finite-difference time-domain methods
    不详
    NATURE REVIEWS METHODS PRIMERS, 2023, 3 (01):
  • [6] Finite-difference time-domain methods
    F. L. Teixeira
    C. Sarris
    Y. Zhang
    D.-Y. Na
    J.-P. Berenger
    Y. Su
    M. Okoniewski
    W. C. Chew
    V. Backman
    J. J. Simpson
    Nature Reviews Methods Primers, 3
  • [7] A Stable Subgridding Finite Difference Time Domain Method on Multi-GPU Cluster
    Ritter, Jan
    Benjes, Magnus
    Murso, Martin
    Wulf, Daniela
    Lange, Sebastian
    2015 IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (EMC), 2015, : 507 - 511
  • [8] Multi-time-step finite-difference time-domain method
    Zheng, Yang-Ming
    Chu, Qing-Xin
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2004, 32 (09): : 1504 - 1506
  • [9] On optimization of finite-difference time-domain (FDTD) computation on heterogeneous and GPU clusters
    Shams, Ramtin
    Sadeghi, Parastoo
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (04) : 584 - 593
  • [10] Parallel Spatially Dispersive Finite-Difference Time-Domain Method Based on GPU
    Song, Wei
    Bao, Meng
    Sheng, Xin-Qing
    2012 10TH INTERNATIONAL SYMPOSIUM ON ANTENNAS, PROPAGATION & EM THEORY (ISAPE), 2012, : 1003 - 1007