Single and Dual-GPU Generalized Sparse Eigenvalue Solvers for Finding a Few Low-Order Resonances of a Microwave Cavity Using the Finite-Element Method

Cited by: 0
Authors
Dziekonski, Adam [1]
Mrozowski, Michal [1]
Affiliations
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Narutowicza 11-12, PL-80233 Gdansk, Poland
Keywords
FEM; generalized eigenvalue problem; GPU; Maxwell's equations; resonators; EFFICIENT; IMPLEMENTATION; ACCELERATION; EIGENSOLVER; CIRCUITS; MOMENTS;
DOI
10.13164/re.2018.0930
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic and Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract
This paper presents two fast generalized eigenvalue solvers for sparse symmetric matrices that arise when electromagnetic cavity resonances are investigated using the higher-order finite element method (FEM). To find a few low-order resonances, the locally optimal block preconditioned conjugate gradient (LOBPCG) algorithm with null-space deflation is applied. The computations are accelerated with one or two graphics processing units (GPUs). The performance of the solvers is tested on single- and dual-GPU hardware setups with two types of GPU: NVIDIA Kepler K40 and NVIDIA Pascal P100. The speed of the GPU-accelerated solvers is compared to a multithreaded implementation of the same algorithm on a multicore central processing unit (CPU; Intel Xeon E5-2680 v3 with twelve cores). Even for the least efficient setups, the GPU-accelerated code is approximately twice as fast as the parallel CPU-only implementation.
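For readers unfamiliar with LOBPCG, the following is a minimal CPU-only sketch of finding a few of the lowest eigenpairs of a sparse generalized problem K x = λ M x. It is not the authors' GPU solver: it uses SciPy's `lobpcg` routine, a toy 1D finite-element problem (whose matrices have no null space, so null-space deflation is not demonstrated), and an exact sparse factorization of K as the preconditioner. The function names `toy_fem_matrices` and `lowest_modes`, the problem size, and the tolerances are all assumptions made for illustration.

```python
import numpy as np
from scipy.sparse import diags
from scipy.sparse.linalg import LinearOperator, lobpcg, splu


def toy_fem_matrices(n):
    """Stiffness K and mass M for -u'' = lambda * u on (0, 1) with
    u(0) = u(1) = 0, discretized with n interior linear elements nodes.
    This is a stand-in for the cavity FEM matrices, not the real thing."""
    h = 1.0 / (n + 1)
    K = diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(n, n), format="csc") / h
    M = diags([1.0, 4.0, 1.0], [-1, 0, 1], shape=(n, n), format="csc") * (h / 6.0)
    return K, M


def lowest_modes(K, M, k=3, seed=0):
    """Return the k smallest eigenvalues/eigenvectors of K x = lambda M x."""
    n = K.shape[0]
    # Preconditioner (assumption): exact sparse LU of K, applied as K^{-1}.
    lu = splu(K)
    precond = LinearOperator((n, n), matvec=lu.solve, matmat=lu.solve,
                             dtype=K.dtype)
    # Random block of k starting vectors; LOBPCG iterates on the whole block.
    X0 = np.random.default_rng(seed).standard_normal((n, k))
    eigvals, eigvecs = lobpcg(K, X0, B=M, M=precond,
                              tol=1e-8, maxiter=200, largest=False)
    order = np.argsort(eigvals)
    return eigvals[order], eigvecs[:, order]


K, M = toy_fem_matrices(200)
eigvals, eigvecs = lowest_modes(K, M)
```

For this toy problem the computed eigenvalues should lie close to the analytic values (jπ)² for j = 1, 2, 3. In a cavity problem with a large null space, SciPy's `lobpcg` also accepts a constraint block `Y` that keeps the iterates orthogonal to given vectors; whether that corresponds to the deflation scheme used in the paper is not stated in the abstract.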
Pages: 930-936
Number of pages: 7