FINITE ELEMENT MATRIX GENERATION ON A GPU

被引:59
作者
Dziekonski, A. [1 ]
Sypek, P. [1 ]
Lamecki, A. [1 ]
Mrozowski, M. [1 ]
机构
[1] Gdansk Univ Technol, Wireless Commun Engn WiComm Ctr Excellence, Dept Microwave & Antenna Engn, Fac Elect Telecommun & Informat,CUDA Res Ctr Comp, PL-80233 Gdansk, Poland
来源
PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER | 2012年 / 128卷
关键词
TRANSMISSION CHARACTERISTICS; FEM; SCATTERING; EFFICIENT; DOMAIN; MODEL;
D O I
10.2528/PIER12040301
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents an efficient technique for fast generation of sparse systems of linear equations arising in computational electromagnetics in a finite element method using higher order elements. The proposed approach employs a graphics processing unit (GPU) for both numerical integration and matrix assembly. The performance results obtained on a test platform consisting of a Fermi GPU (1x Tesla C2075) and a CPU (2x twelve-core Opterons), indicate that the GPU implementation of the matrix generation allows one to achieve speedups by a factor of 81 and 19 over the optimized single- and multi-threaded CPU-only implementations, respectively.
引用
收藏
页码:249 / 265
页数:17
相关论文
共 27 条
[1]  
[Anonymous], 2011, CUDA EXAMPLE INTRO G
[2]   IMPROVING THREE-DIMENSIONAL ELECTRICAL CAPACITANCE TOMOGRAPHY IMAGING USING APPROXIMATION ERROR MODEL THEORY [J].
Banasiak, R. ;
Ye, Z. ;
Soleimani, M. .
JOURNAL OF ELECTROMAGNETIC WAVES AND APPLICATIONS, 2012, 26 (2-3) :411-421
[3]  
Cecka C., 2011, GPU GEMS, V3
[4]   Finite-Element Sparse Matrix Vector Multiplication on Graphic Processing Units [J].
Dehnavi, Maryam Mehri ;
Fernandez, David M. ;
Giannacopoulos, Dennis .
IEEE TRANSACTIONS ON MAGNETICS, 2010, 46 (08) :2982-2985
[5]   A MEMORY EFFICIENT AND FAST SPARSE MATRIX VECTOR PRODUCT ON A GPU [J].
Dziekonski, A. ;
Lamecki, A. ;
Mrozowski, M. .
PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2011, 116 :49-63
[6]   Tuning a Hybrid GPU-CPU V-Cycle Multilevel Preconditioner for Solving Large Real and Complex Systems of FEM Equations [J].
Dziekonski, Adam ;
Lamecki, Adam ;
Mrozowski, Michal .
IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2011, 10 :619-622
[7]   GPU Acceleration of Multilevel Solvers for Analysis of Microwave Components With Finite Element Method [J].
Dziekonski, Adam ;
Lamecki, Adam ;
Mrozowski, Michal .
IEEE MICROWAVE AND WIRELESS COMPONENTS LETTERS, 2011, 21 (01) :1-3
[8]   EFFICIENT MODEL ORDER REDUCTION FOR FEM ANALYSIS OF WAVEGUIDE STRUCTURES AND RESONATORS [J].
Fotyga, G. ;
Nyka, K. ;
Mrozowski, M. .
PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2012, 127 :277-295
[9]   MAPPING THE SBR AND TW-ILDCs TO HETEROGENEOUS CPU-GPU ARCHITECTURE FOR FAST COMPUTATION OF ELECTROMAGNETIC SCATTERING [J].
Gao, P. C. ;
Tao, Y. B. ;
Bai, Z. H. ;
Lin, H. .
PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2012, 122 :137-154
[10]   FAST RCS PREDICTION USING MULTIRESOLUTION SHOOTING AND BOUNCING RAY METHOD ON THE GPU [J].
Gao, P. C. ;
Tao, Y. B. ;
Lin, H. .
PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2010, 107 :187-202