GPU-warp based finite element matrices generation and assembly using coloring method

被引:16
|
作者
Kiran, Utpal [1 ]
Sharma, Deepak [1 ]
Gautam, Sachin Singh [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Mech Engn, Gauhati 781039, Assam, India
关键词
Finite element method; Numerical integration; Assembly; GPU; CUDA; Coloring method; NUMERICAL-INTEGRATION; IMPLEMENTATION; ACCELERATION; SOLVERS; SYSTEM;
D O I
10.1016/j.jcde.2018.11.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Finite element method has been successfully implemented on the graphics processing units to achieve a significant reduction in simulation time. In this paper, new strategies for the finite element matrix generation including numerical integration and assembly are proposed by using a warp per element for a given mesh. These strategies are developed using the well-known coloring method. The proposed strategies use a specialized algorithm to realize fine-grain parallelism and efficient use of on-chip memory resources. The warp shuffle feature of Compute Unified Device Architecture (CUDA) is used to accelerate numerical integration. The evaluation of elemental stiffness matrix is further optimized by adopting a partial parallel implementation of numerical integration. Performance evaluations of the proposed strategies are done for three-dimensional elasticity problem using the 8-noded hexahedral elements with three degrees of freedom per node. We obtain a speedup of up to 8.2x over the coloring based assembly by element strategy (using a single thread per element) on NVIDIA Tesla K40 GPU. Also, the proposed strategies achieve better arithmetic throughput and bandwidth. (C) 2018 Society for Computational Design and Engineering. Publishing Services by Elsevier.
引用
收藏
页码:705 / 718
页数:14
相关论文
共 50 条
  • [1] GPU finite element method computation strategy without mesh coloring
    Amorim L.
    Goveia T.
    Mesquita R.
    Baratta I.
    Journal of Microwaves, Optoelectronics and Electromagnetic Applications, 2020, 19 (02): : 252 - 264
  • [2] GPU-based matrix-free finite element solver exploiting symmetry of elemental matrices
    Kiran, Utpal
    Gautam, Sachin Singh
    Sharma, Deepak
    COMPUTING, 2020, 102 (09) : 1941 - 1965
  • [3] GPU-Accelerated Finite Element Method
    Dziekonski, Adam
    Lamecki, Adam
    Mrozowski, Michal
    2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2016,
  • [4] GPU-based implementation of finite element method for elasticity using CUDA
    Zhang, Jianfei
    Shen, Defei
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1003 - 1008
  • [5] GPU-based matrix-free finite element solver exploiting symmetry of elemental matrices
    Utpal Kiran
    Sachin Singh Gautam
    Deepak Sharma
    Computing, 2020, 102 : 1941 - 1965
  • [7] A novel parallel finite element procedure for nonlinear dynamic problems using GPU and mixed-precision algorithm
    Wang, Shengquan
    Wang, Chao
    Cai, Yong
    Li, Guangyao
    ENGINEERING COMPUTATIONS, 2020, 37 (06) : 2193 - 2211
  • [8] Parallel assembly of finite element matrices on multicore computers
    Krysl, Petr
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2024, 428
  • [9] Generation of large finite-element matrices on multiple graphics processors
    Dziekonski, A.
    Sypek, P.
    Lamecki, A.
    Mrozowski, M.
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2013, 94 (02) : 204 - 220