GPU-warp based finite element matrices generation and assembly using coloring method

Cited by: 16
Authors
Kiran, Utpal [1]
Sharma, Deepak [1]
Gautam, Sachin Singh [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Mech Engn, Gauhati 781039, Assam, India
Keywords
Finite element method; Numerical integration; Assembly; GPU; CUDA; Coloring method; NUMERICAL-INTEGRATION; IMPLEMENTATION; ACCELERATION; SOLVERS; SYSTEM
DOI
10.1016/j.jcde.2018.11.001
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The finite element method has been successfully implemented on graphics processing units to achieve a significant reduction in simulation time. In this paper, new strategies for finite element matrix generation, including numerical integration and assembly, are proposed that use one warp per element for a given mesh. These strategies are developed using the well-known coloring method. The proposed strategies use a specialized algorithm to realize fine-grain parallelism and efficient use of on-chip memory resources. The warp shuffle feature of Compute Unified Device Architecture (CUDA) is used to accelerate numerical integration. The evaluation of the elemental stiffness matrix is further optimized by adopting a partially parallel implementation of numerical integration. Performance evaluations of the proposed strategies are done for a three-dimensional elasticity problem using 8-noded hexahedral elements with three degrees of freedom per node. We obtain a speedup of up to 8.2x over the coloring-based assembly-by-element strategy (using a single thread per element) on an NVIDIA Tesla K40 GPU. The proposed strategies also achieve better arithmetic throughput and bandwidth. © 2018 Society for Computational Design and Engineering. Publishing Services by Elsevier.
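The coloring idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' warp-based CUDA implementation; it is a generic, serial Python illustration under stated assumptions: a greedy coloring is used (the record does not say which coloring algorithm the paper uses), the mesh is non-empty, and the names `color_elements` and `assemble_by_color` are hypothetical. Elements that share a node receive different colors, so all elements of one color could in principle be assembled concurrently without write conflicts.

```python
from collections import defaultdict


def color_elements(elements):
    """Greedy coloring (an assumption, not necessarily the paper's scheme):
    any two elements that share a node receive different colors."""
    node_to_elems = defaultdict(list)
    for e, nodes in enumerate(elements):
        for n in nodes:
            node_to_elems[n].append(e)

    colors = [-1] * len(elements)
    for e, nodes in enumerate(elements):
        # Colors already taken by elements that touch any of this element's nodes.
        used = {colors[nb] for n in nodes for nb in node_to_elems[n] if colors[nb] >= 0}
        c = 0
        while c in used:
            c += 1
        colors[e] = c
    return colors


def assemble_by_color(elements, colors, local_value=1.0):
    """Accumulate a per-node quantity one color at a time.

    Within one color no two elements write to the same node, so on a GPU
    that inner loop could run in parallel without atomic operations.
    Here it runs serially and only illustrates the ordering."""
    n_nodes = 1 + max(n for nodes in elements for n in nodes)
    diag = [0.0] * n_nodes
    for c in range(max(colors) + 1):
        for e, nodes in enumerate(elements):
            if colors[e] != c:
                continue
            for n in nodes:
                diag[n] += local_value
    return diag


# Hypothetical 1D chain of four 2-node elements over nodes 0..4.
elements = [(0, 1), (1, 2), (2, 3), (3, 4)]
colors = color_elements(elements)
diag = assemble_by_color(elements, colors)
```

In this toy mesh neighbouring elements alternate between two colors, and each interior node receives contributions from exactly two elements, one per color.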
Pages: 705-718
Page count: 14
Related papers
50 in total
  • [21] Acceleration of Finite Element Method for 3D DC Resistivity Modeling Using Multi-GPU
    Anwar, Hairil
    Kistijantoro, Achmad Imam
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2016,
  • [22] Targeting performance and user-friendliness: GPU-accelerated finite element computation with automated code generation in FEniCS
    Trotter, James D.
    Langguth, Johannes
    Cai, Xing
    PARALLEL COMPUTING, 2023, 118
  • [23] Adaptive finite element method for fractional differential equations using hierarchical matrices
    Zhao, Xuan
    Hu, Xiaozhe
    Cai, Wei
    Karniadakis, George Em
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2017, 325 : 56 - 76
  • [24] PARALLELIZATION OF ASSEMBLY OPERATION IN FINITE ELEMENT METHOD
    Bosansky, Michal
    Patzak, Borek
    ACTA POLYTECHNICA, 2020, 60 (01) : 25 - 37
  • [25] Parallelized combined finite-discrete element (FDEM) procedure using multi-GPU with CUDA
    Liu, Quansheng
    Wang, Weiqin
    Ma, Hao
    INTERNATIONAL JOURNAL FOR NUMERICAL AND ANALYTICAL METHODS IN GEOMECHANICS, 2020, 44 (02) : 208 - 238
  • [26] Communication and Load Balancing Optimization for Finite Element Electromagnetic Simulations Using Multi-GPU Workstation
    Dziekonski, Adam
    Sypek, Piotr
    Lamecki, Adam
    Mrozowski, Michal
    IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2017, 65 (08) : 2661 - 2671
  • [27] A Script-Based, Parameterized Finite Element Mesh for Design and NDE on a GPU
    Sivasuthan, S.
    Karthik, V. U.
    Rahunanthan, A.
    Jayakumar, P.
    Thyagarajan, R. S.
    Udpa, Lalita
    Hoole, S. R. H.
    IETE TECHNICAL REVIEW, 2015, 32 (02) : 94 - 103
  • [28] Voxel Based Finite Element Method Using Homogenization
    Watanabe, Kota
    Iijima, Yosuke
    Kawano, Kenji
    Igarashi, Hajime
    IEEE TRANSACTIONS ON MAGNETICS, 2012, 48 (02) : 543 - 546
  • [29] Adaptive multiresolution finite element method based on second generation wavelets
    He, Yumin
    Chen, Xuefeng
    Xiang, Jiawei
    He, Zhengjia
    FINITE ELEMENTS IN ANALYSIS AND DESIGN, 2007, 43 (6-7) : 566 - 579
  • [30] Accelerated Finite Element Method Solver for RCS Analysis Using CUDA-Based Parallel Computing
    Jo, Mincheol
    Park, Woobin
    Kim, Moonseong
    Lee, Woochan
    IEEE ACCESS, 2024, 12 : 120375 - 120388