GPU-warp based finite element matrices generation and assembly using coloring method

Cited by: 16
Authors
Kiran, Utpal [1]
Sharma, Deepak [1]
Gautam, Sachin Singh [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Mech Engn, Gauhati 781039, Assam, India
Keywords
Finite element method; Numerical integration; Assembly; GPU; CUDA; Coloring method; NUMERICAL-INTEGRATION; IMPLEMENTATION; ACCELERATION; SOLVERS; SYSTEM;
DOI
10.1016/j.jcde.2018.11.001
Chinese Library Classification number
TP39 [Computer applications];
Subject classification codes
081203; 0835;
Abstract
The finite element method has been successfully implemented on graphics processing units to achieve a significant reduction in simulation time. In this paper, new strategies for finite element matrix generation, including numerical integration and assembly, are proposed that assign one warp per element of a given mesh. These strategies are developed using the well-known coloring method. The proposed strategies use a specialized algorithm to realize fine-grain parallelism and efficient use of on-chip memory resources. The warp shuffle feature of the Compute Unified Device Architecture (CUDA) is used to accelerate numerical integration. The evaluation of the elemental stiffness matrix is further optimized by adopting a partially parallel implementation of numerical integration. Performance evaluations of the proposed strategies are carried out for a three-dimensional elasticity problem using 8-noded hexahedral elements with three degrees of freedom per node. We obtain a speedup of up to 8.2x over the coloring-based assembly-by-element strategy (using a single thread per element) on an NVIDIA Tesla K40 GPU. The proposed strategies also achieve better arithmetic throughput and bandwidth. © 2018 Society for Computational Design and Engineering. Publishing Services by Elsevier.
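As an illustration of the coloring idea the abstract relies on, the sketch below colors mesh elements so that no two elements sharing a node receive the same color, then assembles one color at a time. It is a minimal serial sketch and not the authors' GPU implementation: the greedy coloring heuristic, the function names `color_elements` and `assemble_by_color`, and the scalar per-element contribution are all assumptions made for the example.

```python
from collections import defaultdict


def color_elements(elements):
    """Greedy coloring (assumed heuristic): elements sharing a node get different colors."""
    node_to_elems = defaultdict(list)
    for e, nodes in enumerate(elements):
        for n in nodes:
            node_to_elems[n].append(e)

    colors = [-1] * len(elements)
    for e, nodes in enumerate(elements):
        # Colors already taken by elements that share at least one node with e.
        used = {colors[nb] for n in nodes for nb in node_to_elems[n]
                if colors[nb] != -1}
        c = 0
        while c in used:
            c += 1
        colors[e] = c
    return colors


def assemble_by_color(elements, colors, elem_values, n_nodes):
    """Add each element's (hypothetical scalar) contribution to its nodes, color by color."""
    global_vec = [0.0] * n_nodes
    for c in range(max(colors) + 1):
        # Elements of one color touch disjoint nodes, so on a GPU this
        # inner loop could run in parallel without atomic updates.
        for e, nodes in enumerate(elements):
            if colors[e] == c:
                for n in nodes:
                    global_vec[n] += elem_values[e]
    return global_vec


# Hypothetical 1D chain of four 2-node elements: 0-1, 1-2, 2-3, 3-4.
elements = [(0, 1), (1, 2), (2, 3), (3, 4)]
colors = color_elements(elements)
result = assemble_by_color(elements, colors, [1.0, 1.0, 1.0, 1.0], 5)
```

Because elements of the same color never write to the same global entry, each color can be processed as one conflict-free parallel pass; the number of passes equals the number of colors.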
Pages: 705-718
Page count: 14
Related papers
50 in total
  • [41] Multiblock method for database generation in finite element programs
    Carstea, Daniela
    MATHEMATICAL METHODS AND COMPUTATIONAL TECHNIQUES IN RESEARCH AND EDUCATION, 2007, : 53 - +
  • [42] A GPU-based framework for finite element analysis of elastoplastic problems
    Utpal Kiran
    Deepak Sharma
    Sachin Singh Gautam
    Computing, 2023, 105 : 1673 - 1696
  • [43] High-Performance Simulation of Drug Release Model Using Finite Element Method with CPU/GPU Platform
    Ali, Akhtar
    Bajwa, Imran Sarwar
    Kazmi, Rafaqat
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2019, 25 (10) : 1261 - 1278
  • [44] Numerical modeling of second harmonic generation in optical waveguides using the finite element method
    Katsriku, FA
    Rahman, BMA
    Grattan, KTV
    IEEE JOURNAL OF QUANTUM ELECTRONICS, 1997, 33 (10) : 1727 - 1733
  • [45] Fast assembly of Galerkin matrices for 3D solid laminated composites using finite element and isogeometric discretizations
    Antolin, Pablo
    COMPUTATIONAL MECHANICS, 2020, 65 (01) : 135 - 148
  • [46] Comparison of the structure of equation systems and the GPU multifrontal solver for finite difference, collocation and finite element method
    Lipski, P.
    Wozniak, M.
    Paszynski, M.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 1072 - 1081
  • [47] A parallel finite element procedure for contact-impact problems using edge-based smooth triangular element and GPU
    Cai, Yong
    Cui, Xiangyang
    Li, Guangyao
    Liu, Wenyang
    COMPUTER PHYSICS COMMUNICATIONS, 2018, 225 : 47 - 58
  • [48] Research Finite Element Method Mesh Generation Based on Material-discontinuous-nature
    Hu, Jiangchun
    Wang, Hongfang
    He, Manchao
    ADVANCED MANUFACTURING TECHNOLOGY, PTS 1, 2, 2011, 156-157 : 74 - +
  • [49] GPU-Accelerated Finite-Element Matrix Generation for Lossless, Lossy, and Tensor Media
    Dziekonski, Adam
    Sypek, Piotr
    Lamecki, Adam
    Mrozowski, Michal
    IEEE ANTENNAS AND PROPAGATION MAGAZINE, 2014, 56 (05) : 186 - 197
  • [50] Fine-grained GPU implementation of assembly-free iterative solver for finite element problems
    Martinez-Frutos, Jesus
    Martinez-Castejon, Pedro J.
    Herrero-Perez, David
    COMPUTERS & STRUCTURES, 2015, 157 : 9 - 18