Symbolic and Numeric Kernel Division for Graphics Processing Unit-Based Finite Element Analysis Assembly of Regular Meshes With Modified Sparse Storage Formats

Cited by: 4
Authors
Sanfui, Subhajit [1]
Sharma, Deepak [1]
Affiliations
[1] Indian Inst Technol, Dept Mech Engn, Gauhati 781039, Assam, India
Keywords
FEA; GPU computing; assembly methods; sparse storage; GPU; GENERATION; MATRICES;
DOI
10.1115/1.4051123
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This paper presents an efficient strategy for performing the assembly stage of finite element analysis (FEA) on general-purpose graphics processing units (GPUs). The strategy divides the assembly task into symbolic and numeric kernels, thereby reducing the complexity of the standard single-kernel assembly approach. Two sparse storage formats based on the proposed strategy are also developed by modifying existing sparse storage formats to remove the degrees-of-freedom-based redundancies in the global matrix. The inherent race-condition problem is resolved using coloring and atomics. The proposed strategy is compared with state-of-the-art GPU-based and central processing unit (CPU)-based assembly techniques. These comparisons show reduced storage space requirements, shorter execution times, and higher performance (GFLOPS). Moreover, with the proposed strategy, the coloring method is found to be more effective than the atomics-based method for both the existing and the modified storage formats.
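The split between a symbolic and a numeric step, and the use of coloring to avoid race conditions, can be illustrated with a small serial sketch. This is not the authors' GPU implementation: it assumes a 1D regular mesh of two-node elements with one degree of freedom per node, a plain CSR layout rather than the paper's modified formats, and hypothetical function names (`symbolic_phase`, `color_elements`, `numeric_phase`). Each color loop stands in for what would be one conflict-free kernel launch on a GPU.

```python
def symbolic_phase(conn, n_nodes):
    """Build the CSR pattern and a scatter map from connectivity only (no values)."""
    cols = [set() for _ in range(n_nodes)]
    for nodes in conn:
        for a in nodes:
            cols[a].update(nodes)
    row_ptr, col_idx = [0], []
    for row in cols:
        col_idx.extend(sorted(row))
        row_ptr.append(len(col_idx))
    # scatter[e][i][j] = position in the CSR value array of entry (nodes[i], nodes[j])
    scatter = []
    for nodes in conn:
        elem_map = []
        for a in nodes:
            start, end = row_ptr[a], row_ptr[a + 1]
            elem_map.append([col_idx.index(b, start, end) for b in nodes])
        scatter.append(elem_map)
    return row_ptr, col_idx, scatter


def color_elements(conn):
    """Greedy coloring: two elements that share a node never get the same color."""
    node_colors = {}  # node -> colors already used by elements touching it
    colors = []
    for nodes in conn:
        used = set()
        for a in nodes:
            used |= node_colors.get(a, set())
        c = 0
        while c in used:
            c += 1
        colors.append(c)
        for a in nodes:
            node_colors.setdefault(a, set()).add(c)
    return colors


def numeric_phase(conn, scatter, colors, ke, nnz):
    """Accumulate element matrices into the CSR value array, one color at a time."""
    values = [0.0] * nnz
    for c in range(max(colors) + 1):
        # Elements of one color share no node, so on a GPU these updates
        # could run in parallel without atomics.
        for e, nodes in enumerate(conn):
            if colors[e] != c:
                continue
            for i in range(len(nodes)):
                for j in range(len(nodes)):
                    values[scatter[e][i][j]] += ke[i][j]
    return values


# Illustrative data (assumed, not from the paper): 4 bar elements, 5 nodes.
KE = [[1.0, -1.0], [-1.0, 1.0]]
conn = [(e, e + 1) for e in range(4)]
row_ptr, col_idx, scatter = symbolic_phase(conn, 5)
colors = color_elements(conn)
values = numeric_phase(conn, scatter, colors, KE, len(col_idx))
```

In this sketch the symbolic step depends only on the mesh connectivity, so it would need to run once per mesh, while the numeric step can be repeated whenever the element matrices change. An atomics-based variant would drop the coloring and instead protect each `+=` with an atomic add.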
Pages: 12