Symbolic and Numeric Kernel Division for Graphics Processing Unit-Based Finite Element Analysis Assembly of Regular Meshes With Modified Sparse Storage Formats

被引:4
作者
Sanfui, Subhajit [1 ]
Sharma, Deepak [1 ]
机构
[1] Indian Inst Technol, Dept Mech Engn, Gauhati 781039, Assam, India
关键词
FEA; GPU computing; assembly methods; sparse storage; GPU; GENERATION; MATRICES;
D O I
10.1115/1.4051123
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents an efficient strategy to perform the assembly stage of finite element analysis (FEA) on general purpose graphics processing units (GPUs). This strategy involves dividing the assembly task using symbolic and numeric kernels, and thereby reducing the complexity of the standard single-kernel assembly approach. Two sparse storage formats based on the proposed strategy are also developed by modifying the existing sparse storage formats with the intention of removing the degrees-of-freedom-based redundancies in the global matrix. The inherent problem of race condition is resolved through the implementation of coloring and atomics. The proposed strategy is compared with the state-of-the-art GPU-based and central processing unit (CPU)-based assembly techniques. These comparisons reveal a significant number of benefits in terms of reducing storage space requirements and execution time and increasing performance (GFLOPS). Moreover, using the proposed strategy, it is found that the coloring method is more effective compared to the atomics-based method for the existing as well as the modified storage formats.
引用
收藏
页数:12
相关论文
共 34 条
  • [21] Accelerating molecular dynamics simulations using Graphics Processing Units with CUDA
    Liu, Weiguo
    Schmidt, Bertil
    Voss, Gerrit
    Mueller-Wittig, Wolfgang
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2008, 179 (09) : 634 - 641
  • [22] 3D finite element numerical integration on GPUs
    Maciol, Pawel
    Plaszewski, Przemyslaw
    Banas, Krzysztof
    [J]. ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 1087 - 1094
  • [23] Finite element assembly strategies on multi-core and many-core architectures
    Markall, G. R.
    Slemmer, A.
    Ham, D. A.
    Kelly, P. H. J.
    Cantwell, C. D.
    Sherwin, S. J.
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2013, 71 (01) : 80 - 97
  • [24] Toward Real-Time Finite-Element Simulation on GPU
    Quang Dinh
    Marechal, Yves
    [J]. IEEE TRANSACTIONS ON MAGNETICS, 2016, 52 (03)
  • [25] Evolutionary and GPU computing for topology optimization of structures
    Ram, Laxman
    Sharma, Deepak
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2017, 35 : 1 - 13
  • [26] Ratnakar S. K., 2020, Advances in Interdisciplinary Engineering, P87
  • [27] Finite Element Algorithms and Data Structures on Graphical Processing Units
    Reguly, I. Z.
    Giles, M. B.
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2015, 43 (02) : 203 - 239
  • [28] Rodriguez-Navarro J., 2006, CESAR MENDOZA ISABEL, V1, P7, DOI [10.2312/PE/vriphys/vriphys06/001-007, DOI 10.2312/PE/VRIPHYS/VRIPHYS06/001-007]
  • [29] Sanfui S., 2019, P 10 INT C COMPUTATI, P641
  • [30] A three-stage graphics processing unit-based finite element analyses matrix generation strategy for unstructured meshes
    Sanfui, Subhajit
    Sharma, Deepak
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2020, 121 (17) : 3824 - 3848