Symbolic and Numeric Kernel Division for Graphics Processing Unit-Based Finite Element Analysis Assembly of Regular Meshes With Modified Sparse Storage Formats

被引：4

作者：

Sanfui, Subhajit ^{[1
]}

Sharma, Deepak ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Mech Engn, Gauhati 781039, Assam, India

来源：

JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING | 2022年 / 22卷 / 01期

关键词：

FEA; GPU computing; assembly methods; sparse storage; GPU; GENERATION; MATRICES;

D O I：

10.1115/1.4051123

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

This paper presents an efficient strategy to perform the assembly stage of finite element analysis (FEA) on general purpose graphics processing units (GPUs). This strategy involves dividing the assembly task using symbolic and numeric kernels, and thereby reducing the complexity of the standard single-kernel assembly approach. Two sparse storage formats based on the proposed strategy are also developed by modifying the existing sparse storage formats with the intention of removing the degrees-of-freedom-based redundancies in the global matrix. The inherent problem of race condition is resolved through the implementation of coloring and atomics. The proposed strategy is compared with the state-of-the-art GPU-based and central processing unit (CPU)-based assembly techniques. These comparisons reveal a significant number of benefits in terms of reducing storage space requirements and execution time and increasing performance (GFLOPS). Moreover, using the proposed strategy, it is found that the coloring method is more effective compared to the atomics-based method for the existing as well as the modified storage formats.

引用

页数：12

共 34 条

[21] Accelerating molecular dynamics simulations using Graphics Processing Units with CUDA
Liu, Weiguo
Schmidt, Bertil
Voss, Gerrit
Mueller-Wittig, Wolfgang
[J]. COMPUTER PHYSICS COMMUNICATIONS, 2008, 179 (09) : 634 - 641
[22] 3D finite element numerical integration on GPUs
Maciol, Pawel
Plaszewski, Przemyslaw
Banas, Krzysztof
[J]. ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 1087 - 1094
[23] Finite element assembly strategies on multi-core and many-core architectures
Markall, G. R.
Slemmer, A.
Ham, D. A.
Kelly, P. H. J.
Cantwell, C. D.
Sherwin, S. J.
[J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2013, 71 (01) : 80 - 97
[24] Toward Real-Time Finite-Element Simulation on GPU
Quang Dinh
Marechal, Yves
[J]. IEEE TRANSACTIONS ON MAGNETICS, 2016, 52 (03)
[25] Evolutionary and GPU computing for topology optimization of structures
Ram, Laxman
Sharma, Deepak
[J]. SWARM AND EVOLUTIONARY COMPUTATION, 2017, 35 : 1 - 13
[26] Ratnakar S. K., 2020, Advances in Interdisciplinary Engineering, P87
[27] Finite Element Algorithms and Data Structures on Graphical Processing Units
Reguly, I. Z.
Giles, M. B.
[J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2015, 43 (02) : 203 - 239
[28] Rodriguez-Navarro J., 2006, CESAR MENDOZA ISABEL, V1, P7, DOI [10.2312/PE/vriphys/vriphys06/001-007, DOI 10.2312/PE/VRIPHYS/VRIPHYS06/001-007]
[29] Sanfui S., 2019, P 10 INT C COMPUTATI, P641
[30] A three-stage graphics processing unit-based finite element analyses matrix generation strategy for unstructured meshes
Sanfui, Subhajit
Sharma, Deepak
[J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2020, 121 (17) : 3824 - 3848

← 1 2 3 4 →