An OpenMP GPU-offload implementation of a non-equilibrium solidification cellular automata model for additive manufacturing

被引:7
作者
Sabau, Adrian S. [1 ]
Yuan, Lang [2 ]
Fattebert, Jean-Luc [1 ]
Turner, John A. [1 ]
机构
[1] Oak Ridge Natl Lab, Computat Sci & Engn Div, Oak Ridge, TN 37831 USA
[2] Univ South Carolina, Dept Mech Engn, Columbia, SC 29208 USA
关键词
GPU; Cellular automata; Solidification; Additive manufacturing; SIMULATION; GROWTH; EVOLUTION; CPU;
D O I
10.1016/j.cpc.2022.108605
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, performance strategies on GPU-based HPC platforms of a cellular automata (CA) simulation code for non-equilibrium solidification, including nucleation, grain growth, solute partitioning and transport for the metal additive manufacturing (AM) process are investigated using OpenMP 4.5. To accurately report the speed-up for multicore CPUs and GPUs, a rigorous performance analysis employed optimizations appropriate for both CPU-only code (baseline) and GPU offload codes for an isothermal test problem. The performance results on Summit at the Oak Ridge Leadership Computing Facility indicate that using a precomputed list of interface cells significantly decreased the wall-clock time on GPUs. The speedup due to GPU acceleration was evaluated for a full Summit node and measured to be 1.8X when comparing a 6 MPI tasks run with 6 GPUs versus 36 MPI tasks on the CPU only. That speed-up was found to be 7.9X when comparing 6 MPI tasks with 6 GPUs versus the 6 MPI tasks running on the CPU only. Performance measurements showed that system total time is almost constant for runs with more than 96 MPI tasks (or GPUs), indicating that the GPU-accelerated code showed an excellent weak scaling performance. Finally, a rapid directional solidification problem was considered to demonstrate the CA code capability on Summit. It was found that a mesh size of at least 0.05 mu m is recommended for the AM-like simulations in order to obtain accurate elongated grain microstructure and elongated subgrain features, which are in qualitative good agreement with experimental data. The results presented in this study indicate that the performance strategies on GPU-based HPC platforms for the CA code are appropriate for novel HPC exascale platforms.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 27 条
  • [1] Modeling and simulation of thermal field and solidification in laser powder bed fusion of nickel alloy IN625
    Arisoy, Yigit M.
    Criales, Luis E.
    Ozel, Tugrul
    [J]. OPTICS AND LASER TECHNOLOGY, 2019, 109 : 278 - 292
  • [2] Additive Manufacturing of Nickel Superalloys: Opportunities for Innovation and Challenges Related to Qualification
    Babu, S. S.
    Raghavan, N.
    Raplee, J.
    Foster, S. J.
    Frederick, C.
    Haines, M.
    Dinwiddie, R.
    Kirka, M. K.
    Plotkowski, A.
    Lee, Y.
    Dehoff, R. R.
    [J]. METALLURGICAL AND MATERIALS TRANSACTIONS A-PHYSICAL METALLURGY AND MATERIALS SCIENCE, 2018, 49A (09): : 3764 - 3780
  • [3] OpenMP application experiences: Porting to accelerated nodes
    Bak, Seonmyeong
    Bertoni, Colleen
    Boehm, Swen
    Budiardja, Reuben
    Chapman, Barbara M.
    Doerfert, Johannes
    Eisenbach, Markus
    Finkel, Hal
    Hernandez, Oscar
    Huber, Joseph
    Iwasaki, Shintaro
    Kale, Vivek
    Kent, Paul R. C.
    Kwack, JaeHyuk
    Lin, Meifeng
    Luszczek, Piotr
    Luo, Ye
    Pham, Buu
    Pophale, Swaroop
    Ravikumar, Kiran
    Sarkar, Vivek
    Scogland, Thomas
    Tian, Shilei
    Yeung, P. K.
    [J]. PARALLEL COMPUTING, 2022, 109
  • [4] Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead
    Diaz, Jose Monsalve
    Friedline, Kyle
    Pophale, Swaroop
    Hernandez, Oscar
    Bernholdt, David E.
    Chandrasekaran, Sunita
    [J]. PARALLEL COMPUTING, 2019, 89
  • [5] A 3D cellular automaton algorithm for the prediction of dendritic grain growth
    Gandin, CA
    Rappaz, M
    [J]. ACTA MATERIALIA, 1997, 45 (05) : 2187 - 2195
  • [6] Development of a CPU/GPU portable software library for Lagrangian-Eulerian simulations of liquid sprays
    Ge, Wenjun
    Sankaran, Ramanan
    Chen, Jacqueline H.
    [J]. INTERNATIONAL JOURNAL OF MULTIPHASE FLOW, 2020, 128
  • [7] Assessment of offload-based programming environments for hybrid CPU-MIC platforms in numerical modeling of solidification
    Halbiniak, Kamil
    Wyrzykowski, Roman
    Szustak, Lukasz
    Olas, Tomasz
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2018, 87 : 48 - 72
  • [8] Lee P.D., US
  • [9] Lee VW, 2010, CONF PROC INT SYMP C, P451, DOI 10.1145/1816038.1816021
  • [10] Levesque J.M., 2012, SC 12 P INT C HIGH P, P1