On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU Architectures

被引:1
作者
Morgan, Nathaniel [1 ]
Yenusah, Caleb [2 ]
Diaz, Adrian [3 ]
Dunning, Daniel [1 ]
Moore, Jacob [3 ,6 ]
Heilman, Erin [3 ]
Roth, Calvin [3 ,7 ]
Lieberman, Evan [3 ]
Walton, Steven [2 ]
Brown, Sarah [1 ]
Holladay, Daniel [4 ]
Knezevic, Marko [5 ]
Whetstone, Gavin [3 ,8 ]
Baker, Zachary [1 ,9 ]
Robey, Robert [3 ,10 ]
机构
[1] Los Alamos Natl Lab, Engn Technol & Design Div, Los Alamos, NM 87545 USA
[2] Los Alamos Natl Lab, Theoret Div, Los Alamos, NM 87545 USA
[3] Los Alamos Natl Lab, Computat Phys Div, Los Alamos, NM 87545 USA
[4] Los Alamos Natl Lab, Comp Computat & Stat Sci Div, Los Alamos, NM 87545 USA
[5] Univ New Hampshire, Dept Mech Engn, Durham, NH 03824 USA
[6] Mississippi State Univ, Mississippi State, MS USA
[7] Univ Minnesota, Minneapolis, MN USA
[8] Texas A&M Univ, College Stn, TX USA
[9] Univ Colorado, Boulder, CO USA
[10] AMD Corp, Santa Clara, CA USA
关键词
performance; portability; productivity; GPUs; dense and sparse data; fine-grained parallelism; GALERKIN HYDRODYNAMIC METHOD; SCHEME;
D O I
10.3390/info15110673
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents software advances to easily exploit computer architectures consisting of a multi-core CPU and CPU+GPU to accelerate diverse types of high-performance computing (HPC) applications using a single code implementation. The paper describes and demonstrates the performance of the open-source C++ matrix and array (MATAR) library that uniquely offers: (1) a straightforward syntax for programming productivity, (2) usable data structures for data-oriented programming (DOP) for performance, and (3) a simple interface to the open-source C++ Kokkos library for portability and memory management across CPUs and GPUs. The portability across architectures with a single code implementation is achieved by automatically switching between diverse fine-grained parallelism backends (e.g., CUDA, HIP, OpenMP, pthreads, etc.) at compile time. The MATAR library solves many longstanding challenges associated with easily writing software that can run in parallel on any computer architecture. This work benefits projects seeking to write new C++ codes while also addressing the challenges of quickly making existing Fortran codes performant and portable over modern computer architectures with minimal syntactical changes from Fortran to C++. We demonstrate the feasibility of readily writing new C++ codes and modernizing existing codes with MATAR to be performant, parallel, and portable across diverse computer architectures.
引用
收藏
页数:24
相关论文
共 49 条
  • [1] MULTIDIMENSIONAL STAGGERED GRID RESIDUAL DISTRIBUTION SCHEME FOR LAGRANGIAN HYDRODYNAMICS
    Abgrall, Remi
    Lipnikov, Konstantin
    Morgan, Nathaniel
    Tokareva, Svetlana
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2020, 42 (01) : A343 - A370
  • [2] Arndt D., 2024, P 12 INT WORKSH OPEN, P1
  • [3] RAJA: Portable Performance for Large-Scale Scientific Applications
    Beckingsale, David Alexander
    Burmark, Jason
    Hornung, Rich
    Jones, Holger
    Killian, William
    Kunen, Adam J.
    Pearce, Olga
    Robinson, Peter
    Ryujin, Brian S.
    Scogland, Thomas R. W.
    [J]. PROCEEDINGS OF P3HPC 2019: 2019 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC), 2019, : 71 - 81
  • [4] NONLINEAR INTERACTIONS OF RANDOM WAVES IN A DISPERSIVE MEDIUM
    BENNEY, DJ
    SAFFMAN, PG
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1966, 289 (1418): : 301 - &
  • [5] BENNEY DJ, 1969, STUD APPL MATH, V48, P29
  • [6] A cell-centered Lagrangian Godunov-like method for solid dynamics
    Burton, D. E.
    Carney, T. C.
    Morgan, N. R.
    Sambasivan, S. K.
    Shashkov, M. J.
    [J]. COMPUTERS & FLUIDS, 2013, 83 : 33 - 47
  • [7] Chen TQ, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P579
  • [8] A 3D finite element ALE method using an approximate Riemann solution
    Chiravalle, V. P.
    Morgan, N. R.
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2017, 83 (08) : 642 - 663
  • [9] Chollet F., 2015, KERAS
  • [10] Diaz A, 2022, PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 2