On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU Architectures

被引：1

作者：

Morgan, Nathaniel ^{[1
]}

Yenusah, Caleb ^{[2
]}

Diaz, Adrian ^{[3
]}

Dunning, Daniel ^{[1
]}

Moore, Jacob ^{[3
,6
]}

Heilman, Erin ^{[3
]}

Roth, Calvin ^{[3
,7
]}

Lieberman, Evan ^{[3
]}

Walton, Steven ^{[2
]}

Brown, Sarah ^{[1
]}

Holladay, Daniel ^{[4
]}

Knezevic, Marko ^{[5
]}

Whetstone, Gavin ^{[3
,8
]}

Baker, Zachary ^{[1
,9
]}

Robey, Robert ^{[3
,10
]}

机构：

[1] Los Alamos Natl Lab, Engn Technol & Design Div, Los Alamos, NM 87545 USA

[2] Los Alamos Natl Lab, Theoret Div, Los Alamos, NM 87545 USA

[3] Los Alamos Natl Lab, Computat Phys Div, Los Alamos, NM 87545 USA

[4] Los Alamos Natl Lab, Comp Computat & Stat Sci Div, Los Alamos, NM 87545 USA

[5] Univ New Hampshire, Dept Mech Engn, Durham, NH 03824 USA

[6] Mississippi State Univ, Mississippi State, MS USA

[7] Univ Minnesota, Minneapolis, MN USA

[8] Texas A&M Univ, College Stn, TX USA

[9] Univ Colorado, Boulder, CO USA

[10] AMD Corp, Santa Clara, CA USA

来源：

INFORMATION | 2024年 / 15卷 / 11期

关键词：

performance; portability; productivity; GPUs; dense and sparse data; fine-grained parallelism; GALERKIN HYDRODYNAMIC METHOD; SCHEME;

D O I：

10.3390/info15110673

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents software advances to easily exploit computer architectures consisting of a multi-core CPU and CPU+GPU to accelerate diverse types of high-performance computing (HPC) applications using a single code implementation. The paper describes and demonstrates the performance of the open-source C++ matrix and array (MATAR) library that uniquely offers: (1) a straightforward syntax for programming productivity, (2) usable data structures for data-oriented programming (DOP) for performance, and (3) a simple interface to the open-source C++ Kokkos library for portability and memory management across CPUs and GPUs. The portability across architectures with a single code implementation is achieved by automatically switching between diverse fine-grained parallelism backends (e.g., CUDA, HIP, OpenMP, pthreads, etc.) at compile time. The MATAR library solves many longstanding challenges associated with easily writing software that can run in parallel on any computer architecture. This work benefits projects seeking to write new C++ codes while also addressing the challenges of quickly making existing Fortran codes performant and portable over modern computer architectures with minimal syntactical changes from Fortran to C++. We demonstrate the feasibility of readily writing new C++ codes and modernizing existing codes with MATAR to be performant, parallel, and portable across diverse computer architectures.

引用

页数：24

共 49 条

[1] MULTIDIMENSIONAL STAGGERED GRID RESIDUAL DISTRIBUTION SCHEME FOR LAGRANGIAN HYDRODYNAMICS
Abgrall, Remi
Lipnikov, Konstantin
Morgan, Nathaniel
Tokareva, Svetlana
[J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2020, 42 (01) : A343 - A370
[2] Arndt D., 2024, P 12 INT WORKSH OPEN, P1
[3] RAJA: Portable Performance for Large-Scale Scientific Applications
Beckingsale, David Alexander
Burmark, Jason
Hornung, Rich
Jones, Holger
Killian, William
Kunen, Adam J.
Pearce, Olga
Robinson, Peter
Ryujin, Brian S.
Scogland, Thomas R. W.
[J]. PROCEEDINGS OF P3HPC 2019: 2019 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC), 2019, : 71 - 81
[4] NONLINEAR INTERACTIONS OF RANDOM WAVES IN A DISPERSIVE MEDIUM
BENNEY, DJ
SAFFMAN, PG
[J]. PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1966, 289 (1418): : 301 - &
[5] BENNEY DJ, 1969, STUD APPL MATH, V48, P29
[6] A cell-centered Lagrangian Godunov-like method for solid dynamics
Burton, D. E.
Carney, T. C.
Morgan, N. R.
Sambasivan, S. K.
Shashkov, M. J.
[J]. COMPUTERS & FLUIDS, 2013, 83 : 33 - 47
[7] Chen TQ, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P579
[8] A 3D finite element ALE method using an approximate Riemann solution
Chiravalle, V. P.
Morgan, N. R.
[J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2017, 83 (08) : 642 - 663
[9] Chollet F., 2015, KERAS
[10] Diaz A, 2022, PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 2

← 1 2 3 4 5 →