Portable Node-Level Parallelism for the PGAS Model

被引:0
|
作者
Jungblut, Pascal [1 ]
Fuerlinger, Karl [1 ]
机构
[1] Ludwig Maximilians Univ LMU Munchen, Dept Comp Sci, MNM Team, Oettingenstr 67, D-80538 Munich, Germany
关键词
PGAS; Parallel computing; Programming models;
D O I
10.1007/s10766-021-00718-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Partitioned Global Address Space (PGAS) programming model brings intuitive shared memory semantics to distributed memory systems. Even with an abstract and unifying virtual global address space it is, however, challenging to use the full potential of different systems. Without explicit support by the implementation node-local operations have to be optimized manually for each architecture. A goal of this work is to offer a user-friendly programming model that provides portable performance across systems. In this paper we present an approach to integrate node-level programming abstractions with the PGAS programming model. We describe the hierarchical data distribution with local patterns and our implementation, MEPHISTO, in C++ using two existing projects. The evaluation of MEPHISTO shows that our approach achieves portable performance while requiring only minimal changes to port it from a CPU-based system to a GPU-based one using a CUDA or HIP back-end.
引用
收藏
页码:867 / 885
页数:19
相关论文
共 38 条
  • [31] Loop Level Speculation in a Task Based Programming Model
    Gayatri, Rahulkumar
    Badia, Rosa. M.
    Aygaude, Eduard
    2013 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2013, : 39 - 48
  • [32] Shortening design time through multiplatform simulations with a portable OpenCL golden-model: the LDPC decoder case
    Falcao, G.
    Owaida, M.
    Novo, D.
    Purnaprajna, M.
    Bellas, N.
    Antonopoulos, C. D.
    Karakonstantis, G.
    Burg, A.
    Ienne, P.
    2012 IEEE 20TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2012, : 224 - 231
  • [33] Research on the Identity Authentication Mechanism of Computing Node in Medical High Performance Distributed Parallel Computing Model
    Hu, Yanmei
    Yang, Mu
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON SENSOR NETWORK AND COMPUTER ENGINEERING, 2016, 68 : 515 - 519
  • [34] Multichannel Phase Unwrapping: Problem Topology and Dual-Level Parallel Computational Model
    Imperatore, Pasquale
    Pepe, Antonio
    Lanari, Riccardo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (10): : 5774 - 5793
  • [35] GPGPU-based Highly Parallelized 3D Node Localization for Real-Time 3D Model Reproduction
    Hirosue, Kazuki
    Ukawa, Shohei
    Itoh, Yuichi
    Onoye, Takao
    Hashimoto, Masanori
    IUI'17: PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2017, : 173 - 178
  • [36] Detection Method of DC Microgrid Network Attack Based on Two-level and Multi-segment Model
    Liren Zou
    Wireless Personal Communications, 2022, 127 : 1665 - 1681
  • [37] Detection Method of DC Microgrid Network Attack Based on Two-level and Multi-segment Model
    Zou, Liren
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 127 (02) : 1665 - 1681
  • [38] Hardware-Software-Co-Design of Parallel and Distributed Systems Using a Behavioural Programming and Multi-Process Model with High-Level Synthesis
    Bosse, Stefan
    VLSI CIRCUITS AND SYSTEMS V, 2011, 8067