High-level Language Support for User-defined Reductions

被引:0
作者
Steven J. Deitz
Bradford L. Chamberlain
Lawrence Snyder
机构
[1] University of Washington,
来源
The Journal of Supercomputing | 2002年 / 23卷
关键词
user-defined reductions; parallel programming; high-level languages; scientific computing;
D O I
暂无
中图分类号
学科分类号
摘要
The optimized handling of reductions on parallel supercomputers or clusters of workstations is critical to high performance because reductions are common in scientific codes and a potential source of bottlenecks. Yet in many high-level languages, a mechanism for writing efficient reductions remains surprisingly absent. Further, when such mechanisms do exist, they often do not provide the flexibility a programmer needs to achieve a desirable level of performance. In this paper, we present a new language construct for arbitrary reductions that lets a programmer achieve a level of performance equal to that achievable with the highly flexible, but low-level combination of Fortran and MPI. We have implemented this construct in the ZPL language and evaluate it in the context of the initialization of the NAS MG benchmark. We show a 45 times speedup over the same code written in ZPL without this construct. In addition, performance on a large number of processors surpasses that achieved in the NAS implementation showing that our mechanism provides programmers with the needed flexibility.
引用
收藏
页码:23 / 37
页数:14
相关论文
共 34 条
  • [21] OpenMP as runtime for providing high-level stream parallelism on multi-cores
    Renato B. Hoffmann
    Júnior Löff
    Dalvan Griebler
    Luiz G. Fernandes
    The Journal of Supercomputing, 2022, 78 : 7655 - 7676
  • [22] FastPara: a high-level declarative data-parallel programming framework on clusters
    Mao, Yong
    Gu, Yunhong
    Chen, Jia
    Grossman, Robert L.
    PROCEEDINGS OF THE 18TH IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND SYSTEMS, 2006, : 321 - +
  • [23] High-level programming of massively parallel computers based on shared virtual memory
    Gerndt, M
    PARALLEL COMPUTING, 1998, 24 (3-4) : 383 - 400
  • [24] High-Level and Productive Stream Parallelism for Dedup, Ferret, and Bzip2
    Dalvan Griebler
    Renato B. Hoffmann
    Marco Danelutto
    Luiz G. Fernandes
    International Journal of Parallel Programming, 2019, 47 : 253 - 271
  • [25] High-Level and Productive Stream Parallelism for Dedup, Ferret, and Bzip2
    Griebler, Dalvan
    Hoffmann, Renato B.
    Danelutto, Marco
    Fernandes, Luiz G.
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 47 (02) : 253 - 271
  • [26] Development Effort and Performance Trade-off in High-Level Parallel Programming
    Legaux, Joeffrey
    Loulergue, Frederic
    Jubertie, Sylvain
    2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 162 - 169
  • [27] Malacoda: Towards High-Level Compilation of Network Security Applications on Reconfigurable Hardware
    Muehlbach, Sascha
    Koch, Andreas
    PROCEEDINGS OF THE EIGHTH ACM/IEEE SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS'12), 2012, : 247 - 257
  • [28] High-level and efficient structured stream parallelism for rust on multi-cores
    Pieper, Ricardo
    Loff, Junior
    Hoffmann, Renato B.
    Griebler, Dalvan
    Fernandes, Luiz G.
    JOURNAL OF COMPUTER LANGUAGES, 2021, 65
  • [29] A high level, algorithmic programming language and compiler for reconfigurable systems
    Hammes, JP
    Rinker, R
    Böhm, W
    Najjar, WA
    Draper, B
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 135 - 141
  • [30] High-Level Programming for Many-Cores Using C++14 and the STL
    Haidl, Michael
    Gorlatch, Sergei
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, 46 (01) : 23 - 41