COMMUNICATION OPTIMIZATIONS FOR IRREGULAR SCIENTIFIC COMPUTATIONS ON DISTRIBUTED-MEMORY ARCHITECTURES

被引:75
|
作者
DAS, R
UYSAL, M
SALTZ, J
HWANG, YS
机构
[1] Department of Computer Science, University of Maryland, College Park
[2] UMIACS, University of Maryland, College Park
关键词
D O I
10.1006/jpdc.1994.1104
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a number of optimizations that can be used to support the efficient execution of irregular problems on distributed memory parallel machines. These primitives (1) coordinate interprocessor data movement, (2) manage the storage of, and access to, copies of off-processor data, (3) minimize interprocessor communication requirements, and (4) support a shared name space. We present a detailed performance and scalability analysis of the communication primitives. This performance and scalability analysis is carried out using a workload generator, kernels from real applications, and a large unstructured adaptive application (the molecular dynamics code CHARMM). (C) 1994 Academic Press, Inc.
引用
收藏
页码:462 / 478
页数:17
相关论文
共 50 条
  • [21] An algorithmic framework for parallelizing vision computations on distributed-memory machines
    Chung, Y
    1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 160 - 165
  • [22] Optimizing I/O for irregular applications on distributed-memory machines
    Carretero, J
    No, J
    Choudhary, A
    PARALLEL COMPUTATION, 1999, 1557 : 470 - 479
  • [23] PARALLEL RENDERING OF VOLUMETRIC DATA SET ON DISTRIBUTED-MEMORY ARCHITECTURES
    MONTANI, C
    PEREGO, R
    SCOPIGNO, R
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1993, 5 (02): : 153 - 167
  • [24] ADAPTING FORTRAN 90 ARRAY PROGRAMS FOR DISTRIBUTED-MEMORY ARCHITECTURES
    MERLIN, JH
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 591 : 184 - 200
  • [25] Parallelizing RRT on Large-Scale Distributed-Memory Architectures
    Devaurs, Didier
    Simeon, Thierry
    Cortes, Juan
    IEEE TRANSACTIONS ON ROBOTICS, 2013, 29 (02) : 571 - 579
  • [26] Compiling Affine Loop Nests for Distributed-Memory Parallel Architectures
    Bondhugula, Uday
    2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
  • [27] Efficient Lagrangian particle tracking algorithms for distributed-memory architectures
    Baldan, Giacomo
    Bellosta, Tommaso
    Guardone, Alberto
    COMPUTERS & FLUIDS, 2023, 256
  • [28] MAPPING UNIFORM LOOP NESTS ONTO DISTRIBUTED-MEMORY ARCHITECTURES
    DARTE, A
    ROBERT, Y
    PARALLEL COMPUTING, 1994, 20 (05) : 679 - 710
  • [29] Evolution-Based Scheduling of Computations and Communications on Distributed-Memory Multicomputers
    Al-Mouhamed, Mayez
    Computer Journal, 42 (05): : 373 - 389
  • [30] Evolution-based scheduling of computations and communications on distributed-memory multicomputers
    Al-Mouhamed, M
    COMPUTER JOURNAL, 1999, 42 (05): : 373 - 390