COMMUNICATION OPTIMIZATIONS FOR IRREGULAR SCIENTIFIC COMPUTATIONS ON DISTRIBUTED-MEMORY ARCHITECTURES

被引:75
|
作者
DAS, R
UYSAL, M
SALTZ, J
HWANG, YS
机构
[1] Department of Computer Science, University of Maryland, College Park
[2] UMIACS, University of Maryland, College Park
关键词
D O I
10.1006/jpdc.1994.1104
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a number of optimizations that can be used to support the efficient execution of irregular problems on distributed memory parallel machines. These primitives (1) coordinate interprocessor data movement, (2) manage the storage of, and access to, copies of off-processor data, (3) minimize interprocessor communication requirements, and (4) support a shared name space. We present a detailed performance and scalability analysis of the communication primitives. This performance and scalability analysis is carried out using a workload generator, kernels from real applications, and a large unstructured adaptive application (the molecular dynamics code CHARMM). (C) 1994 Academic Press, Inc.
引用
收藏
页码:462 / 478
页数:17
相关论文
共 50 条
  • [31] AN EFFICIENT MAPPING OF BOLTZMANN MACHINE COMPUTATIONS ONTO DISTRIBUTED-MEMORY MULTIPROCESSORS
    OH, DH
    NANG, JH
    YOON, H
    MAENG, SR
    MICROPROCESSING AND MICROPROGRAMMING, 1992, 33 (04): : 223 - 236
  • [32] Parallelization of irregular out-of-core applications for distributed-memory systems
    Brezany, P
    Choudhary, A
    Dang, M
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1997, 1225 : 811 - 820
  • [33] Communication lower bounds for distributed-memory matrix multiplication
    Irony, D
    Toledo, S
    Tiskin, A
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (09) : 1017 - 1026
  • [34] DATA MANAGEMENT FOR A CLASS OF ITERATIVE COMPUTATIONS ON DISTRIBUTED-MEMORY MIMD SYSTEMS
    CORNEAHASEGAN, MC
    MARINESCU, DC
    ZHANG, ZY
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1994, 6 (03): : 205 - 229
  • [35] Compiler techniques for effective communication on distributed-memory multiprocessors
    Navarro, AG
    Paek, Y
    Zapata, EL
    Padua, D
    PROCEEDINGS OF THE 1997 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 1997, : 74 - 77
  • [36] COMMUNICATION OPTIMIZATION AND CODE GENERATION FOR DISTRIBUTED-MEMORY MACHINES
    AMARASINGHE, SP
    LAM, MS
    SIGPLAN NOTICES, 1993, 28 (06): : 126 - 138
  • [37] Parallel Out-of-Core MLFMA on Distributed-Memory Computer Architectures
    Hidayetoglu, Mert
    Gurel, Levent
    2015 COMPUTATIONAL ELECTROMAGNETICS INTERNATIONAL WORKSHOP (CEM'15), 2015, : 18 - 19
  • [38] GLB: A low-cost scheduling algorithm for distributed-memory architectures
    Radulescu, A
    van Gemund, AJC
    FIFTH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 1998, : 294 - 301
  • [39] Generating Efficient Data Movement Code for Heterogeneous Architectures with Distributed-Memory
    Dathathri, Roshan
    Reddy, Chandan
    Ramashekar, Thejas
    Bondhugula, Uday
    2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 375 - 386
  • [40] PARALLELIZING STRASSENS METHOD FOR MATRIX MULTIPLICATION ON DISTRIBUTED-MEMORY MIMD ARCHITECTURES
    CHOU, CC
    DENG, YF
    LI, G
    WANG, Y
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1995, 30 (02) : 49 - 69