Vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines

被引:2
|
作者
Bae, S [1 ]
Kim, D [1 ]
Ranka, S [1 ]
机构
[1] ETRI, Parallel Programming Sect, Taejon, South Korea
关键词
D O I
10.1109/IPPS.1998.669934
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Vector prefix and reduction are collective communication primitives in which all processors must cooperate. We present two parallel algorithms, the direct algorithm and the split algorithm, for vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines. Our algorithms are relatively architecture independent and can be used effectively in many applications such as Pack/Unpack, Array Prefix/Reduction Functions, and Array Combining Scatter Functions, which are defined in Fortran 90 and in High Performance Fortran. Experimental results on the CM-5 are presented.
引用
收藏
页码:321 / 325
页数:5
相关论文
共 50 条
  • [1] PACK/UNPACK on coarse-grained distributed memory parallel machines
    Bae, S
    Ranka, S
    10TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM - PROCEEDINGS OF IPPS '96, 1996, : 320 - 324
  • [2] PACK/UNPACK on Coarse-Grained Distributed Memory Parallel Machines
    Bae, S.
    Ranka, S.
    Journal of Parallel and Distributed Computing, 38 (02):
  • [3] PACK/UNPACK on coarse-grained distributed memory parallel machines
    Bae, SJ
    Ranka, S
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 38 (02) : 204 - 216
  • [4] PARALLEL COMPUTATION OF GROBNER BASES ON DISTRIBUTED-MEMORY MACHINES
    SAWADA, H
    TERASAKI, S
    AIBA, A
    JOURNAL OF SYMBOLIC COMPUTATION, 1994, 18 (03) : 207 - 222
  • [5] The complexity of parallel multisearch on coarse-grained machines
    Bäumker, A
    Dittrich, W
    Pietracaprina, A
    ALGORITHMICA, 1999, 24 (3-4) : 209 - 242
  • [6] The Complexity of Parallel Multisearch on Coarse-Grained Machines
    A. Bäumker
    W. Dittrich
    A. Pietracaprina
    Algorithmica, 1999, 24 : 209 - 242
  • [7] Integer sorting algorithms for coarse-grained parallel machines
    Alsabti, K
    Ranka, S
    FOURTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING, PROCEEDINGS, 1997, : 159 - 164
  • [8] PARALLEL TALBOT ALGORITHM FOR DISTRIBUTED-MEMORY MACHINES
    DEROSA, MA
    GIUNTA, G
    RIZZARDI, M
    PARALLEL COMPUTING, 1995, 21 (05) : 783 - 801
  • [9] Building large phylogenetic trees on coarse-grained parallel machines
    Keane, Thomas M.
    Page, Andrew J.
    Naughton, Thomas J.
    Travers, Simon A. A.
    McInerney, James O.
    ALGORITHMICA, 2006, 45 (03) : 285 - 300
  • [10] C3: A Parallel Model for Coarse-Grained Machines
    J Parallel Distrib Comput, 2 (139):