Vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines

被引:2
|
作者
Bae, S [1 ]
Kim, D [1 ]
Ranka, S [1 ]
机构
[1] ETRI, Parallel Programming Sect, Taejon, South Korea
关键词
D O I
10.1109/IPPS.1998.669934
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Vector prefix and reduction are collective communication primitives in which all processors must cooperate. We present two parallel algorithms, the direct algorithm and the split algorithm, for vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines. Our algorithms are relatively architecture independent and can be used effectively in many applications such as Pack/Unpack, Array Prefix/Reduction Functions, and Array Combining Scatter Functions, which are defined in Fortran 90 and in High Performance Fortran. Experimental results on the CM-5 are presented.
引用
收藏
页码:321 / 325
页数:5
相关论文
共 50 条
  • [11] Building Large Phylogenetic Trees on Coarse-Grained Parallel Machines
    Thomas M. Keane
    Andrew J. Page
    Thomas J. Naughton
    Simon A.A. Travers
    James O. McInerney
    Algorithmica, 2006, 45 : 285 - 300
  • [12] Solving large FPT problems on coarse-grained parallel machines
    Cheetham, J
    Dehne, F
    Rau-Chaplin, A
    Stege, U
    Taillon, PJ
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 67 (04) : 691 - 706
  • [13] C-3: A parallel model for coarse-grained machines
    Hambrusch, SE
    Khokhar, AA
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 32 (02) : 139 - 154
  • [14] Parallel FP-LAPW for distributed-memory machines
    Dohmen, R
    Pichlmeier, J
    Petersen, M
    Wagner, F
    Scheffler, M
    COMPUTING IN SCIENCE & ENGINEERING, 2001, 3 (04) : 18 - 29
  • [15] A PARALLEL VECTOR EQUATION SOLVER FOR DISTRIBUTED-MEMORY COMPUTERS
    QIN, JN
    NGUYEN, DT
    COMPUTING SYSTEMS IN ENGINEERING, 1994, 5 (01): : 19 - 25
  • [16] Coarse-grained distributed parallel programming interface for grid computing
    Wu, YW
    Wang, Q
    Yang, GW
    Zheng, WM
    GRID AND COOPERATIVE COMPUTING, PT 1, 2004, 3032 : 255 - 258
  • [17] An interleaving transformation for parallelizing reductions for distributed-memory parallel machines
    Wu, JJ
    JOURNAL OF SUPERCOMPUTING, 2000, 15 (03): : 321 - 339
  • [18] Fast parallel FFT on CTaiJi: A coarse-grained reconfigurable computation platform
    Song, LG
    Jiang, YX
    PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, 2005, 3758 : 188 - 195
  • [19] An Interleaving Transformation for Parallelizing Reductions for Distributed-Memory Parallel Machines
    Jan-Jan Wu
    The Journal of Supercomputing, 2000, 15 : 321 - 339
  • [20] Lifting sequential graph algorithms for distributed-memory parallel computation
    Gregor, D
    Lumsdaine, A
    ACM SIGPLAN NOTICES, 2005, 40 (10) : 423 - 437