Vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines

被引：2

作者：

Bae, S ^{[1
]}

Kim, D ^{[1
]}

Ranka, S ^{[1
]}

机构：

[1] ETRI, Parallel Programming Sect, Taejon, South Korea

来源：

FIRST MERGED INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING | 1998年

关键词：

D O I：

10.1109/IPPS.1998.669934

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Vector prefix and reduction are collective communication primitives in which all processors must cooperate. We present two parallel algorithms, the direct algorithm and the split algorithm, for vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines. Our algorithms are relatively architecture independent and can be used effectively in many applications such as Pack/Unpack, Array Prefix/Reduction Functions, and Array Combining Scatter Functions, which are defined in Fortran 90 and in High Performance Fortran. Experimental results on the CM-5 are presented.

引用

页码：321 / 325

页数：5

共 50 条

[11] Building Large Phylogenetic Trees on Coarse-Grained Parallel Machines
Thomas M. Keane
Andrew J. Page
Thomas J. Naughton
Simon A.A. Travers
James O. McInerney
Algorithmica, 2006, 45 : 285 - 300
[12] Solving large FPT problems on coarse-grained parallel machines
Cheetham, J
Dehne, F
Rau-Chaplin, A
Stege, U
Taillon, PJ
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 67 (04) : 691 - 706
[13] C-3: A parallel model for coarse-grained machines
Hambrusch, SE
Khokhar, AA
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 32 (02) : 139 - 154
[14] Parallel FP-LAPW for distributed-memory machines
Dohmen, R
Pichlmeier, J
Petersen, M
Wagner, F
Scheffler, M
COMPUTING IN SCIENCE & ENGINEERING, 2001, 3 (04) : 18 - 29
[15] A PARALLEL VECTOR EQUATION SOLVER FOR DISTRIBUTED-MEMORY COMPUTERS
QIN, JN
NGUYEN, DT
COMPUTING SYSTEMS IN ENGINEERING, 1994, 5 (01): : 19 - 25
[16] Coarse-grained distributed parallel programming interface for grid computing
Wu, YW
Wang, Q
Yang, GW
Zheng, WM
GRID AND COOPERATIVE COMPUTING, PT 1, 2004, 3032 : 255 - 258
[17] An interleaving transformation for parallelizing reductions for distributed-memory parallel machines
Wu, JJ
JOURNAL OF SUPERCOMPUTING, 2000, 15 (03): : 321 - 339
[18] Fast parallel FFT on CTaiJi: A coarse-grained reconfigurable computation platform
Song, LG
Jiang, YX
PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, 2005, 3758 : 188 - 195
[19] An Interleaving Transformation for Parallelizing Reductions for Distributed-Memory Parallel Machines
Jan-Jan Wu
The Journal of Supercomputing, 2000, 15 : 321 - 339
[20] Lifting sequential graph algorithms for distributed-memory parallel computation
Gregor, D
Lumsdaine, A
ACM SIGPLAN NOTICES, 2005, 40 (10) : 423 - 437

← 1 2 3 4 5 →