BALANCED PARALLEL SORT ON HYPERCUBE MULTIPROCESSORS

被引:14
作者
ABALI, B
OZGUNER, F
BATAINEH, A
机构
[1] CRAY RES INC,EAGAN,MN 55121
[2] OHIO STATE UNIV,DEPT ELECT ENGN,COLUMBUS,OH 43210
关键词
HYPERCUBE; MULTIPROCESSING; PARALLEL ALGORITHMS; SELECTION; SORTING;
D O I
10.1109/71.224220
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A parallel sorting algorithm for sorting n elements evenly distributed over 2d = p nodes of a d-dimensional hypercube is presented. The average running time of the algorithm is O((n log n)/p + p log2 n). The algorithm maintains a perfect load balance in the nodes by determining the (kn/p)th elements (k = 1, ..., (p - 1)) of the final sorted list in advance. These p - 1 keys are used to partition the sorted sublists in each node to redistribute data to the nodes to be merged in parallel. The nodes finish the sort with an equal number of elements (n/p) regardless of the data distribution. A parallel selection algorithm for determining the balanced partition keys in 0(p log2 n) time is presented. The speed of the sorting algorithm is further enhanced by the distance-d communication capability of the iPSC/2 hypercube computer and a novel conflict-free routing algorithm. Experimental results on a 16-node hypercube computer show that the new sorting algorithm is competitive with the previous algorithms and faster for skewed data distributions.
引用
收藏
页码:572 / 581
页数:10
相关论文
共 21 条
[1]   ITERATIVE ALGORITHMS FOR SOLUTION OF LARGE SPARSE SYSTEMS OF LINEAR-EQUATIONS ON HYPERCUBES [J].
AYKANAT, C ;
OZGUNER, F ;
ERCAL, F ;
SADAYAPPAN, P .
IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (12) :1554-1568
[2]  
BATCHER KE, 1968, 1968 P AFIPS SPRING, P307
[3]  
Fox G.C., 1988, SOLVING PROBLEMS CON, V1
[4]  
FOX GC, 1986, CCP314 CALTECH
[5]   THE COMPLEXITY OF SELECTION AND RANKING IN X + Y AND MATRICES WITH SORTED COLUMNS [J].
FREDERICKSON, GN ;
JOHNSON, DB .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1982, 24 (02) :197-208
[6]   FAST SELECTION ALGORITHM AND THE PROBLEM OF OPTIMUM DISTRIBUTION OF EFFORT [J].
GALIL, Z ;
MEGIDDO, N .
JOURNAL OF THE ACM, 1979, 26 (01) :58-64
[7]  
Ibaraki T, 1988, RESOURCE ALLOCATION, V1st
[8]  
Johnsson S. L., 1984, Proceedings of the 1984 International Conference on Parallel Processing (Cat. No. 84CH2045-3), P444
[9]   OPTIMUM BROADCASTING AND PERSONALIZED COMMUNICATION IN HYPERCUBES [J].
JOHNSSON, SL ;
HO, CT .
IEEE TRANSACTIONS ON COMPUTERS, 1989, 38 (09) :1249-1268
[10]  
Knuth D.E., 1972, ART COMPUTER PROGRAM, V3