An optimal and processor efficient parallel sorting algorithm on a linear array with a reconfigurable pipelined bus system

被引:3
作者
He, Min [1 ]
Wu, Xiaolong [1 ]
Zheng, Si Qing [2 ]
机构
[1] Calif State Univ Long Beach, Dept Comp Engn & Comp Sci, Long Beach, CA 90840 USA
[2] Univ Texas Dallas, Erik Jonsson Sch Engn & Comp Sci, Dept Comp Sci, Richardson, TX 75080 USA
关键词
Sorting; Parallel algorithms; Optical interconnection; Parallel computing; MATRIX MULTIPLICATION; NEAREST-NEIGHBOR; SELECTION;
D O I
10.1016/j.compeleceng.2008.11.020
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Optical interconnections attract many engineers and scientists' attention due to their potential for gigahertz transfer rates and concurrent access to the bus in a pipelined fashion. These unique characteristics of optical interconnections give us the opportunity to reconsider traditional algorithms designed for ideal parallel computing models, such as PRAMs. Since the PRAM model is far from practice, not all algorithms designed on this model can be implemented on a realistic parallel computing system. From this point of view, we study Cole's pipelined merge sort [Cole R. Parallel merge sort. SIAM J Comput 1988;14:770-85] on the CREW PRAM and extend it in an innovative way to an optical interconnection model, the LARPBS (Linear Array with Reconfigurable Pipelined Bus System) model [Pan Y, Li K. Linear array with a reconfigurable pipelined bus system-concepts and applications. J Inform Sci 1998;106:237-58]. Although Cole's algorithm is optimal, communication details have not been provided due to the fact that it is designed for a PRAM. We close this gap in our sorting algorithm on the LARPBS model and obtain an O(logN)-time optimal sorting algorithm using O(N) processors. This is a substantial improvement over the previous best sorting algorithm on the LARPBS model that runs in O(logNloglogN) worst-case time using N processors [Datta A, Soundaralakshmi S, Owens R. Fast sorting algorithms on a linear array with a reconfigurable pipelined bus system. IEEE Trans Parallel Distribut Syst 2002;13(3):212-22]. Our solution allows efficiently assign and reuse processors. We also discover two new properties of Cole's sorting algorithm that are presented as lemmas in this paper. Published by Elsevier Ltd.
引用
收藏
页码:951 / 965
页数:15
相关论文
共 40 条
[1]  
AJTAI M, 1983, P 15 ANN ACM S THEOR, P15
[2]  
[Anonymous], 1978, P 10 ANN ACM S THEOR, DOI [DOI 10.1145/800133.804339, 10.1145/800133.804339]
[3]  
AROCK M, 2006, P INT C ADV COMP COM, P12
[4]  
AROCK M, 2005, P 3 AS APPL COMP C, P68
[5]   Constant time fault tolerant algorithms for a linear array with a reconfigurable pipelined bus system [J].
Bourgeois, AG ;
Pan, Y ;
Prasad, SK .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2005, 65 (03) :374-381
[6]   Scalable and efficient parallel algorithms for Euclidean Distance Transform on the LARPBS model [J].
Chen, L ;
Pan, Y ;
Xu, XH .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2004, 15 (11) :975-982
[7]   PARALLEL MERGE SORT [J].
COLE, R .
SIAM JOURNAL ON COMPUTING, 1988, 17 (04) :770-785
[8]   A parameterized linear array with a reconfigurable pipelined bus system: LARPBS(p) [J].
d'Auriol, BJ ;
Molakaseema, R .
COMPUTER JOURNAL, 2005, 48 (01) :115-125
[9]   Fast sorting algorithms on a linear array with a reconfigurable pipelined bus system [J].
Datta, A ;
Soundaralakshmi, S ;
Owens, R .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2002, 13 (03) :212-222
[10]  
DAURIOL B, 2008, J SUPERCOMPUT 0729