Efficient and Scalable Parallel Algorithm for Sorting Multisets on Multi-core Systems

被引:3
作者
Zhong, Cheng [1 ]
Qu, Zeng-Yan [1 ]
Yang, Feng [1 ]
Yin, Meng-Xiao [1 ]
Li, Xia [1 ]
机构
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning, Peoples R China
基金
中国国家自然科学基金;
关键词
Multisets Sorting; Parallel algorithms; Multi-core computers; Heterogeneous clusters; Multi-level cache; Shared L2 cache; Thread-level parallelism; Selection; Aperiodic multi-round distribution;
D O I
10.4304/jcp.7.1.30-41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
By distributing adaptively the data blocks to the processing cores to balance their computation loads and applying the strategy of "the extremum of the extremums" to select the data with the same keys, a cache-efficient and thread-level parallel algorithm for sorting Multisets on the multi-core computers is proposed. For the sorting Multisets problem, an aperiodic multi-round data distribution model is presented, which the first round scheduling assigns data blocks into the slave multi-core nodes according to the given distribution order and the other rounds scheduling will distribute data blocks into the slave multi-core nodes by first request first distribution strategy. The scheduling technique can ensure that each slave node can receive the next required data block before it finishes sorting the current data block in its own main memory. A hybrid thread-level and process-level parallel algorithm for sorting Multisets is presented on the heterogeneous cluster with multi-core nodes which have different amount of processing cores, different computation and communication capabilities and distinct size of main memory. The experimental results on the single multi-core computer and the heterogeneous cluster with multi-core computers show that the presented parallel sorting Multisets algorithms are efficient and they obtain good speedup and scalability.
引用
收藏
页码:30 / 41
页数:12
相关论文
共 16 条
[1]  
[Anonymous], 2007, PROC INT C PARALLEL, DOI DOI 10.1109/PACT.2007.12
[2]  
[Anonymous], 2007, P C VERY LARGE DATA
[3]  
Chen Guoliang, 2009, DESIGN ANAL PARALLEL
[4]  
Cheng Zhong, 2010, Proceedings of the Third International Symposium on Parallel Architectures, Algorithms and Programming (PAAP 2010), P247, DOI 10.1109/PAAP.2010.31
[5]  
Cheng Zhong, 2003, J COMPUTER RES DEV, V40, P335
[6]  
Chhugani J, 2008, PROC VLDB ENDOW, V1, P1313
[7]  
Govindaraju N., 2006, P 2006 ACM SIGMOD IN, P325, DOI DOI 10.1145/1142473.1142511
[8]  
GREB A, 2006, P 2006 PAR DISTR PRO, P25
[9]  
Keller J., 2008, P 14 INT EURO C PAR, P26
[10]  
Purcell T. J., 2003, P ACM SIGGRAPH EUROG, P41