An Efficient Parallel Algorithm for Evaluating Join Queries on Heterogeneous Distributed Systems

被引:0
作者
Hassan, M. Al Hajj [1 ]
Bamha, M. [1 ]
机构
[1] Univ Orleans, LIFO, F-45067 Orleans 2, France
来源
16TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), PROCEEDINGS | 2009年
关键词
SKEW;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Owing to the fast development of network technologies, executing parallel programs on distributed systems that connect heterogeneous machines became feasible but we still face some challenges: Workload imbalance in such environment may not only be due to uneven load distribution among machines as in parallel systems but also due to distribution that is not adequate with the characteristics of each machine. In this paper, we present a new parallel join algorithm for heterogeneous distributed architectures based on an efficient dynamic data distribution and task allocation which makes it insensitive to data skew and ensures perfect balancing properties during all stages of join computation. The performance of this algorithm is analyzed using the scalable and portable BSP (Bulk Synchronous Parallel) cost model. We show that our algorithm guarantees optimal complexity and near linear speed-up while reducing communication and disk input/output costs to a minimum.
引用
收藏
页码:350 / 358
页数:9
相关论文
共 25 条
  • [1] [Anonymous], QUERY PROCESSING PAR
  • [2] Bamha M, 2005, LECT NOTES COMPUT SC, V3588, P616
  • [3] Bamha M, 2005, LECT NOTES COMPUT SC, V3515, P755
  • [4] BAMHA M, 1999, J PARALLEL DISTRIBUT, V3, P333
  • [5] BAMHA M, 2000, LECT NOTES COMPUTER, V1873
  • [6] CARTER JL, 1979, J COMPUT SYST SCI, V18, P143, DOI 10.1016/0022-0000(79)90044-8
  • [7] IMPLICATIONS OF CERTAIN ASSUMPTIONS IN DATABASE PERFORMANCE EVALUATION
    CHRISTODOULAKIS, S
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 1984, 9 (02): : 163 - 186
  • [8] An adaptive parallel distributive join algorithm on a cluster of workstations
    Chung, SM
    Chatterjee, A
    [J]. JOURNAL OF SUPERCOMPUTING, 2002, 21 (01) : 5 - 35
  • [9] PARALLEL DATABASE-SYSTEMS - THE FUTURE OF HIGH-PERFORMANCE DATABASE-SYSTEMS
    DEWITT, D
    GRAY, J
    [J]. COMMUNICATIONS OF THE ACM, 1992, 35 (06) : 85 - 98
  • [10] DEWITT DJ, 1992, PROC INT CONF VERY L, P27