COMBINING JOIN AND SEMI-JOIN OPERATIONS FOR DISTRIBUTED QUERY-PROCESSING

被引:21
作者
CHEN, MS
YU, PS
机构
[1] IBM T.J. Watson Research Center, Yorktown Heights, NY
关键词
DISTRIBUTED QUERY PROCESSING; GAINFUL SEMIJOINS; PURE JOIN ATTRIBUTES; A-ASTERISK-ALGORITHM;
D O I
10.1109/69.224205
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we explore the approach to applying a combination of join and semi-join operations to minimize the amount of data transmission required for distributed query processing. Specifically, we identify and exploit two important concepts which occur with the use of join operations as reducers in query processing, namely, gainful semi-joins and pure join attributes. Some semi-joins, though not profitable themselves, may benefit the execution of subsequent join operations, and become profitable owing to the use of join operations as reducers. Such a semi-join is termed a gainful semi-join. In addition, join attributes which are not part of the output attributes are referred to as pure join attributes. We shall not only exploit the usefulness of gainful semi-joins, but also utilize the removability of pure join attributes to reduce the amount of data transmission required for query processing. Moreover, in light of the two concepts, heuristic searches are developed to determine a sequence of join and semi-join reducers for query processing. Our results show the importance of the approach to combining joins and semi-joins for distributed query processing.
引用
收藏
页码:534 / 542
页数:9
相关论文
共 26 条
[1]   OPTIMIZATION ALGORITHMS FOR DISTRIBUTED QUERIES [J].
APERS, PMG ;
HEVNER, AR ;
YAO, SB .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1983, 9 (01) :57-68
[2]   QUERY-PROCESSING IN A SYSTEM FOR DISTRIBUTED DATABASES (SDD-1) [J].
BERNSTEIN, PA ;
GOODMAN, N ;
WONG, E ;
REEVE, CL ;
ROTHNIE, JB .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1981, 6 (04) :602-625
[3]   USING SEMI-JOINS TO SOLVE RELATIONAL QUERIES [J].
BERNSTEIN, PA ;
CHIU, DMW .
JOURNAL OF THE ACM, 1981, 28 (01) :25-40
[4]  
Ceri S., 1985, DISTRIBUTED DATABASE
[5]  
CHEN ALP, 1984, 10TH P INT C VER LAR, P429
[6]   INTERLEAVING A JOIN SEQUENCE WITH SEMIJOINS IN DISTRIBUTED QUERY-PROCESSING [J].
CHEN, MS ;
YU, PS .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1992, 3 (05) :611-621
[7]  
CHEN MS, 1989, IBM RC14788 RES REP
[8]  
CHEN MS, 1990, 2ND P INT S DAT PAR, P116
[9]   OPTIMIZING CHAIN QUERIES IN A DISTRIBUTED DATABASE SYSTEM [J].
CHIU, DM ;
BERNSTEIN, PA ;
HO, YC .
SIAM JOURNAL ON COMPUTING, 1984, 13 (01) :116-134
[10]  
GARDY D, 1989, ACM T DATABASE SYST, V14, P574, DOI 10.1145/76902.76907