Optimizing large join queries using a graph-based approach

被引:19
作者
Lee, C [1 ]
Shih, CS
Chen, YH
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[3] Natl Chiayi Univ, Dept Informat & Comp Sci, Chiayi 60054, Taiwan
关键词
large join query; graph theory; join cost; query tree; query optimization;
D O I
10.1109/69.917567
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although many query tree optimization strategies have been proposed in the literature, there still is a lack of a formal and complete representation of all possible permutations of query operations (i.e., execution plans) in a uniform manner. A graph-theoretic approach presented in this paper provides a sound mathematical basis for representing a query and searching for an execution plan. In this graph model, a node represents an operation and a directed edge between two nodes indicates the order of executing these two operations in an execution plan. Each node is associated with a weight and so is an edge. The weight is an expression containing optimization required parameters, such as relation size, tuple size, join selectivity factors. All possible execution plans are representable in this graph and each spanning tree of the graph becomes an execution plan, It is a general model which can be used in the optimizer of a DBMS for interal query representation. On the basis of this model, we devise an algorithm that finds a near optimal execution plan using only polynomial time. The algorithm is compared with a few other popular optimization methods. Experiments show that the proposed algorithm is superior to the others under most circumstances.
引用
收藏
页码:298 / 315
页数:18
相关论文
共 30 条
[1]  
[Anonymous], QUERY PROCESSING PAR
[2]  
Chartrand G., 1993, Applied and algorithmic graph theory
[3]  
CHEN MS, 1992, PROC INT CONF VERY L, P15
[4]  
DEWITT DJ, 1984, P ACM SIGMOD INT C M, P1
[5]  
HOU WC, 1992, P INT C VER LARG DAT, P278
[6]  
Hua K. A., 1993, Proceedings of the Second International Conference on Parallel and Distributed Information Systems (Cat. No.93TH0493-7), P74, DOI 10.1109/PDIS.1993.253068
[7]  
HUA KA, 1991, PROC INT CONF VERY L, P525
[8]   DATA PARTITIONING FOR MULTICOMPUTER DATABASE-SYSTEMS - A CELL-BASED APPROACH [J].
HUA, KA ;
LEE, C ;
YOUNG, HC .
INFORMATION SYSTEMS, 1993, 18 (05) :329-342
[9]   Dynamic load balancing in multicomputer database systems using partition tuning [J].
Hua, KA ;
Lee, C ;
Hua, CM .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1995, 7 (06) :968-983
[10]  
HUA KA, 1990, P INT C VER LARG DAT