Graph Pattern Matching: A Join/Semijoin Approach

被引:15
作者
Cheng, Jiefeng [1 ]
Yu, Jeffrey Xu [2 ]
Yu, Philip S. [3 ]
机构
[1] Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China
[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
关键词
Graph matching; 2-hop labeling; reachability joins; join/semijoin processing; QUERY OPTIMIZATION; DATABASES; JOIN;
D O I
10.1109/TKDE.2010.169
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to rapid growth of the Internet and new scientific/technological advances, there exist many new applications that model data as graphs, because graphs have sufficient expressiveness to model complicated structures. The dominance of graphs in real-world applications demands new graph processing techniques to access large data graphs effectively and efficiently. In this paper, we study a graph pattern matching problem, which is to find all patterns in a large data graph that match a user-given graph pattern. We propose new two-step R-join (reachability join) algorithms with a filter step (R-semijoin) and a fetch step (R-join) by utilizing a new cluster-based join index with graph codes in a relational database context. We also propose two optimization approaches to further optimize sequences of R-joins/R-semijoins. The first approach is based on R-join order selection followed by R-semijoin enhancement, and the second approach is to interleave R-joins with R-semijoins. We conducted extensive performance studies, and confirm the efficiency of our proposed new approaches.
引用
收藏
页码:1006 / 1021
页数:16
相关论文
共 38 条
[1]  
Abiteboul S., 1999, DATA WEB RELATIONS S
[2]  
ABITEBOUL S, 1995, FOUNDATIONS DATABASE
[3]  
AGRAWAL R, 1989, P ACM SIGMOD
[4]  
[Anonymous], P ACM S PRINC DAT SY
[5]  
[Anonymous], P 35 INT C VER LARG
[6]  
[Anonymous], PROCEEDINGS OF THE 1
[7]  
BANCILHON F, 1986, P ACM SIGMOD
[8]   QUERY-PROCESSING IN A SYSTEM FOR DISTRIBUTED DATABASES (SDD-1) [J].
BERNSTEIN, PA ;
GOODMAN, N ;
WONG, E ;
REEVE, CL ;
ROTHNIE, JB .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1981, 6 (04) :602-625
[9]   USING SEMI-JOINS TO SOLVE RELATIONAL QUERIES [J].
BERNSTEIN, PA ;
CHIU, DMW .
JOURNAL OF THE ACM, 1981, 28 (01) :25-40
[10]  
BRAMANDIA R, 2008, P INT C WORLD WID WE