Efficient Subgraph Matching: Harmonizing Dynamic Programming, Adaptive Matching Order, and Failing Set Together

被引:117
作者
Han, Myoungji [1 ]
Kim, Hyunjoon [1 ]
Gu, Geonmo [1 ]
Park, Kunsoo [1 ]
Han, Wook-Shin [2 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea
来源
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019年
基金
新加坡国家研究基金会;
关键词
subgraph matching; subgraph isomorphism; dynamic programming; adaptive matching order; failing set; QUERY OPTIMIZATION; ISOMORPHISM; ALGORITHM; PARALLEL; GRAPHS;
D O I
10.1145/3299869.3319880
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Subgraph matching (or subgraph isomorphism) is one of the fundamental problems in graph analysis. Extensive research has been done to develop practical solutions for subgraph matching. The state-of-the-art algorithms such as CFL-Match and Turboiso convert a query graph into a spanning tree for obtaining candidates for each query vertex and obtaining a good matching order with the spanning tree. However, by using the spanning tree instead of the original query graph, it could lead to lower pruning power and a sub-optimal matching order. Another limitation is that they perform redundant computation in search without utilizing the knowledge learned from past computation. In this paper, we introduce three novel concepts to address these inherent limitations: 1) dynamic programming between a directed acyclic graph (DAG) and a graph, 2) adaptive matching order with DAG ordering, and 3) pruning by failing sets, which together lead to a much faster algorithm DAF for subgraph matching. Extensive experiments with real datasets show that DAF outperforms the fastest existing solution by up to orders of magnitude in terms of recursive calls as well as in terms of the elapsed time.
引用
收藏
页码:1429 / 1446
页数:18
相关论文
共 49 条
[1]   COLOR-CODING [J].
ALON, N ;
YUSTER, R ;
ZWICK, U .
JOURNAL OF THE ASSOCIATION FOR COMPUTING MACHINERY, 1995, 42 (04) :844-856
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   Distributed Evaluation of Subgraph Queries Using Worst-case Optimal Low-Memory Dataflows [J].
Ammar, Khaled ;
McSherry, Frank ;
Salihoglu, Semih ;
Joglekar, Manas .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (06) :691-704
[4]   Efficient Subgraph Matching by Postponing Cartesian Products [J].
Bi, Fei ;
Chang, Lijun ;
Lin, Xuemin ;
Qin, Lu ;
Zhang, Wenjie .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :1199-1214
[5]  
BODLAENDER HL, 1988, LECT NOTES COMPUT SC, V317, P105
[6]  
Cao Y, 2015, PROC INT CONF DATA, P161, DOI 10.1109/ICDE.2015.7113281
[7]  
Choudhury S., 2015, EDBT, P157, DOI DOI 10.5441/002/EDBT.2015.15
[8]   A (sub)graph isomorphism algorithm for matching large graphs [J].
Cordella, LP ;
Foggia, P ;
Sansone, C ;
Vento, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (10) :1367-1372
[9]  
Cormen T. H., 2009, Introduction to algorithms, VThird
[10]  
Eppstein D., 2002, Graph Algorithms and Applications I, P283, DOI [10.1142/97898127776380014, DOI 10.1142/97898127776380014]