Consistent Subgraph Matching over Large Graphs

被引:2
作者
Yuan, Ye [1 ]
Ma, Delong [2 ]
Zhang, Aoqian [1 ]
Wang, Guoren [1 ]
机构
[1] Beijing Inst Technol, Beijing, Peoples R China
[2] Northeastern Univ, Xian, Peoples R China
来源
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022) | 2022年
关键词
DATA COMPLEXITY; ALGORITHM;
D O I
10.1109/ICDE53745.2022.00235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Subgraph matching over graphs has been extensively studied, due to its wide applications in knowledge bases, social networks, and among others. To catch the inconsistency and errors that commonly exist in these graphs, this paper studies consistent subgraph matching (CSM), i.e., finding the common matches in every consistent graph repair w.r.t a set of conditional graph dependencies (CGDs). We concentrate on subset, superset and symmetric difference graph repairs. We study fundamental problems for CGDs and CSM. We show that the satisfiability, implication, and validation problems of CGDs are coNP-complete, coNP-complete and NP-complete, respectively. We also show that the CSM problem (under any kind of repair) is NP-complete. We provide (parallel) algorithms to solve CSM, and guarantee to reduce running time when given more processors. Using reallife and synthetic graphs, we empirically verify the efficiency and effectiveness of our algorithms.
引用
收藏
页码:2536 / 2548
页数:13
相关论文
共 40 条
[1]  
Akhtar Waseem, 2010, Semantics in Data and Knowledge Bases. 4th International Workshop, SDKB 2010. Revised Selected Papers, P23, DOI 10.1007/978-3-642-23441-5_2
[2]   PG-KEYS: Keys for Property Graphs [J].
Angles, Renzo ;
Bonifati, Angela ;
Dumbrava, Stefania ;
Fletcher, George ;
Hare, Keith W. ;
Hidders, Jan ;
Lee, Victor E. ;
Li, Bei ;
Libkin, Leonid ;
Martens, Wim ;
Murlak, Filip ;
Perryman, Josh ;
Savkovic, Ognjen ;
Schmidt, Michael ;
Sequeda, Juan ;
Staworko, Slawek ;
Tomaszuk, Dominik .
SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, :2423-2436
[3]  
[Anonymous], DBpedia
[4]  
[Anonymous], 2010, LDOW
[5]  
[Anonymous], 2013, Proceedings of the sixth ACM international conference on Web search and data mining
[6]  
Arenas M., 1999, Proceedings of the Eighteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, P68, DOI 10.1145/303976.303983
[7]   On the data complexity of consistent query answering over graph databases [J].
Barcelo, Pablo ;
Fontaine, Gaelle .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2017, 88 :164-194
[8]   EXPRESSIVE PATH QUERIES ON GRAPHS WITH DATA [J].
Barcelo, Pablo ;
Fontaine, Gaelle ;
Lin, Anthony Widjaja .
LOGICAL METHODS IN COMPUTER SCIENCE, 2015, 11 (04)
[9]   CECI: Compact Embedding Cluster Index for Scalable Subgraph Matching [J].
Bhattarai, Bibek ;
Liu, Hang ;
Huang, H. Howie .
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, :1447-1462
[10]   Efficient Subgraph Matching by Postponing Cartesian Products [J].
Bi, Fei ;
Chang, Lijun ;
Lin, Xuemin ;
Qin, Lu ;
Zhang, Wenjie .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :1199-1214