Rule-Based Graph Repairing: Semantic and Efficient Repairing Methods

被引:24
作者
Cheng, Yurong [1 ]
Chen, Lei [2 ]
Yuan, Ye [3 ]
Wang, Guoren [1 ,3 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[3] Northeastern Univ, Sch Comp Sci & Engn, Boston, MA 02115 USA
来源
2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE) | 2018年
基金
美国国家科学基金会;
关键词
SUBGRAPH ISOMORPHISM; ALGORITHMS;
D O I
10.1109/ICDE.2018.00075
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-life graph datasets extracted from Web are inevitably full of incompleteness, conflicts, and redundancies, so graph data cleaning shows its necessity. One of the main issues is to automatically repair the graph with some repairing rules. Although rules like data dependencies have been widely studied in relational data repairing, very few works exist to repair the graph data. In this paper, we introduce an automatic repairing semantic for graphs, called Graph-Repairing Rules (GRRs). This semantic can capture the incompleteness, conflicts, and redundancies in the graphs and indicate how to correct these errors. We study three fundamental problems associated with GRRs, implication, consistency and termination, which show whether a given set of GRRs make sense. Repairing the graph data using GRRs involves a problem of finding isomorphic subgraphs of the graph data for each GRR, which is NP-complete. To efficiently circumvent the complex calculation of subgraph isomorphism, we design a decomposition-and-join strategy to solve this problem. Extensive experiments on real datasets show that our GRR semantic and corresponding repairing algorithms can effectively and efficiently repair real-life graph data.
引用
收藏
页码:773 / 784
页数:12
相关论文
共 39 条
[1]  
Aggarwal CC, 2010, ADV DATABASE SYST, V40, P1, DOI 10.1007/978-1-4419-6045-0
[2]  
[Anonymous], 2013, P 9 INT C SEMANTIC S, DOI DOI 10.1145/2506182.2506195
[3]  
[Anonymous], 2005, SIGMOD
[4]  
[Anonymous], 2013, ICDE
[5]  
[Anonymous], 2005, Complexity Theory: Exploring the Limits of Efficient Algorithms
[6]  
Arias M, 2011, ARXIV11035043
[7]   Recognizing chordal probe graphs and cycle-bicolorable graphs [J].
Berry, Anne ;
Golumbic, Martin Charles ;
Lipshteyn, Marina .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 2007, 21 (03) :573-591
[8]  
Bertossi L., 2010, THEORY COMPUTING SYS, V52, P441
[9]   Sampling the Repairs of Functional Dependency Violations under Hard Constraints [J].
Beskales, George ;
Ilyas, Ihab F. ;
Golab, Lukasz .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) :197-207
[10]  
Buss S. R., 1997, Computational Logic and Proof Theory. 5th Kurt Godel Colloquium, KGC'97. Proceedings, P18