Speeding Up GED Verification for Graph Similarity Search

被引:16
作者
Chang, Lijun [1 ]
Feng, Xing [2 ,4 ]
Lin, Xuemin [2 ]
Qin, Lu [3 ]
Zhang, Wenjie [2 ]
Ouyang, Dian [1 ]
机构
[1] Univ Sydney, Sydney, NSW, Australia
[2] Univ New South Wales, Sydney, NSW, Australia
[3] Univ Technol Sydney, Sydney, NSW, Australia
[4] Google, Mountain View, CA 94043 USA
来源
2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020) | 2020年
关键词
EDIT DISTANCE;
D O I
10.1109/ICDE48307.2020.00074
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph similarity search retrieves from a database all graphs whose edit distance (GED) to a query graph is within a threshold. As GED computation is NP-hard, the existing works adopt the filtering-and-verification paradigm to reduce the number of GED verifications, and they mainly focus on designing filtering techniques while using the now out-dated algorithm A*GED for verification. In this paper, we aim to speed up GED verification, which is orthogonal to the index structures used in the filtering phase. We propose a best-first search algorithm AStar(+)-LSa which improves A*GED by (1) reducing memory consumption, (2) tightening lower bound estimation, and (3) improving the time complexity for lower bound computation. We formally show that AStar(+)-LSa has a lower space and time complexity than A*GED. We further modify AStar(+)-LSa into a depth-first search algorithm to contrast these two search paradigms, and we extend our algorithms for exact GED computation. We conduct extensive empirical studies on real graph datasets, and show that our algorithm AStar(+)-LSa outperforms the state-of-the-art algorithms by several orders of magnitude for both GED verification and GED computation.
引用
收藏
页码:793 / 804
页数:12
相关论文
共 24 条
[1]  
Abu-Aisheh Zeina, 2015, 4th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2015). Proceedings, P271
[2]  
Bernard M., 2006, P EMB 06
[3]   Exact Computation of Graph Edit Distance for Uniform and Non-uniform Metric Edit Costs [J].
Blumenthal, David B. ;
Gamper, Johann .
GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION (GBRPR 2017), 2017, 10310 :211-221
[4]   A graph distance metric based on the maximal common subgraph [J].
Bunke, H ;
Shearer, K .
PATTERN RECOGNITION LETTERS, 1998, 19 (3-4) :255-259
[5]  
Chang L., 2017, ABS170906810 CORR
[6]   Retrieval of objects in video by similarity based on graph matching [J].
Chevalier, F. ;
Domenger, J.-P. ;
Benois-Pineau, J. ;
Delest, M. .
PATTERN RECOGNITION LETTERS, 2007, 28 (08) :939-949
[7]   A graph distance metric combining maximum common subgraph and minimum common supergraph [J].
Fernández, ML ;
Valiente, G .
PATTERN RECOGNITION LETTERS, 2001, 22 (6-7) :753-758
[8]  
Gouda K, 2016, PROC INT CONF DATA, P265, DOI 10.1109/ICDE.2016.7498246
[9]   A binary linear programming formulation of the graph edit distance [J].
Justice, Derek ;
Hero, Alfred .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (08) :1200-1214
[10]  
Kim J., 2019, P EDBT 19