Speeding Up GED Verification for Graph Similarity Search

被引:16
作者
Chang, Lijun [1 ]
Feng, Xing [2 ,4 ]
Lin, Xuemin [2 ]
Qin, Lu [3 ]
Zhang, Wenjie [2 ]
Ouyang, Dian [1 ]
机构
[1] Univ Sydney, Sydney, NSW, Australia
[2] Univ New South Wales, Sydney, NSW, Australia
[3] Univ Technol Sydney, Sydney, NSW, Australia
[4] Google, Mountain View, CA 94043 USA
来源
2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020) | 2020年
关键词
EDIT DISTANCE;
D O I
10.1109/ICDE48307.2020.00074
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph similarity search retrieves from a database all graphs whose edit distance (GED) to a query graph is within a threshold. As GED computation is NP-hard, the existing works adopt the filtering-and-verification paradigm to reduce the number of GED verifications, and they mainly focus on designing filtering techniques while using the now out-dated algorithm A*GED for verification. In this paper, we aim to speed up GED verification, which is orthogonal to the index structures used in the filtering phase. We propose a best-first search algorithm AStar(+)-LSa which improves A*GED by (1) reducing memory consumption, (2) tightening lower bound estimation, and (3) improving the time complexity for lower bound computation. We formally show that AStar(+)-LSa has a lower space and time complexity than A*GED. We further modify AStar(+)-LSa into a depth-first search algorithm to contrast these two search paradigms, and we extend our algorithms for exact GED computation. We conduct extensive empirical studies on real graph datasets, and show that our algorithm AStar(+)-LSa outperforms the state-of-the-art algorithms by several orders of magnitude for both GED verification and GED computation.
引用
收藏
页码:793 / 804
页数:12
相关论文
共 24 条
[11]   Similarity Search in Graph Databases: A Multi-layered Indexing Approach [J].
Liang, Yongjiang ;
Zhao, Peixiang .
2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, :783-794
[12]   Edit distance-based kernel functions for structural pattern classification [J].
Neuhaus, Michel ;
Bunke, Horst .
PATTERN RECOGNITION, 2006, 39 (10) :1852-1863
[13]   A heuristic graph comparison algorithm and its application to detect functionally related enzyme clusters [J].
Ogata, H ;
Fujibuchi, W ;
Goto, S ;
Kanehisa, M .
NUCLEIC ACIDS RESEARCH, 2000, 28 (20) :4021-4028
[14]  
Riesen K., 2007, P MLG 07
[15]  
Riesen K., 2013, P GBRPR 13
[16]   Graph edit distance from spectral seriation [J].
Robles-Kelly, A ;
Hancock, ER .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (03) :365-378
[17]   EXPERIMENTS WITH TEXTURE CLASSIFICATION USING AVERAGES OF LOCAL PATTERN MATCHES [J].
PIETIKAINEN, M ;
ROSENFELD, A ;
DAVIS, LS .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (03) :421-426
[18]   An Efficient Graph Indexing Method [J].
Wang, Xiaoli ;
Ding, Xiaofeng ;
Tung, Anthony K. H. ;
Ying, Shanshan ;
Jin, Hai .
2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, :210-221
[19]  
Zeng Z., 2009, VLDB, V2, P25, DOI [DOI 10.14778/1687627.1687631, DOI 10.14778/1687627.16876312]
[20]   Efficient structure similarity searches: a partition-based approach [J].
Zhao, Xiang ;
Xiao, Chuan ;
Lin, Xuemin ;
Zhang, Wenjie ;
Wang, Yang .
VLDB JOURNAL, 2018, 27 (01) :53-78