SQBC: An efficient subgraph matching method over large and dense graphs

被引:12
作者
Zheng, Weiguo [1 ,2 ]
Zou, Lei [2 ]
Lian, Xiang [3 ]
Zhang, Huaming [4 ]
Wang, Wei [5 ]
Zhao, Dongyan [2 ]
机构
[1] Wuhan Univ, State Key Lab Software Engn, Wuhan 430072, Peoples R China
[2] Peking Univ, Beijing 100871, Peoples R China
[3] Univ Texas Pan Amer, Dept Comp Sci, Edinburg, TX 78539 USA
[4] Univ Alabama, Dept Comp Sci, Huntsville, AL 35899 USA
[5] Engn Univ CAPE, Dept Elect Technol, Xian, Peoples R China
关键词
Algorithm; Large network; Database; Graph theory; Subgraph isomorphism; Index strategy; MAXIMUM CLIQUE; BOUND ALGORITHM; TOOL;
D O I
10.1016/j.ins.2013.10.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent progress in biology and computer science have generated many complicated networks, most of which can be modeled as large and dense graphs. Developing effective and efficient subgraph match methods over these graphs is urgent, meaningful and necessary. Although some excellent exploratory approaches have been proposed these years, they show poor performances when the graphs are large and dense. This paper presents a novel Subgraph Query technique Based on Clique feature, called SQBC, which integrates the carefully designed clique encoding with the existing vertex encoding 1401 as the basic index unit to reduce the search space. Furthermore, SQBC optimizes the subgraph isomorphism test based on clique features. Extensive experiments over biological networks, RDF dataset and synthetic graphs have shown that SQBC outperforms the most popular competitors both in effectiveness and efficiency especially when the data graphs are large and dense. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:116 / 131
页数:16
相关论文
共 43 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]  
[Anonymous], 2007, Proceedings of the 2007 ACM SIGMOD international conference on Management of data
[3]   Vehicle routing with a sparse feasibility graph [J].
Beasley, JE ;
Christofides, N .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1997, 98 (03) :499-511
[4]   A (sub)graph isomorphism algorithm for matching large graphs [J].
Cordella, LP ;
Foggia, P ;
Sansone, C ;
Vento, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (10) :1367-1372
[5]   SING: Subgraph search In Non-homogeneous Graphs [J].
Di Natale, Raffaele ;
Ferro, Alfredo ;
Giugno, Rosalba ;
Mongiovi, Misael ;
Pulvirenti, Alfredo ;
Shasha, Dennis .
BMC BIOINFORMATICS, 2010, 11
[6]   QNet: A tool for querying protein interaction networks [J].
Dost, Banu ;
Shlomi, Tomer ;
Gupta, Nitin ;
Ruppin, Eytan ;
Bafna, Vineet ;
Sharan, Roded .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2008, 15 (07) :913-925
[7]  
GEDIK B, 2007, ICDE, P536
[8]  
Giugno R, 2002, INT C PATT RECOG, P112, DOI 10.1109/ICPR.2002.1048250
[9]   Simple ingredients leading to very efficient heuristics for the maximum clique problem [J].
Grosso, Andrea ;
Locatelli, Marco ;
Pullan, Wayne .
JOURNAL OF HEURISTICS, 2008, 14 (06) :587-612
[10]  
He H., 2006, ICDE