ArcMatch: high-performance subgraph matching for labeled graphs by exploiting edge domains

被引:0
作者
Bonnici, Vincenzo [1 ]
Grasso, Roberto [2 ]
Micale, Giovanni [3 ]
di Maria, Antonio [3 ]
Shasha, Dennis [4 ]
Pulvirenti, Alfredo [3 ]
Giugno, Rosalba [5 ]
机构
[1] Univ Parma, Dept Math Phys & Comp Sci, Parco Area Sci 53A, I-43124 Parma, Italy
[2] Univ Catania, Dept Phys & Astron, Via S Sofia 64, I-95123 Catania, Italy
[3] Univ Catania, Dept Clin & Expt Med, Via Santa Sofia 89, I-95123 Catania, Italy
[4] Courant Inst Math Sci, Comp Sci Dept, 251 Mercer St, New York, NY 10012 USA
[5] Univ Verona, Dept Comp Sci, Str Grazie 15, I-37134 Verona, Italy
关键词
Subgraph isomorphism; Domain reduction; Path-based reduction; Labelled graphs; ISOMORPHISM; NETWORK; ALGORITHM;
D O I
10.1007/s10618-024-01061-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consider a large labeled graph (network), denoted the target. Subgraph matching is the problem of finding all instances of a small subgraph, denoted the query, in the target graph. Unlike the majority of existing methods that are restricted to graphs with labels solely on vertices, our proposed approach, named can effectively handle graphs with labels on both vertices and edges. ntroduces an efficient new vertex/edge domain data structure filtering procedure to speed up subgraph queries. The procedure, called path-based reduction, filters initial domains by scanning them for paths up to a specified length that appear in the query graph. Additionally, ncorporates existing techniques like variable ordering and parent selection, as well as adapting the core search process, to take advantage of the information within edge domains. Experiments in real scenarios such as protein-protein interaction graphs, co-authorship networks, and email networks, show that s faster than state-of-the-art systems varying the number of distinct vertex labels over the whole target graph and query sizes.
引用
收藏
页码:3868 / 3921
页数:54
相关论文
共 59 条
[1]   Fast Subgraph Matching Strategies Based on Pattern-Only Heuristics [J].
Aparo, Antonino ;
Bonnici, Vincenzo ;
Micale, Giovanni ;
Ferro, Alfredo ;
Shasha, Dennis ;
Pulvirenti, Alfredo ;
Giugno, Rosalba .
INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2019, 11 (01) :21-32
[2]  
Archibald B, 2021, 27 INT C PRINC PRACT
[3]  
Avellaneda Florent, 2019, 2019 INT C CYB ICOCS, P1
[4]   APPLICATIONS OF GRAPH-THEORY IN CHEMISTRY [J].
BALABAN, AT .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (03) :334-343
[5]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[6]   Efficient Subgraph Matching by Postponing Cartesian Products [J].
Bi, Fei ;
Chang, Lijun ;
Lin, Xuemin ;
Qin, Lu ;
Zhang, Wenjie .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :1199-1214
[7]   Heterogeneous graph neural networks analysis: a survey of techniques, evaluations and applications [J].
Bing, Rui ;
Yuan, Guan ;
Zhu, Mu ;
Meng, Fanrong ;
Ma, Huifang ;
Qiao, Shaojie .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (08) :8003-8042
[8]   On the Variable Ordering in Subgraph Isomorphism Algorithms [J].
Bonnici, Vincenzo ;
Giugno, Rosalba .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (01) :193-203
[9]   A subgraph isomorphism algorithm and its application to biochemical data [J].
Bonnici, Vincenzo ;
Giugno, Rosalba ;
Pulvirenti, Alfredo ;
Shasha, Dennis ;
Ferro, Alfredo .
BMC BIOINFORMATICS, 2013, 14
[10]  
Bonnici V, 2010, LECT N BIOINFORMAT, V6282, P195, DOI 10.1007/978-3-642-16001-1_17