Enhanced subgraph matching for large graphs using candidate region-based decomposition and ordering

被引:3
作者
Ansari, Zubair Ali [1 ]
Parwez, Md Aslam [2 ]
Thoker, Irfan Rashid [3 ]
Jahiruddin [4 ]
机构
[1] Geethanjali Coll Engn & Technol, Dept Comp Sci & Engn, Hyderabad, Telangana, India
[2] Jamia Hamdard, Dept Comp Sci & Engn, New Delhi, India
[3] Govt Jammu & Kashmir, Dept Educ, Srinagar, India
[4] Jamia Millia Islamia, Dept Comp Sci, Delhi, India
关键词
Subgraph isomorphism; Graph search; Eccentricity; Candidate region ordering; Large graph; Embedding; Straggler query; ISOMORPHISM; ALGORITHM;
D O I
10.1016/j.jksuci.2023.101694
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The subgraph matching problem associated with large graphs is an emerging research challenge in graph search due to the growing size of the web, social, and metabolic graphs, and the wide availability of graph databases. Such problems involve finding all instances (aka embedding) of the small-sized query graph in the associated large-sized reference graph. Many state-of-the-art algorithms, including VF3, RI, CFLMatch, and Glasgow, exist to solve subgraph matching problem. RI is one of the fastest subgraph matching algorithms focusing mainly on time efficiency performance measures. However, other performance measures, such as the number of found instances of the query graph (embedding count), the method of ordering the query graph's vertices, and the number of recursive calls, are crucial for the efficiency and effectiveness of the subgraph matching. In this paper, the RI+ algorithm is proposed as an enhanced version of RI, which has been designed using candidate region-based decomposition and ordering. Three novel candidate region orderings have been introduced, namely vertex-count, density, and average-path-length, based on the structural properties of the candidate regions. On empirical analysis of RI+ on real-world data sets, it was observed that RI+ shows significant improvement in efficiency and effectiveness over RI on both performance evaluation measures, namely, embedding count and search time. The influence of the proposed candidate region orderings on the search time of RI+ was also analyzed, revealing that a suitable candidate region ordering has the potential to improve the search time of the proposed algorithm. (c) 2023 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:13
相关论文
共 39 条
[21]   When Subgraph Isomorphism is Really Hard, and Why This Matters for Graph Databases [J].
McCreesh, Ciaran ;
Prosser, Patrick ;
Solnon, Christine ;
Trimble, James .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 :723-759
[22]   An efficient pruning method for subgraph matching in large-scale graphs [J].
Moayed, Hojjat ;
Mansoori, Eghbal G. ;
Moosavi, Mohammad R. .
JOURNAL OF SUPERCOMPUTING, 2023, 79 (10) :10511-10532
[23]   Querying massive graph data: A compress and search approach [J].
Nabti, Chemseddine ;
Seba, Hamida .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 74 :63-75
[24]   Dominance-Partitioned Subgraph Matching on Large RDF Graph [J].
Ning, Bo ;
Sun, Yunhao ;
Zhao, Deji ;
Xing, Weikang ;
Li, Guanyu .
COMPLEXITY, 2020, 2020
[25]   Subgraph Matching: on Compression and Computation [J].
Qiao, Miao ;
Zhang, Hao ;
Cheng, Hong .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (02) :176-188
[26]   Exploiting Vertex Relationships in Speeding up Subgraph Isomorphism over Large Graphs [J].
Ren, Xuguang ;
Wang, Junhu .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (05) :617-628
[27]   The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing [J].
Sahu, Siddhartha ;
Mhedhbi, Amine ;
Salihoglu, Semih ;
Lin, Jimmy ;
Ozsu, M. Tamer .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (04) :420-431
[28]   Taming Verification Hardness: An Efficient Algorithm for Testing Subgraph Isomorphism [J].
Shang, Haichuan ;
Zhang, Ying ;
Lin, Xuemin ;
Yu, Jeffrey Xu .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01) :364-375
[29]   Experimental Evaluation of Subgraph Isomorphism Solvers [J].
Solnon, Christine .
GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019, 2019, 11510 :1-13
[30]   RapidFlow: An E.icient Approach to Continuous Subgraph Matching [J].
Sun, Shixuan ;
Sun, Xibo ;
He, Bingsheng ;
Luo, Qiong .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (11) :2415-2427