Efficient Closest Community Search over Large Graphs

被引:2
作者
Cai, Mingshen [1 ]
Chang, Lijun [2 ]
机构
[1] Canva, Sydney, Australia
[2] Univ Sydney, Sydney, Australia
来源
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT II | 2020年 / 12113卷
关键词
D O I
10.1007/978-3-030-59416-9_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the closest community search problem. Given a graph G and a set of query vertices Q, the closest community of Q in G is the connected subgraph of G that contains Q, is most cohesive (i.e., with the largest possible minimum vertex degree), is closest to Q, and is maximal. We show that this can be computed via a two-stage approach: (1) compute the maximal connected subgraph go of G that contains Q and is most cohesive, and (2) iteratively remove from go the vertex that is furthest to Q and subsequently also other vertices that violate the cohesiveness requirement. The last non-empty subgraph is the closest community of Q in G. We first propose baseline approaches for the two stages that run in O(n + m) and O(n(0) x m(0)) time, respectively, where n (resp. n(0)) and m (resp. m(0)) are the number of vertices and edges in G (resp. g(0)). Then, we develop techniques to improve the time complexities of the two stages into O(n(0) + m(0)) and O(m(0) + n(0) log n(0)), respectively. Moreover, we further design an algorithm CCS with the same time complexity as O(m(0) + n(0) log n(0)), but performs much better in practice. Extensive empirical studies demonstrate that CCS can efficiently compute the closest community over large graphs.
引用
收藏
页码:569 / 587
页数:19
相关论文
共 14 条
[1]  
Batagelj V., 2003, An O (m) algorithm for cores decomposition of networks, P845
[2]   An Optimal and Progressive Approach to Online Search of Top-K Influential Communities [J].
Bi, Fei ;
Chang, Lijun ;
Lin, Xuemin ;
Zhang, Wenjie .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (09) :1056-1068
[3]  
Chang L., 2018, Springer Series in the Data Sciences, DOI [10.1007/978-3-030-03599-0, DOI 10.1007/978-3-030-03599-0]
[4]   Index-based Optimal Algorithms for Computing Steiner Components with Maximum Connectivity [J].
Chang, Lijun ;
Lin, Xuemin ;
Qin, Lu ;
Yu, Jeffrey Xu ;
Zhang, Wenjie .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :459-474
[5]  
Cormen T. H., 2009, Introduction To Algorithms
[6]   Local Search of Communities in Large Graphs [J].
Cui, Wanyun ;
Xiao, Yanghua ;
Wang, Haixun ;
Wang, Wei .
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, :991-1002
[7]  
Eifrem E., 2013, GRAPH DATABASES
[8]   Community detection in graphs [J].
Fortunato, Santo .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2010, 486 (3-5) :75-174
[9]   Functional cartography of complex metabolic networks [J].
Guimerà, R ;
Amaral, LAN .
NATURE, 2005, 433 (7028) :895-900
[10]   Community Search over Big Graphs: Models, Algorithms, and Opportunities [J].
Huang, Xin ;
Lakshmanan, Laks V. S. ;
Xu, Jianliang .
2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, :1451-1454