Top-k overlapping densest subgraphs

被引:39
作者
Galbrun, Esther [1 ]
Gionis, Aristides [2 ,3 ]
Tatti, Nikolaj [2 ,3 ]
机构
[1] Inria Nancy Grand Est, Villers Les Nancy, France
[2] Aalto Univ, HIIT, Helsinki, Finland
[3] Aalto Univ, Dept Comp Sci, Helsinki, Finland
关键词
Community detection; Overlapping communities; Social network analysis; Dense subgraphs; Diverse subgraphs; Approximation algorithm; COMMUNITY STRUCTURE; NETWORKS; MODEL;
D O I
10.1007/s10618-016-0464-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding dense subgraphs is an important problem in graph mining and has many practical applications. At the same time, while large real-world networks are known to have many communities that are not well-separated, the majority of the existing work focuses on the problem of finding a single densest subgraph. Hence, it is natural to consider the question of finding the top-k densest subgraphs. One major challenge in addressing this question is how to handle overlaps: eliminating overlaps completely is one option, but this may lead to extracting subgraphs not as dense as it would be possible by allowing a limited amount of overlap. Furthermore, overlaps are desirable as in most real-world graphs there are vertices that belong to more than one community, and thus, to more than one densest subgraph. In this paper we study the problem of finding top-k overlapping densest subgraphs, and we present a new approach that improves over the existing techniques, both in theory and practice. First, we reformulate the problem definition in a way that we are able to obtain an algorithm with constant-factor approximation guarantee. Our approach relies on using techniques for solving the max-sum diversification problem, which however, we need to extend in order to make them applicable to our setting. Second, we evaluate our algorithm on a collection of benchmark datasets and show that it convincingly outperforms the previous methods, both in terms of quality and efficiency.
引用
收藏
页码:1134 / 1165
页数:32
相关论文
共 45 条
[1]   Link communities reveal multiscale complexity in networks [J].
Ahn, Yong-Yeol ;
Bagrow, James P. ;
Lehmann, Sune .
NATURE, 2010, 466 (7307) :761-U11
[2]  
Andersen R, 2009, P 6 INT WORKSH ALG M, P25
[3]   Dense Subgraph Maintenance under Streaming Edge Weight Updates for Real-time Story Identification [J].
Angel, Albert ;
Sarkas, Nikos ;
Koudas, Nick ;
Srivastava, Divesh .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (06) :574-585
[4]  
[Anonymous], ANN S FDN COMP SCI
[5]  
[Anonymous], 2013, WSDM, DOI DOI 10.1145/2433396.2433471
[6]  
Asahiro Y, 1996, P 5 SCAND WORKSH ALG, P136
[7]  
Balalau OD, 2015, P 8 ACM INT C WEB SE, P379
[8]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[9]  
Borodin A, 2012, P 31 ACM SIGMOD SIGA, P155
[10]  
Charikar M., 2000, INT WORKSH APPR ALG, P84, DOI DOI 10.1007/3-540-44436-X_10