Dense Subgraph Extraction with Application to Community Detection

被引:160
作者
Chen, Jie [1 ]
Saad, Yousef [1 ]
机构
[1] Univ Minnesota Twin Cities, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
Dense subgraph; social network; community; matrix reordering; hierarchical clustering; partial clustering;
D O I
10.1109/TKDE.2010.271
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method for identifying a set of dense subgraphs of a given sparse graph. Within the main applications of this "dense subgraph problem," the dense subgraphs are interpreted as communities, as in, e.g., social networks. The problem of identifying dense subgraphs helps analyze graph structures and complex networks and it is known to be challenging. It bears some similarities with the problem of reordering/blocking matrices in sparse matrix techniques. We exploit this link and adapt the idea of recognizing matrix column similarities, in order to compute a partial clustering of the vertices in a graph, where each cluster represents a dense subgraph. In contrast to existing subgraph extraction techniques which are based on a complete clustering of the graph nodes, the proposed algorithm takes into account the fact that not every participating node in the network needs to belong to a community. Another advantage is that the method does not require to specify the number of clusters; this number is usually not known in advance and is difficult to estimate. The computational process is very efficient, and the effectiveness of the proposed method is demonstrated in a few real-life examples.
引用
收藏
页码:1216 / 1230
页数:15
相关论文
共 50 条
  • [1] Abello J., 2002, P LAT AM S THEOR INF
  • [2] Abou-Rjeili A., 2006, P INT PAR DISTR PROC
  • [3] Adamic Lada A., 2005, P 3 INT WORKSHOP LIN, P36, DOI DOI 10.1145/1134271.1134277
  • [4] Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
  • [5] Internet -: Diameter of the World-Wide Web
    Albert, R
    Jeong, H
    Barabási, AL
    [J]. NATURE, 1999, 401 (6749) : 130 - 131
  • [6] [Anonymous], 2003, Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining
  • [7] Banerjee A, 2005, J MACH LEARN RES, V6, P1345
  • [8] Banerjee A, 2007, J MACH LEARN RES, V8, P1919
  • [9] Emergence of scaling in random networks
    Barabási, AL
    Albert, R
    [J]. SCIENCE, 1999, 286 (5439) : 509 - 512
  • [10] The LCA problem revisited
    Bender, MA
    Farach-Colton, M
    [J]. LATIN 2000: THEORETICAL INFORMATICS, 2000, 1776 : 88 - 94