Tolerating the community detection resolution limit with edge weighting

被引:120
作者
Berry, Jonathan W. [1 ]
Hendrickson, Bruce [1 ]
LaViolette, Randall A. [1 ]
Phillips, Cynthia A. [1 ]
机构
[1] Sandia Natl Labs, Albuquerque, NM 87185 USA
关键词
D O I
10.1103/PhysRevE.83.056119
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
Communities of vertices within a giant network such as the World Wide Web are likely to be vastly smaller than the network itself. However, Fortunato and Barthelemy have proved that modularity maximization algorithms for community detection may fail to resolve communities with fewer than root L/2 edges, where L is the number of edges in the entire network. This resolution limit leads modularity maximization algorithms to have notoriously poor accuracy on many real networks. Fortunato and Barthelemy's argument can be extended to networks with weighted edges as well, and we derive this corollary argument. We conclude that weighted modularity algorithms may fail to resolve communities with less than root W is an element of/2 total edge weight, where W is the total edge weight in the network and is an element of is the maximum weight of an intercommunity edge. If is an element of is small, then small communities can be resolved. Given a weighted or unweighted network, we describe how to derive new edge weights in order to achieve a low is an element of, we modify the Clauset, Newman, and Moore (CNM) community detection algorithm to maximize weighted modularity, and we show that the resulting algorithm has greatly improved accuracy. In experiments with an emerging community standard benchmark, we find that our simple CNM variant is competitive with the most accurate community detection methods yet proposed.
引用
收藏
页数:9
相关论文
共 31 条
  • [1] Entropy measures for networks: Toward an information theory of complex topologies
    Anand, Kartik
    Bianconi, Ginestra
    [J]. PHYSICAL REVIEW E, 2009, 80 (04)
  • [2] [Anonymous], 1996, Grooming, Gossip, and the Evolution of Language
  • [3] [Anonymous], 2008, P 17 INT C WORLD WID, DOI DOI 10.1145/1367497.1367591
  • [4] Analysis of the structure of complex networks at different resolution levels
    Arenas, A.
    Fernandez, A.
    Gomez, S.
    [J]. NEW JOURNAL OF PHYSICS, 2008, 10
  • [5] Entropy of network ensembles
    Bianconi, Ginestra
    [J]. PHYSICAL REVIEW E, 2009, 79 (03)
  • [6] Finding local community structure in networks
    Clauset, A
    [J]. PHYSICAL REVIEW E, 2005, 72 (02)
  • [7] Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111
  • [8] Hierarchical structure and the prediction of missing links in networks
    Clauset, Aaron
    Moore, Cristopher
    Newman, M. E. J.
    [J]. NATURE, 2008, 453 (7191) : 98 - 101
  • [9] Graph Twiddling in a MapReduce World
    Cohen, Jonathan
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2009, 11 (04) : 29 - 41
  • [10] Comparing community structure identification -: art. no. P09008
    Danon, L
    Díaz-Guilera, A
    Duch, J
    Arenas, A
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2005, : 219 - 228