Detecting Statistically Significant Communities

被引:4
作者
He, Zengyou [1 ,2 ]
Liang, Hao [1 ]
Chen, Zheng [1 ]
Zhao, Can [3 ]
Liu, Yan [1 ]
机构
[1] Dalian Univ Technol, Sch Software, Dalian 116024, Liaoning, Peoples R China
[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116024, Liaoning, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100093, Peoples R China
关键词
Community detection; random graphs; configuration model; statistical significance; NETWORKS; EXTRACTION; INFERENCE; MODEL;
D O I
10.1109/TKDE.2020.3015667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection is a key data analysis problem across different fields. During the past decades, numerous algorithms have been proposed to address this issue. However, most work on community detection does not address the issue of statistical significance. Although some research efforts have been made towards mining statistically significant communities, deriving an analytical solution of p-value for one community under the configuration model is still a challenging mission that remains unsolved. The configuration model is a widely used random graph model in community detection, in which the degree of each node is preserved in the generated random networks. To partially fulfill this void, we present a tight upper bound on the p-value of a single community under the configuration model, which can be used for quantifying the statistical significance of each community analytically. Meanwhile, we present a local search method to detect statistically significant communities in an iterative manner. Experimental results demonstrate that our method is comparable with the competing methods on detecting statistically significant communities.
引用
收藏
页码:2711 / 2725
页数:15
相关论文
共 50 条
  • [21] An Efficient Rigorous Approach for Identifying Statistically Significant Frequent Itemsets
    Kirsch, Adam
    Mitzenmacher, Michael
    Pietracaprina, Andrea
    Pucci, Geppino
    Upfal, Eli
    Vandin, Fabio
    JOURNAL OF THE ACM, 2012, 59 (03)
  • [22] Mining Statistically Significant Co-location and Segregation Patterns
    Barua, Sajib
    Sander, Joerg
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1185 - 1199
  • [23] Detecting Overlapping Communities in Modularity Optimization by Reweighting Vertices
    Tsung, Chen-Kun
    Ho, Hann-Jang
    Chen, Chien-Yu
    Chang, Tien-Wei
    Lee, Sing-Ling
    ENTROPY, 2020, 22 (08)
  • [24] Detecting Communities with Different Sizes for Social Network Analysis
    Zhou, Lihua
    Lu, Kevin
    COMPUTER JOURNAL, 2015, 58 (09) : 1894 - 1908
  • [25] Leveraging disjoint communities for detecting overlapping community structure
    Chakraborty, Tanmoy
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2015,
  • [26] A local multiresolution algorithm for detecting communities of unbalanced structures
    Zalik, Krista Rizman
    Zalik, Borut
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2014, 407 : 380 - 393
  • [27] Position Matters: Play a Sequential Game to Detect Significant Communities
    Wang, Yuyao
    Cao, Jie
    Wang, Youquan
    Wu, Jia
    Liu, Yangyang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3402 - 3416
  • [28] Significant DBSCAN plus : Statistically Robust Density-based Clustering
    Xie, Yiqun
    Jia, Xiaowei
    Shekhar, Shashi
    Bao, Han
    Zhou, Xun
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)
  • [29] Preservation of Statistically Significant Patterns in Multiresolution 0-1 Data
    Adhikari, Prem Raj
    Hollmen, Jaakko
    PATTERN RECOGNITION IN BIOINFORMATICS, 2010, 6282 : 86 - 97
  • [30] Coup de Grace for a Tough Old Bull: "Statistically Significant" Expires
    Hurlbert, Stuart H.
    Levine, Richard A.
    Utts, Jessica
    AMERICAN STATISTICIAN, 2019, 73 : 352 - 357