Detecting Statistically Significant Communities

被引:4
|
作者
He, Zengyou [1 ,2 ]
Liang, Hao [1 ]
Chen, Zheng [1 ]
Zhao, Can [3 ]
Liu, Yan [1 ]
机构
[1] Dalian Univ Technol, Sch Software, Dalian 116024, Liaoning, Peoples R China
[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116024, Liaoning, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100093, Peoples R China
关键词
Community detection; random graphs; configuration model; statistical significance; NETWORKS; EXTRACTION; INFERENCE; MODEL;
D O I
10.1109/TKDE.2020.3015667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection is a key data analysis problem across different fields. During the past decades, numerous algorithms have been proposed to address this issue. However, most work on community detection does not address the issue of statistical significance. Although some research efforts have been made towards mining statistically significant communities, deriving an analytical solution of p-value for one community under the configuration model is still a challenging mission that remains unsolved. The configuration model is a widely used random graph model in community detection, in which the degree of each node is preserved in the generated random networks. To partially fulfill this void, we present a tight upper bound on the p-value of a single community under the configuration model, which can be used for quantifying the statistical significance of each community analytically. Meanwhile, we present a local search method to detect statistically significant communities in an iterative manner. Experimental results demonstrate that our method is comparable with the competing methods on detecting statistically significant communities.
引用
收藏
页码:2711 / 2725
页数:15
相关论文
共 50 条
  • [1] Mining Statistically Significant Communities From Weighted Networks
    He, Zengyou
    Chen, Wenfang
    Wei, Xiaoqi
    Liu, Yan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 6073 - 6084
  • [2] Scalable detection of statistically significant communities and hierarchies, using message passing for modularity
    Zhang, Pan
    Moore, Cristopher
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (51) : 18144 - 18149
  • [3] Statistically Significant! But Is Trust of Practical Significance?
    Liu, Dapeng
    Weistroffer, Heinz Roland
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2022, 62 (02) : 247 - 258
  • [4] Detecting Statistically Significant Events in Large Heterogeneous Attribute Graphs via Densest Subgraphs
    Li, Yuan
    Fan, Xiaolin
    Sun, Jing
    Zhao, Yuhai
    Wang, Guoren
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT I, 2020, 12274 : 107 - 120
  • [5] Detecting Latent Communities in Network Formation Models
    Ma, Shujie
    Su, Liangjun
    Zhang, Yichong
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [6] Statistically Significant Discriminative Patterns Searching
    Pham, Hoang Son
    Virlet, Gwendal
    Lavenier, Dominique
    Termier, Alexandre
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2019, 2019, 11708 : 105 - 115
  • [7] A TESTING BASED EXTRACTION ALGORITHM FOR IDENTIFYING SIGNIFICANT COMMUNITIES IN NETWORKS
    Wilson, James D.
    Wang, Simi
    Mucha, Peter J.
    Bhamidi, Shankar
    Nobel, Andrew B.
    ANNALS OF APPLIED STATISTICS, 2014, 8 (03) : 1853 - 1891
  • [8] What is the probability of replicating a statistically significant effect?
    Miller, Jeff
    PSYCHONOMIC BULLETIN & REVIEW, 2009, 16 (04) : 617 - 640
  • [9] Detecting Communities by Integrating Perspective Communities as Semantic Information
    Boujlaleb, Loubna
    Idarrou, Ali
    Mammass, Driss
    2017 5TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW) 2017, 2017, : 126 - 131
  • [10] Significant DBSCAN towards Statistically Robust Clustering
    Xie, Yiqun
    Shekhar, Shashi
    SSTD '19 - PROCEEDINGS OF THE 16TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATABASES, 2019, : 31 - 40