Detecting Statistically Significant Communities

被引:4
作者
He, Zengyou [1 ,2 ]
Liang, Hao [1 ]
Chen, Zheng [1 ]
Zhao, Can [3 ]
Liu, Yan [1 ]
机构
[1] Dalian Univ Technol, Sch Software, Dalian 116024, Liaoning, Peoples R China
[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116024, Liaoning, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100093, Peoples R China
关键词
Community detection; random graphs; configuration model; statistical significance; NETWORKS; EXTRACTION; INFERENCE; MODEL;
D O I
10.1109/TKDE.2020.3015667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection is a key data analysis problem across different fields. During the past decades, numerous algorithms have been proposed to address this issue. However, most work on community detection does not address the issue of statistical significance. Although some research efforts have been made towards mining statistically significant communities, deriving an analytical solution of p-value for one community under the configuration model is still a challenging mission that remains unsolved. The configuration model is a widely used random graph model in community detection, in which the degree of each node is preserved in the generated random networks. To partially fulfill this void, we present a tight upper bound on the p-value of a single community under the configuration model, which can be used for quantifying the statistical significance of each community analytically. Meanwhile, we present a local search method to detect statistically significant communities in an iterative manner. Experimental results demonstrate that our method is comparable with the competing methods on detecting statistically significant communities.
引用
收藏
页码:2711 / 2725
页数:15
相关论文
共 50 条
[31]   Coup de Grace for a Tough Old Bull: "Statistically Significant" Expires [J].
Hurlbert, Stuart H. ;
Levine, Richard A. ;
Utts, Jessica .
AMERICAN STATISTICIAN, 2019, 73 :352-357
[32]   Detecting Periods of Significant Increased Communication Levels for Subgroups of Targeted Individuals [J].
Sparks, Ross .
QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2016, 32 (05) :1871-1888
[33]   A hybrid artificial immune network for detecting communities in complex networks [J].
Karimi-Majd, Amir-Mohsen ;
Fathian, Mohammad ;
Amiri, Babak .
COMPUTING, 2015, 97 (05) :483-507
[34]   Adaptive multi-resolution Modularity for detecting communities in networks [J].
Chen, Shi ;
Wang, Zhi-Zhong ;
Bao, Mei-Hua ;
Tang, Liang ;
Zhou, Ji ;
Xiang, Ju ;
Li, Jian-Ming ;
Yi, Chen-He .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 491 :591-603
[35]   Detecting informative higher-order interactions in statistically validated hypergraphs [J].
Musciotto, Federico ;
Battiston, Federico ;
Mantegna, Rosario N. .
COMMUNICATIONS PHYSICS, 2021, 4 (01)
[36]   Effectively Detecting Communities by Adjusting Initial Structure via Cores [J].
Chen, Mei ;
Yang, Zhichong ;
Wen, Xiaofang ;
Leng, Mingwei ;
Zhang, Mei ;
Li, Ming .
COMPLEXITY, 2019, 2019
[37]   The Fragility of Statistically Significant Findings From Randomized Trials in Head and Neck Surgery [J].
Noel, Christopher W. ;
McMullen, Caitlin ;
Yao, Christopher ;
Monteiro, Eric ;
Goldstein, David P. ;
Eskander, Antoine ;
de Almeida, John R. .
LARYNGOSCOPE, 2018, 128 (09) :2094-2100
[38]   Testing for statistically significant differences in predictions obtained from competing creep models [J].
Evans, M. .
MATERIALS SCIENCE AND TECHNOLOGY, 2024, 40 (10) :743-754
[39]   VERSACHI: Finding Statistically Significant Subgraph Matches using Chebyshev's Inequality [J].
Agarwal, Shubhangi ;
Dutta, Sourav ;
Bhattacharya, Arnab .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, :2812-2816
[40]   Almost all articles on cancer prognostic markers report statistically significant results [J].
Kyzas, Panayiotis A. ;
Denaxa-Kyza, Despina ;
Ioannidis, John P. A. .
EUROPEAN JOURNAL OF CANCER, 2007, 43 (17) :2559-2579