Fast Network Community Detection With Profile-Pseudo Likelihood Methods

被引:13
作者
Wang, Jiangzhou [1 ,2 ,3 ]
Zhang, Jingfei [4 ]
Liu, Binghui [1 ,2 ]
Zhu, Ji [5 ]
Guo, Jianhua [1 ,2 ]
机构
[1] Northeast Normal Univ, Sch Math & Stat, Jilin 130024, Jilin, Peoples R China
[2] Northeast Normal Univ, KLAS, Jilin 130024, Jilin, Peoples R China
[3] Southern Univ Sci & Technol, Dept Stat & Data Sci, Shenzhen, Peoples R China
[4] Univ Miami, Dept Management Sci, Coral Gables, FL 33124 USA
[5] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
基金
国家重点研发计划; 中国博士后科学基金;
关键词
Network analysis; Profile likelihood; Pseudo likelihood; Stochastic block model; Strong consistency; STOCHASTIC BLOCKMODELS; MAXIMUM-LIKELIHOOD; CONSISTENCY; APPROXIMATION; PREDICTION; RECOVERY; MODEL;
D O I
10.1080/01621459.2021.1996378
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The stochastic block model is one of the most studied network models for community detection, and fitting its likelihood function on large-scale networks is known to be challenging. One prominent work that overcomes this computational challenge is the fast pseudo-likelihood approach proposed by Amini et al. for fitting stochastic block models to large sparse networks. However, this approach does not have convergence guarantee, and may not be well suited for small and medium scale networks. In this article, we propose a novel likelihood based approach that decouples row and column labels in the likelihood function, enabling a fast alternating maximization. This new method is computationally efficient, performs well for both small- and large-scale networks, and has provable convergence guarantee. We show that our method provides strongly consistent estimates of communities in a stochastic block model. We further consider extensions of our proposed method to handle networks with degree heterogeneity and bipartite properties. Supplementary materials for this article are available online.
引用
收藏
页码:1359 / 1372
页数:14
相关论文
共 42 条
[1]  
Abbe E, 2020, ANN STAT, V48, P1452, DOI [10.1214/19-AOS1854, 10.1214/19-aos1854]
[2]   Community detection and stochastic block models: Recent developments [J].
Abbe, Emmanuel .
Journal of Machine Learning Research, 2018, 18 :1-86
[3]   Exact Recovery in the Stochastic Block Model [J].
Abbe, Emmanuel ;
Bandeira, Afonso S. ;
Hall, Georgina .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (01) :471-487
[4]  
Adamic Lada., 2005, P 3 INT WORKSHOP LIN, P36, DOI DOI 10.1145/1134271.1134277
[5]  
Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
[6]   PSEUDO-LIKELIHOOD METHODS FOR COMMUNITY DETECTION IN LARGE SPARSE NETWORKS [J].
Amini, Arash A. ;
Chen, Aiyou ;
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2013, 41 (04) :2097-2122
[7]   ASYMPTOTIC NORMALITY OF MAXIMUM LIKELIHOOD AND ITS VARIATIONAL APPROXIMATION FOR STOCHASTIC BLOCKMODELS [J].
Bickel, Peter ;
Choi, David ;
Chang, Xiangyu ;
Zhang, Hai .
ANNALS OF STATISTICS, 2013, 41 (04) :1922-1943
[8]   A nonparametric view of network models and Newman-Girvan and other modularities [J].
Bickel, Peter J. ;
Chen, Aiyou .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (50) :21068-21073
[9]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[10]  
Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111