A likelihood-ratio type test for stochastic block models with bounded degrees

被引:3
作者
Yuan, Mingao [1 ]
Feng, Yang [2 ]
Shang, Zuofeng [3 ]
机构
[1] North Dakota State Univ, Dept Stat, 1340 Adm Ave, Fargo, ND 58102 USA
[2] NYU, Coll Global Publ Hlth, 665 Broadway, New York, NY 10003 USA
[3] New Jersey Inst Technol, Dept Math Sci, 323 Dr MLK Jr Blvd, Newark, NJ 07102 USA
基金
美国国家科学基金会;
关键词
Stochastic block model; Bounded degrees; Hypothesis testing; Likelihood ratio; Contiguity theory; COMMUNITY DETECTION; NETWORK;
D O I
10.1016/j.jspi.2021.12.005
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A fundamental problem in network data analysis is to test Erdos-Renyi model G (n, a+b) versus a bisection stochastic block model G (n, a ), where a, b > 0 are constants that represent the expected degrees of the graphs and n denotes the number of nodes. This problem serves as the foundation of many other problems such as testing-based methods for determining the number of communities (Bickel and Sarkar, 2016; Lei, 2016) and community detection (Montanari and Sen, 2016). Existing work has been focusing on growing-degree regime a, b -> infinity (Bickel and Sarkar, 2016; Lei, 2016; Montanari and Sen, 2016; Banerjee and Ma, 2017; Banerjee, 2018; Gao and Lafferty, 2017a,b) while leaving the bounded-degree regime untreated. In this paper, we propose a likelihoodratio (LR) type procedure based on regularization to test stochastic block models with bounded degrees. We derive the limit distributions as power Poisson laws under both null and alternative hypotheses, based on which the limit power of the test is carefully analyzed. We also examine a Monte-Carlo method that partly resolves the computational cost issue. The proposed procedures are examined by both simulated and real-world data. The proof depends on a contiguity theory developed by Janson (1995).
引用
收藏
页码:98 / 119
页数:22
相关论文
共 30 条