Thinned random measures for sparse graphs with overlapping communities

被引:0
作者
Ricci, Federica Zoe [1 ]
Guindani, Michele [2 ]
Sudderth, Erik B. [3 ]
机构
[1] Univ Calif Irvine, Dept Stat, Irvine, CA 92717 USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA USA
[3] Univ Calif Irvine, Dept Comp Sci & Stat, Irvine, CA USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
关键词
MODELS; ARRAYS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Network models for exchangeable arrays, including most stochastic block models, generate dense graphs with a limited ability to capture many characteristics of real-world social and biological networks. A class of models based on completely random measures like the generalized gamma process (GGP) have recently addressed some of these limitations. We propose a framework for thinning edges from realizations of GGP random graphs that models observed links via nodes' overall propensity to interact, as well as the similarity of node memberships within a large set of latent communities. Our formulation allows us to learn the number of communities from data, and enables efficient Monte Carlo methods that scale linearly with the number of observed edges, and thus (unlike dense block models) sub-quadratically with the number of entities or nodes. We compare to alternative models for both dense and sparse networks, and demonstrate effective recovery of latent community structure for real-world networks with thousands of nodes.
引用
收藏
页数:14
相关论文
共 45 条