Learning latent block structure in weighted networks

被引:204
作者
Aicher, Christopher [1 ]
Jacobs, Abigail Z. [2 ]
Clauset, Aaron [2 ,3 ,4 ]
机构
[1] Univ Colorado, Dept Appl Math, Boulder, CO 80309 USA
[2] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[3] Univ Colorado, BioFrontiers Inst, Boulder, CO 80303 USA
[4] Santa Fe Inst, Santa Fe, NM 87501 USA
基金
美国国家科学基金会;
关键词
community detection; weighted relational data; block models; exponential family; variational Bayes;
D O I
10.1093/comnet/cnu026
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Community detection is an important task in network analysis, in which we aim to learn a network partition that groups together vertices with similar community-level connectivity patterns. By finding such groups of vertices with similar structural roles, we extract a compact representation of the network's large-scale structure, which can facilitate its scientific interpretation and the prediction of unknown or future interactions. Popular approaches, including the stochastic block model (SBM), assume edges are unweighted, which limits their utility by discarding potentially useful information. We introduce the weighted stochastic block model (WSBM), which generalizes the SBM to networks with edge weights drawn from any exponential family distribution. This model learns from both the presence and weight of edges, allowing it to discover structure that would otherwise be hidden when weights are discarded or thresholded. We describe a Bayesian variational algorithm for efficiently approximating this model's posterior distribution over latent block structures. We then evaluate the WSBM's performance on both edge-existence and edge-weight prediction tasks for a set of real-world weighted networks. In all cases, the WSBM performs as well or better than the best alternatives on these tasks.
引用
收藏
页码:221 / 248
页数:28
相关论文
共 35 条
[1]  
Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
[2]  
Ambroise C, 2012, J R STAT SOC B, V74, P3, DOI 10.1111/j.1467-9868.2011.01009.x
[3]  
[Anonymous], 2003, EXPLORING ARTIFICIAL
[4]  
[Anonymous], [No title captured]
[5]  
[Anonymous], 2010, NETWORKS INTRO, DOI DOI 10.1093/ACPROF:OSO/9780199206650.001.0001
[6]  
[Anonymous], 2013, ICML WORKSH STRUCT L
[7]  
Attias H, 2000, ADV NEUR IN, V12, P209
[8]  
Berger J.O., 1991, BAYSN STATIST, V4, P35
[9]  
Blitzstein J.K., 2011, ARXIV11010788
[10]   Consistency of maximum-likelihood and variational estimators in the stochastic block model [J].
Celisse, Alain ;
Daudin, Jean-Jacques ;
Pierre, Laurent .
ELECTRONIC JOURNAL OF STATISTICS, 2012, 6 :1847-1899