OVERLAPPING STOCHASTIC BLOCK MODELS WITH APPLICATION TO THE FRENCH POLITICAL BLOGOSPHERE

被引:111
作者
Latouche, Pierre [1 ]
Birmele, Etienne [1 ]
Ambroise, Christophe [1 ]
机构
[1] Univ Evry, INRA 1152, UMR 8071, Lab Stat & Genome,CNRS, F-91000 Evry, France
关键词
Random graph models; blockmodels; overlapping clusters; global and local variational techniques; COMMUNITY STRUCTURE; COMPLEX NETWORKS; BLOCKMODELS; PREDICTION; GRAPHS;
D O I
10.1214/10-AOAS382
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Complex systems in nature and in society are often represented as networks, describing the rich set of interactions between objects of interest. Many deterministic and probabilistic clustering methods have been developed to analyze such structures. Given a network, almost all of them partition the vertices into disjoint clusters, according to their connection profile. However, recent studies have shown that these techniques were too restrictive and that most of the existing networks contained overlapping clusters. To tackle this issue, we present in this paper the Overlapping Stochastic Block Model. Our approach allows the vertices to belong to multiple clusters, and, to some extent, generalizes the well-known Stochastic Block Model [Nowicki and Snijders (2001)]. We show that the model is generically identifiable within classes of equivalence and we propose an approximate inference procedure, based on global and local variational techniques. Using toy data sets as well as the French Political Blogosphere network and the transcriptional network of Saccharomyces cerevisiae, we compare our work with other approaches.
引用
收藏
页码:309 / 336
页数:28
相关论文
共 41 条
[1]  
Airoldi E.M., 2006, P INT BIOM SOC ANN M, VVolume 15
[2]  
Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
[3]  
Airoldi EM, 2007, MIXED MEMBERSHIP ANA
[4]   IDENTIFIABILITY OF PARAMETERS IN LATENT STRUCTURE MODELS WITH MANY OBSERVED VARIABLES [J].
Allman, Elizabeth S. ;
Matias, Catherine ;
Rhode, John A. .
ANNALS OF STATISTICS, 2009, 37 (6A) :3099-3132
[5]  
[Anonymous], 1998, Connections
[6]  
[Anonymous], 2005, NIPS
[7]  
[Anonymous], 1981, Sociological methodology, DOI DOI 10.2307/270741
[8]   A nonparametric view of network models and Newman-Girvan and other modularities [J].
Bickel, Peter J. ;
Chen, Aiyou .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (50) :21068-21073
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]  
Boer P., 2006, STOCNET OPEN SOFTWAR