Model-based clustering for random hypergraphs

被引:5
作者
Ng, Tin Lok James [1 ]
Murphy, Thomas Brendan [2 ]
机构
[1] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland
[2] Univ Coll Dublin, Sch Math & Stat, Dublin, Ireland
基金
爱尔兰科学基金会;
关键词
Clustering; Hypergraph; Latent class analysis; Minorization maximization; COLLABORATION NETWORK; AFFILIATION NETWORKS; MAXIMUM-LIKELIHOOD; PHASE-TRANSITION; 2-MODE; NUMBER;
D O I
10.1007/s11634-021-00454-7
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A probabilistic model for random hypergraphs is introduced to represent unary, binary and higher order interactions among objects in real-world problems. This model is an extension of the latent class analysis model that introduces two clustering structures for hyperedges and captures variation in the size of hyperedges. An expectation maximization algorithm with minorization maximization steps is developed to perform parameter estimation. Model selection using Bayesian Information Criterion is proposed. The model is applied to simulated data and two real-world data sets where interesting results are obtained.
引用
收藏
页码:691 / 723
页数:33
相关论文
共 48 条
  • [1] Agarwal S., 2006, P 23 INT C MACH LEAR, P17
  • [2] Statistical modelling of the group structure of social networks
    Aitkin, Murray
    Vu, Duy
    Francis, Brian
    [J]. SOCIAL NETWORKS, 2014, 38 : 74 - 87
  • [3] Scientific authorship and collaboration network analysis on malaria research in Benin: papers indexed in the web of science (1996–2016)
    Azondekon R.
    Harper Z.J.
    Agossa F.R.
    Welzig C.M.
    McRoy S.
    [J]. Global Health Research and Policy, 3 (1)
  • [4] Network analysis of 2-mode data
    Borgatti, SP
    Everett, MG
    [J]. SOCIAL NETWORKS, 1997, 19 (03) : 243 - 269
  • [5] Bu J, 2010, P INT C MULTIMEDIA M, P391, DOI [10.1145/1873951.1874005, DOI 10.1145/1873951.1874005]
  • [6] CLUSTERING CRITERIA FOR DISCRETE-DATA AND LATENT CLASS MODELS
    CELEUX, G
    GOVAERT, G
    [J]. JOURNAL OF CLASSIFICATION, 1991, 8 (02) : 157 - 176
  • [7] GOODNESS-OF-FIT TESTING FOR LATENT CLASS MODELS
    COLLINS, LM
    FIDLER, PL
    WUGALTER, SE
    LONG, JD
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 1993, 28 (03) : 375 - 389
  • [8] Phase transition of random non-uniform hypergraphs
    de Panafieu, Elie
    [J]. JOURNAL OF DISCRETE ALGORITHMS, 2015, 31 : 26 - 39
  • [9] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [10] Generalized blockmodeling of two-mode network data
    Doreian, P
    Batagelj, V
    Ferligoj, A
    [J]. SOCIAL NETWORKS, 2004, 26 (01) : 29 - 53