Bayesian nonparametric clustering as a community detection problem

被引:2
|
作者
Tonellato, Stefano F. [1 ]
机构
[1] Ca Foscari Univ Venice, Dept Econ, Cannaregio 873, I-30121 Venice, Italy
关键词
Dirichlet process priors; Mixture models; Community detection; Entropy; Clustering uncertainty; MONTE-CARLO METHODS; MIXTURE MODEL; DENSITY-ESTIMATION; SAMPLING METHODS; RANDOM-WALKS; CLASSIFICATION; SELECTION; NUMBER;
D O I
10.1016/j.csda.2020.107044
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A wide class of Bayesian nonparametric priors leads to the representation of the distribution of the observable variables as a mixture density with an infinite number of components. Such a representation induces a clustering structure in the data. However, due to label switching, cluster identification is not straightforward a posteriori and some post-processing of the MCMC output is usually required. Alternatively, observations can be mapped on a weighted undirected graph, where each node represents a sample item and edge weights are given by the posterior pairwise similarities. It is shown how, after building a particular random walk on such a graph, it is possible to apply a community detection algorithm, known as map equation, leading to the minimisation of the expected description length of the partition. A relevant feature of this method is that it allows for the quantification of the posterior uncertainty of the classification. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Malware Detection Using Nonparametric Bayesian Clustering and Classification Techniques
    Kao, Yimin
    Reich, Brian
    Storlie, Curtis
    Anderson, Blake
    TECHNOMETRICS, 2015, 57 (04) : 535 - 546
  • [2] A Bayesian Nonparametric Approach for Time Series Clustering
    Nieto-Barajas, Luis E.
    Contreras-Cristan, Alberto
    BAYESIAN ANALYSIS, 2014, 9 (01): : 147 - 169
  • [3] Bayesian nonparametric clustering and association studies for candidate SNP observations
    Wang, Charlotte
    Ruggeri, Fabrizio
    Hsiao, Chuhsing K.
    Argiento, Raffaele
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 80 : 19 - 35
  • [4] Nonparametric Bayesian Clustering Ensembles
    Wang, Pu
    Domeniconi, Carlotta
    Laskey, Kathryn Blackmond
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2010, 6323 : 435 - 450
  • [5] Bayesian Complex Network Community Detection Using Nonparametric Topic Model
    Zhu, Ruimin
    Jiang, Wenxin
    COMPLEX NETWORKS AND THEIR APPLICATIONS VII, VOL 1, 2019, 812 : 280 - 291
  • [6] Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination
    Yau, Christopher
    Holmes, Chris
    BAYESIAN ANALYSIS, 2011, 6 (02): : 329 - 351
  • [7] A Bayesian Nonparametric Model for Integrative Clustering of Omics Data
    Peneva, Iliana
    Savage, Richard S.
    BAYESIAN STATISTICS AND NEW GENERATIONS, BAYSM 2018, 2019, 296 : 105 - 114
  • [8] A Nonparametric Bayesian Model for Local Clustering With Application to Proteomics
    Lee, Juhee
    Mueller, Peter
    Zhu, Yitan
    Ji, Yuan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (503) : 775 - 788
  • [9] Clustering-Based Online News Topic Detection and Tracking Through Hierarchical Bayesian Nonparametric Models
    Fan, Wentao
    Guo, Zhiyan
    Bouguila, Nizar
    Hou, Wenjuan
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2126 - 2130
  • [10] DNB: A Joint Learning Framework for Deep Bayesian Nonparametric Clustering
    Wang, Zeya
    Ni, Yang
    Jing, Baoyu
    Wang, Deqing
    Zhang, Hao
    Xing, Eric
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7610 - 7620