Nonparametric Bayesian Deep Visualization

被引:0
|
作者
Ishizuka, Haruya [1 ]
Mochihashi, Daichi [2 ]
机构
[1] Bridgeston Corp, Chuo City, Japan
[2] Inst Stat Math, Tachikawa, Tokyo, Japan
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I | 2023年 / 13713卷
关键词
Data visualization; Gaussian processes; Nonparametric Bayesian models; Neural network; NONLINEAR DIMENSIONALITY REDUCTION; INFERENCE;
D O I
10.1007/978-3-031-26387-3_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visualization methods such as t-SNE [1] have helped in knowledge discovery from high-dimensional data; however, their performance may degrade when the intrinsic structure of observations is in low-dimensional space, and they cannot estimate clusters that are often useful to understand the internal structure of a dataset. A solution is to visualize the latent coordinates and clusters estimated using a neural clustering model. However, they require a long computational time since they have numerous weights to train and must tune the layer width, the number of latent dimensions and clusters to appropriately model the latent space. Additionally, the estimated coordinates may not be suitable for visualization since such a model and visualization method are applied independently. We utilize neural network Gaussian processes (NNGP) [2] equivalent to a neural network whose weights are marginalized to eliminate the necessity to optimize weights and layer widths. Additionally, to determine latent dimensions and the number of clusters without tuning, we propose a latent variable model that combines NNGP with automatic relevance determination [3] to extract necessary dimensions of latent space and infinite Gaussian mixture model [4] to infer the number of clusters. We integrate this model and visualization method into nonparametric Bayesian deep visualization (NPDV) that learns latent and visual coordinates jointly to render latent coordinates optimal for visualization. Experimental results on images and document datasets show that NPDV shows superior accuracy to existing methods, and it requires less training time than the neural clustering model because of its lower tuning cost. Furthermore, NPDV can reveal plausible latent clusters without labels.
引用
收藏
页码:121 / 137
页数:17
相关论文
共 50 条
  • [21] A Bayesian Nonparametric Approach for Time Series Clustering
    Nieto-Barajas, Luis E.
    Contreras-Cristan, Alberto
    BAYESIAN ANALYSIS, 2014, 9 (01): : 147 - 169
  • [22] A Bayesian nonparametric model for upper record data
    Seo, Jung-In
    Song, Joon Jin
    APPLIED MATHEMATICAL MODELLING, 2019, 71 : 363 - 374
  • [23] A nonparametric Bayesian methodology for regression discontinuity designs
    Branson, Zach
    Rischard, Maxime
    Bornn, Luke
    Miratrix, Luke W.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2019, 202 : 14 - 30
  • [24] Bayesian Nonparametric Modeling for Multivariate Ordinal Regression
    DeYoreo, Maria
    Kottas, Athanasios
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2018, 27 (01) : 71 - 84
  • [25] UNCERTAINTY QUANTIFICATION THROUGH BAYESIAN NONPARAMETRIC MODELLING
    Kockova, E.
    Kucerova, A.
    Sykora, J.
    ENGINEERING MECHANICS 2020 (IM2020), 2020, : 274 - 277
  • [26] Bayesian nonparametric trees for principal causal effects
    Kim, Chanmin
    Zigler, Corwin
    BIOMETRICS, 2025, 81 (01)
  • [27] HEAVY-TAILED BAYESIAN NONPARAMETRIC ADAPTATION
    Agapiou, Sergios
    Castillo, Ismael
    ANNALS OF STATISTICS, 2024, 52 (04) : 1433 - 1459
  • [28] On Bayesian nonparametric modelling of two correlated distributions
    Kolossiatis, M.
    Griffin, J. E.
    Steel, M. F. J.
    STATISTICS AND COMPUTING, 2013, 23 (01) : 1 - 15
  • [29] Bayesian Nonparametric Ordination for the Analysis of Microbial Communities
    Ren, Boyu
    Bacallado, Sergio
    Favaro, Stefano
    Holmes, Susan
    Trippa, Lorenzo
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (520) : 1430 - 1442
  • [30] Robustifying Bayesian Nonparametric Mixtures for Count Data
    Canale, Antonio
    Prunster, Igor
    BIOMETRICS, 2017, 73 (01) : 174 - 184