Gaussian parsimonious clustering models with covariates and a noise component

被引:28
|
作者
Murphy, Keefe [1 ,2 ]
Murphy, Thomas Brendan [1 ,2 ]
机构
[1] Univ Coll Dublin, Sch Math & Stat, Dublin, Ireland
[2] Univ Coll Dublin, Insight Ctr Data Analyt, Dublin, Ireland
基金
爱尔兰科学基金会;
关键词
Model-based clustering; Mixtures of experts; EM algorithm; Parsimony; Multivariate response; Covariates; Noise component; FINITE MIXTURES; R PACKAGE; CLASSIFICATION; REGRESSIONS; LIKELIHOOD; VARIABLES; EXPERTS;
D O I
10.1007/s11634-019-00373-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider model-based clustering methods for continuous, correlated data that account for external information available in the presence of mixed-type fixed covariates by proposing the MoEClust suite of models. These models allow different subsets of covariates to influence the component weights and/or component densities by modelling the parameters of the mixture as functions of the covariates. A familiar range of constrained eigen-decomposition parameterisations of the component covariance matrices are also accommodated. This paper thus addresses the equivalent aims of including covariates in Gaussian parsimonious clustering models and incorporating parsimonious covariance structures into all special cases of the Gaussian mixture of experts framework. The MoEClust models demonstrate significant improvement from both perspectives in applications to both univariate and multivariate data sets. Novel extensions to include a uniform noise component for capturing outliers and to address initialisation of the EM algorithm, model selection, and the visualisation of results are also proposed.
引用
收藏
页码:293 / 325
页数:33
相关论文
共 50 条
  • [21] Parsimonious Gaussian Process Models for the Classification of Hyperspectral Remote Sensing Images
    Fauvel, Mathieu
    Bouveyron, Charles
    Girard, Stephane
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (12) : 2423 - 2427
  • [22] Gaussian-Based Visualization of Gaussian and Non-Gaussian-Based Clustering
    Biernacki, Christophe
    Marbac, Matthieu
    Vandewalle, Vincent
    JOURNAL OF CLASSIFICATION, 2021, 38 (01) : 129 - 157
  • [23] Parsimonious multiscale classification models
    Alsberg, BK
    JOURNAL OF CHEMOMETRICS, 2000, 14 (5-6) : 529 - 539
  • [24] Component Reduction for Gaussian Mixture Models
    Maebashi, Kumiko
    Suematsu, Nobuo
    Hayashi, Akira
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (12) : 2846 - 2853
  • [25] Variable Selection for Clustering with Gaussian Mixture Models
    Maugis, Cathy
    Celeux, Gilles
    Martin-Magniette, Marie-Laure
    BIOMETRICS, 2009, 65 (03) : 701 - 709
  • [26] Vine copula mixture models and clustering for non-Gaussian data
    Sahin, Ozge
    Czado, Claudia
    ECONOMETRICS AND STATISTICS, 2022, 22 : 136 - 158
  • [27] A family of parsimonious mixtures of multivariate Poisson-lognormal distributions for clustering multivariate count data
    Subedi, Sanjeena
    Browne, Ryan P.
    STAT, 2020, 9 (01):
  • [28] Gaussian Mixture Models Implementation to Enhance Spectral Clustering
    Valdes, D. A.
    Ayaquica, I. O.
    Guillen, C.
    Vazquez, J. R.
    IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (03) : 1416 - 1426
  • [29] Multivariate Response and Parsimony for Gaussian Cluster-Weighted Models
    Dang, Utkarsh J.
    Punzo, Antonio
    McNicholas, Paul D.
    Ingrassia, Salvatore
    Browne, Ryan P.
    JOURNAL OF CLASSIFICATION, 2017, 34 (01) : 4 - 34
  • [30] Families of Parsimonious Finite Mixtures of Regression Models
    Dang, Utkarsh J.
    McNicholas, Paul D.
    ADVANCES IN STATISTICAL MODELS FOR DATA ANALYSIS, 2015, : 73 - 84