Two new matrix-variate distributions with application in model-based clustering

被引:22
作者
Tomarchio, Salvatore D. [1 ]
Punzo, Antonio [1 ]
Bagnato, Luca [2 ]
机构
[1] Univ Catania, Dipartimento Econ & Impresa, Catania, Italy
[2] Univ Cattolica Sacro Cuore, Dipartimento Sci Econ & Sociali, Rome, Italy
关键词
Matrix-variate; Mixture models; Heavy-tailed distributions; Clustering; MAXIMUM-LIKELIHOOD; FINITE MIXTURES; EM ALGORITHM; ECM;
D O I
10.1016/j.csda.2020.107050
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Two matrix-variate distributions, both elliptical heavy-tailed generalization of the matrix-variate normal distribution, are introduced. They belong to the normal scale mixture family, and are respectively obtained by choosing a convenient shifted exponential or uniform as mixing distribution. Moreover, they have a closed-form for the probability density function that is characterized by only one additional parameter, with respect to the nested matrix-variate normal, governing the tail-weight. Both distributions are then used for model-based clustering via finite mixture models. The resulting mixtures, being able to handle data with atypical observations in a better way than the matrix-variate normal mixture, can avoid the disruption of the true underlying group structure. Different EM-based algorithms are implemented for parameter estimation and tested in terms of computational times and parameter recovery. Furthermore, these mixture models are fitted to simulated and real datasets, and their fitting and clustering performances are analyzed and compared to those obtained by other well-established competitors. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] The multivariate leptokurtic-normal distribution and its application in model-based clustering
    Bagnato, Luca
    Punzo, Antonio
    Zoia, Maria G.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2017, 45 (01): : 95 - 119
  • [32] A robust model-based clustering based on the geometric median and the median covariation matrix
    Antoine Godichon-Baggioni
    Stéphane Robin
    Statistics and Computing, 2024, 34
  • [33] Model-based clustering via new parsimonious mixtures of heavy-tailed distributions
    Salvatore D. Tomarchio
    Luca Bagnato
    Antonio Punzo
    AStA Advances in Statistical Analysis, 2022, 106 : 315 - 347
  • [34] Model-based clustering with envelopes
    Wang, Wenjing
    Zhang, Xin
    Mai, Qing
    ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 82 - 109
  • [35] Model-based clustering using a new multivariate skew distribution
    Tomarchio, Salvatore D.
    Bagnato, Luca
    Punzo, Antonio
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (01) : 61 - 83
  • [36] Model-Based Clustering and Classification Using Mixtures of Multivariate Skewed Power Exponential Distributions
    Dang, Utkarsh J.
    Gallaugher, Michael P. B.
    Browne, Ryan P.
    McNicholas, Paul D.
    JOURNAL OF CLASSIFICATION, 2023, 40 (01) : 145 - 167
  • [37] Dimension reduction for model-based clustering via mixtures of multivariate t-distributions
    Morris, Katherine
    McNicholas, Paul D.
    Scrucca, Luca
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2013, 7 (03) : 321 - 338
  • [38] Model-based clustering and outlier detection with missing data
    Tong, Hung
    Tortora, Cristina
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) : 5 - 30
  • [39] Addressing overfitting and underfitting in Gaussian model-based clustering
    Andrews, Jeffrey L.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 127 : 160 - 171
  • [40] Model-Based Tensor Low-Rank Clustering
    Li, Junge
    Mai, Qing
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (01) : 208 - 218