Two new matrix-variate distributions with application in model-based clustering

Cited by: 21
Authors
Tomarchio, Salvatore D. [1]
Punzo, Antonio [1]
Bagnato, Luca [2]
Affiliations
[1] Univ Catania, Dipartimento Econ & Impresa, Catania, Italy
[2] Univ Cattolica Sacro Cuore, Dipartimento Sci Econ & Sociali, Rome, Italy
Keywords
Matrix-variate; Mixture models; Heavy-tailed distributions; Clustering; Maximum likelihood; Finite mixtures; EM algorithm; ECM
DOI
10.1016/j.csda.2020.107050
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Two matrix-variate distributions, both elliptical heavy-tailed generalizations of the matrix-variate normal distribution, are introduced. They belong to the normal scale mixture family and are obtained by choosing, respectively, a convenient shifted exponential or uniform as the mixing distribution. Moreover, each has a closed-form probability density function with only one parameter beyond those of the nested matrix-variate normal; this parameter governs the tail weight. Both distributions are then used for model-based clustering via finite mixture models. The resulting mixtures handle data with atypical observations better than the matrix-variate normal mixture and can therefore avoid disrupting the true underlying group structure. Different EM-based algorithms are implemented for parameter estimation and tested in terms of computational time and parameter recovery. Furthermore, these mixture models are fitted to simulated and real datasets, and their fitting and clustering performances are analyzed and compared with those of other well-established competitors. (C) 2020 Elsevier B.V. All rights reserved.
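The scale-mixture construction mentioned in the abstract can be illustrated with a minimal sampling sketch. It is not taken from the paper: the function name `sample_scale_mixture`, the inputs `A` and `B` (factors of the row and column covariance matrices), and the parameterization of the mixing variable as `1 + Exponential(theta)` are all assumptions made for illustration, and the authors' exact parameterization may differ.

```python
import math
import random


def matmul(P, Q):
    """Plain-Python matrix product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Q)] for row in P]


def transpose(P):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*P)]


def sample_scale_mixture(M, A, B, theta, rng):
    """Draw one p x r matrix X = M + sqrt(w) * A Z B^T.

    Hypothetical inputs (not from the paper):
      M     -- p x r mean matrix
      A, B  -- factors with Sigma = A A^T (rows) and Psi = B B^T (columns)
      theta -- rate (> 0) of the assumed mixing variable
    """
    p, r = len(M), len(M[0])
    # Assumed mixing variable: exponential with rate theta, shifted to start at 1.
    w = 1.0 + rng.expovariate(theta)
    # Z has independent standard normal entries.
    Z = [[rng.gauss(0.0, 1.0) for _ in range(r)] for _ in range(p)]
    AZBt = matmul(matmul(A, Z), transpose(B))
    scale = math.sqrt(w)
    X = [[M[i][j] + scale * AZBt[i][j] for j in range(r)] for i in range(p)]
    return X, w
```

Under this assumed parameterization, a smaller `theta` makes large values of `w` more likely and so produces heavier tails, while a very large `theta` keeps `w` close to 1 and the draws close to matrix-variate normal.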
Pages: 17
Related papers
50 in total
  • [11] An overview of skew distributions in model-based clustering
    Lee, Sharon X.
    McLachlan, Geoffrey J.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 188
  • [12] Matrix-variate data analysis by two-way factor model with replicated observations
    Li, Yan
    Gao, Zhigen
    Huang, Wei
    Guo, Jianhua
    STATISTICS & PROBABILITY LETTERS, 2023, 202
  • [13] Matrix-variate normal mean-variance Birnbaum–Saunders distributions and related mixture models
    Tomarchio, Salvatore D.
    Computational Statistics, 2024, 39 : 405 - 432
  • [14] Model-Based Clustering
    Gormley, Isobel Claire
    Murphy, Thomas Brendan
    Raftery, Adrian E.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 573 - 595
  • [15] New global optimization algorithms for model-based clustering
    Heath, Jeffrey W.
    Fu, Michael C.
    Jank, Wolfgang
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (12) : 3999 - 4017
  • [16] A robust model-based clustering based on the geometric median and the median covariation matrix
    Godichon-Baggioni, Antoine
    Robin, Stephane
    STATISTICS AND COMPUTING, 2024, 34 (01)
  • [17] Model-based clustering via new parsimonious mixtures of heavy-tailed distributions
    Tomarchio, Salvatore D.
    Bagnato, Luca
    Punzo, Antonio
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2022, 106 (02) : 315 - 347
  • [18] Model-based clustering and classification of functional data
    Chamroukhi, Faicel
    Nguyen, Hien D.
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (04)
  • [19] A matrix-variate Dirichlet process to model earthquake hypocentre temporal patterns
    Ray, Meredith A.
    Bowman, Dale
    Csontos, Ryan
    Van Arsdale, Roy B.
    Zhang, Hongmei
    STATISTICAL MODELLING, 2022, 22 (04) : 245 - 272
  • [20] Model-based clustering with non-elliptically contoured distributions
    Karlis, Dimitris
    Santourian, Anais
    Statistics and Computing, 2009, 19 : 73 - 83