Two new matrix-variate distributions with application in model-based clustering

Cited by: 21
Authors
Tomarchio, Salvatore D. [1]
Punzo, Antonio [1]
Bagnato, Luca [2]
Affiliations
[1] Univ Catania, Dipartimento Econ & Impresa, Catania, Italy
[2] Univ Cattolica Sacro Cuore, Dipartimento Sci Econ & Sociali, Rome, Italy
Keywords
Matrix-variate; Mixture models; Heavy-tailed distributions; Clustering; MAXIMUM-LIKELIHOOD; FINITE MIXTURES; EM ALGORITHM; ECM
DOI
10.1016/j.csda.2020.107050
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Two matrix-variate distributions, both elliptical heavy-tailed generalizations of the matrix-variate normal distribution, are introduced. They belong to the normal scale mixture family and are obtained by choosing, respectively, a convenient shifted exponential or uniform distribution as the mixing distribution. Moreover, each has a closed-form probability density function that is characterized by only one additional parameter, relative to the nested matrix-variate normal, governing the tail weight. Both distributions are then used for model-based clustering via finite mixture models. The resulting mixtures handle data with atypical observations better than the matrix-variate normal mixture and can therefore avoid disrupting the true underlying group structure. Different EM-based algorithms are implemented for parameter estimation and tested in terms of computational time and parameter recovery. Furthermore, these mixture models are fitted to simulated and real datasets, and their fitting and clustering performances are analyzed and compared with those of other well-established competitors. (C) 2020 Elsevier B.V. All rights reserved.
Pages: 17
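The normal scale mixture construction mentioned in the abstract can be illustrated with a minimal sampling sketch. The record does not give the exact parametrization, so everything specific here is an assumption made for illustration: the function name `sample_matrix_normal_scale_mixture`, a shifted exponential mixing variable W = 1 + Exp(rate theta) that inflates the row covariance, and a uniform mixing variable W ~ U(1 - theta, 1) that divides it. It is not the authors' implementation.

```python
import numpy as np


def sample_matrix_normal_scale_mixture(n, M, Sigma, Psi, theta,
                                       mixing="shifted_exponential", seed=None):
    """Draw n random p x r matrices from a matrix-variate normal scale mixture.

    Assumed (hypothetical) parametrization, not taken from the paper:
      - "shifted_exponential": W = 1 + Exp(rate=theta), X | W=w ~ MVN(M, w*Sigma, Psi)
      - "uniform": W ~ U(1 - theta, 1) with 0 < theta < 1, X | W=w ~ MVN(M, Sigma/w, Psi)
    """
    rng = np.random.default_rng(seed)
    p, r = M.shape
    A = np.linalg.cholesky(Sigma)  # row-covariance factor, Sigma = A A^T
    B = np.linalg.cholesky(Psi)    # column-covariance factor, Psi = B B^T
    Z = rng.standard_normal((n, p, r))  # i.i.d. standard normal entries

    if mixing == "shifted_exponential":
        w = 1.0 + rng.exponential(scale=1.0 / theta, size=n)
        scale = np.sqrt(w)          # w >= 1 inflates the covariance
    elif mixing == "uniform":
        if not 0.0 < theta < 1.0:
            raise ValueError("theta must lie in (0, 1) for the uniform mixing")
        w = rng.uniform(1.0 - theta, 1.0, size=n)
        scale = 1.0 / np.sqrt(w)    # w <= 1 inflates the covariance
    else:
        raise ValueError(f"unknown mixing distribution: {mixing}")

    # A Z B^T is matrix-variate normal with covariances (Sigma, Psi);
    # each draw is then rescaled by its own mixing value.
    X = M + scale[:, None, None] * (A @ Z @ B.T)
    return X, w
```

Under these assumptions, every draw has covariance at least as large as the nested matrix-variate normal, which is what produces the heavier tails; a single extra parameter `theta` controls how much.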