Two new matrix-variate distributions with application in model-based clustering

Cited by: 22
Authors
Tomarchio, Salvatore D. [1]
Punzo, Antonio [1]
Bagnato, Luca [2]
Affiliations
[1] Univ Catania, Dipartimento Econ & Impresa, Catania, Italy
[2] Univ Cattolica Sacro Cuore, Dipartimento Sci Econ & Sociali, Rome, Italy
Keywords
Matrix-variate; Mixture models; Heavy-tailed distributions; Clustering; Maximum likelihood; Finite mixtures; EM algorithm; ECM
DOI
10.1016/j.csda.2020.107050
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Two matrix-variate distributions, both elliptical heavy-tailed generalizations of the matrix-variate normal distribution, are introduced. They belong to the normal scale mixture family and are obtained by choosing a convenient shifted exponential or uniform distribution, respectively, as the mixing distribution. Moreover, each has a closed-form probability density function with only one parameter beyond those of the nested matrix-variate normal, and this parameter governs the tail weight. Both distributions are then used for model-based clustering via finite mixture models. Because the resulting mixtures handle data with atypical observations better than the matrix-variate normal mixture does, they can avoid disrupting the true underlying group structure. Different EM-based algorithms are implemented for parameter estimation and tested in terms of computational time and parameter recovery. Furthermore, these mixture models are fitted to simulated and real datasets, and their fitting and clustering performance is analyzed and compared with that of other well-established competitors. (C) 2020 Elsevier B.V. All rights reserved.
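The scale-mixture construction described in the abstract can be illustrated with a short sampling sketch. This is not the authors' implementation, and the parameterization is an assumption made for illustration only: a matrix-variate normal draw is inflated by a random factor w >= 1, where w is either 1 plus an exponential variable with rate theta (shifted exponential case) or the reciprocal of a uniform variable on (1 - theta, 1) (uniform case). The function name `sample_scale_mixture` and its arguments are hypothetical, and the paper's exact densities and parameter conventions may differ.

```python
import numpy as np


def sample_scale_mixture(M, Sigma, Psi, theta, mixing="shifted_exp", size=1, rng=None):
    """Draw matrices from a matrix-variate normal scale mixture (illustrative sketch).

    M     : (p, r) mean matrix
    Sigma : (p, p) row covariance, Psi : (r, r) column covariance
    theta : tail-weight parameter (ASSUMED parameterization, see lead-in)
    """
    rng = np.random.default_rng(rng)
    p, r = M.shape
    A = np.linalg.cholesky(Sigma)  # Sigma = A A^T
    B = np.linalg.cholesky(Psi)    # Psi   = B B^T

    if mixing == "shifted_exp":
        # Assumption: inflation factor w = 1 + Exp(rate=theta), support (1, inf)
        w = 1.0 + rng.exponential(scale=1.0 / theta, size=size)
    elif mixing == "uniform":
        # Assumption: v ~ U(1 - theta, 1) with 0 < theta < 1, inflation w = 1 / v
        w = 1.0 / rng.uniform(1.0 - theta, 1.0, size=size)
    else:
        raise ValueError("mixing must be 'shifted_exp' or 'uniform'")

    # Matrix-variate normal draws: M + A Z B^T with Z having i.i.d. N(0, 1) entries
    Z = rng.standard_normal((size, p, r))
    X = M + np.sqrt(w)[:, None, None] * (A @ Z @ B.T)
    return X, w


if __name__ == "__main__":
    X, w = sample_scale_mixture(np.zeros((2, 3)), np.eye(2), np.eye(3),
                                theta=0.5, mixing="shifted_exp", size=5, rng=0)
    print(X.shape, w.min() >= 1.0)
```

Under these assumptions every draw is a normal matrix whose covariance is scaled up by w, so a single extra parameter controls how often strongly inflated (atypical) observations occur, which matches the role the abstract assigns to the tail-weight parameter.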
Pages: 17