Mixtures of factor analyzers with scale mixtures of fundamental skew normal distributions

被引:0
作者
Sharon X. Lee
Tsung-I Lin
Geoffrey J. McLachlan
机构
[1] University of Adelaide,School of Mathematical Science
[2] National Chung Hsing University,Institute of Statistics
[3] China Medical University,Department of Public Health
[4] University of Queensland,School of Mathematics and Physics
来源
Advances in Data Analysis and Classification | 2021年 / 15卷
关键词
Mixture models; Factor Analysis; Skew distributions; EM algorithm; Clustering; 62H30;
D O I
暂无
中图分类号
学科分类号
摘要
Mixtures of factor analyzers (MFA) provide a powerful tool for modelling high-dimensional datasets. In recent years, several generalizations of MFA have been developed where the normality assumption of the factors and/or of the errors were relaxed to allow for skewness in the data. However, due to the form of the adopted component densities, the distribution of the factors/errors in most of these models is typically limited to modelling skewness concentrated in a single direction. Here, we introduce a more flexible finite mixture of factor analyzers based on the class of scale mixtures of canonical fundamental skew normal (SMCFUSN) distributions. This very general class of skew distributions can capture various types of skewness and asymmetry in the data. In particular, the proposed mixtures of SMCFUSN factor analyzers (SMCFUSNFA) can simultaneously accommodate multiple directions of skewness. As such, it encapsulates many commonly used models as special and/or limiting cases, such as models of some versions of skew normal and skew t-factor analyzers, and skew hyperbolic factor analyzers. For illustration, we focus on the t-distribution member of the class of SMCFUSN distributions, leading to mixtures of canonical fundamental skew t-factor analyzers (CFUSTFA). Parameter estimation can be carried out by maximum likelihood via an EM-type algorithm. The usefulness and potential of the proposed model are demonstrated using four real datasets.
引用
收藏
页码:481 / 512
页数:31
相关论文
共 123 条
  • [1] Arellano-Valle RB(2006)On the unification of families of skew-normal distributions Scand J Stat 33 561-574
  • [2] Azzalini A(2005)On fundamental skew distributions J Multivar Anal 96 93-116
  • [3] Arellano-Valle RB(1996)The multivariate skew-normal distribution Biometrika 83 715-726
  • [4] Genton MG(2000)Assessing a mixture model for clustering with the integrated completed likelihood IEEE Trans Pattern Anal Mach Intell 22 719-725
  • [5] Azzalini A(2015)A mixture of generalized hyperbolic distributions Can J Stat 43 176-198
  • [6] Dalla Valle A(2012)Multivariate mixture modeling using skew-normal independent distributions Comput Stat Data Anal 56 126-142
  • [7] Biernacki C(1977)Maximum likelihood from incomplete data via the EM algorithm J Royal Stat Soc B 39 1-38
  • [8] Celeux G(2015)Computer-aided classification of melanocytic lesions using dermoscopic images J Am Acad Dermatol 73 769-776
  • [9] Govaert G(1982)Pattern recognition methods in the prediction of italian olive oil origin by their fatty acid content Annali di Chimica 72 143-155
  • [10] Browne RP(2012)Some results on the truncated multivariate J Stat Plan Inference 142 25-40