Constrained parsimonious model-based clustering

Cited by: 0
Authors
Luis A. García-Escudero
Agustín Mayo-Iscar
Marco Riani
Affiliations
[1] University of Valladolid, Department of Statistics and Operational Research and IMUVA
[2] University of Parma, Department of Economics and Management and Interdepartmental Centre of Robust Statistics
Source
Statistics and Computing | 2022 / Vol. 32
Keywords
Model-based clustering; Mixture modeling; Constraints
DOI
Not available
Abstract
A new methodology for constrained parsimonious model-based clustering is introduced, in which a tuning parameter controls the strength of the constraints. The methodology includes, as limit cases, the 14 parsimonious models that are often applied in model-based clustering under the assumption of normal components. It does so in a natural way, filling the gaps between models and providing a smooth transition among them. The methodology yields mathematically well-defined problems and also helps to avoid spurious solutions. Novel information criteria are proposed to help the user choose the parameters. The interest of the proposed methodology is illustrated through simulation studies and a real-data application on COVID data.