Mixtures of general location model with factor analyzer covariance structure for clustering mixed type data

被引:1
|
作者
Amiri, Leila [1 ]
Khazaei, Mojtaba [1 ]
Ganjali, Mojtaba [1 ]
机构
[1] Shahid Beheshti Univ, Dept Stat, Tehran, Iran
关键词
Mixture models; mixed type data; general location model; factor analysis; model-based clustering; the ECM algorithm; DISCRIMINANT-ANALYSIS; ELEMENT CONTENTS; CLASSIFICATION; VARIABLES; ALGORITHM; BINARY;
D O I
10.1080/02664763.2019.1579307
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Cluster analysis is one of the most widely used method in statistical analyses, in which homogeneous subgroups are identified in a heterogeneous population. Due to the existence of the continuous and discrete mixed data in many applications, so far, some ordinary clustering methods such as, hierarchical methods, k-means and model-based methods have been extended for analysis of mixed data. However, in the available model-based clustering methods, by increasing the number of continuous variables, the number of parameters increases and identifying as well as fitting an appropriate model may be difficult. In this paper, to reduce the number of the parameters, for the model-based clustering mixed data of continuous (normal) and nominal data, a set of parsimonious models is introduced. Models in this set are extended, using the general location model approach, for modeling distribution of mixed variables and applying factor analyzer structure for covariance matrices. The ECM algorithm is used for estimating the parameters of these models. In order to show the performance of the proposed models for clustering, results from some simulation studies and analyzing two real data sets are presented.
引用
收藏
页码:2075 / 2100
页数:26
相关论文
共 50 条
  • [1] General location model with factor analyzer covariance matrix structure and its applications
    Amiri, Leila
    Khazaei, Mojtaba
    Ganjali, Mojtaba
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2017, 11 (03) : 593 - 609
  • [2] General location model with factor analyzer covariance matrix structure and its applications
    Leila Amiri
    Mojtaba Khazaei
    Mojtaba Ganjali
    Advances in Data Analysis and Classification, 2017, 11 : 593 - 609
  • [3] Clustering Mixed-Type Data via Dirichlet Process Mixture Model with Cluster-Specific Covariance Matrices
    Burhanuddin, Nurul Afiqah
    Ibrahim, Kamarulzaman
    Zulkafli, Hani Syahida
    Mustapha, Norwati
    SYMMETRY-BASEL, 2024, 16 (06):
  • [4] Model-based clustering, classification, and discriminant analysis of data with mixed type
    Browne, Ryan P.
    McNicholas, Paul D.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (11) : 2976 - 2984
  • [5] Identifiable finite mixtures of location models for clustering mixed-mode data
    Willse, A
    Boik, RJ
    STATISTICS AND COMPUTING, 1999, 9 (02) : 111 - 121
  • [6] Identifiable finite mixtures of location models for clustering mixed-mode data
    Alan Willse
    Robert J. Boik
    Statistics and Computing, 1999, 9 : 111 - 121
  • [7] Clustering mixed type data: a space structure-based approach
    Li, Feijiang
    Qian, Yuhua
    Wang, Jieting
    Peng, Furong
    Liang, Jiye
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (09) : 2799 - 2812
  • [8] Model-based clustering of censored data via mixtures of factor analyzers
    Wang, Wan-Lun
    Castro, Luis M.
    Lachos, Victor H.
    Lin, Tsung-I
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 140 (104-121) : 104 - 121
  • [9] Composite likelihood methods for parsimonious model-based clustering of mixed-type data
    Ranalli, Monia
    Rocci, Roberto
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (02) : 381 - 407
  • [10] Clustering Approaches for Mixed-Type Data: A Comparative Study
    Ghattas, Badih
    San-Benito, Alvaro Sanchez
    JOURNAL OF PROBABILITY AND STATISTICS, 2025, 2025 (01)