Variable Selection in Clustering by Recursive Fit of Normal Distribution-based Salient Mixture Model

被引:0
|
作者
Kim, Seung-Gu [1 ]
机构
[1] Sangji Univ, Dept Data & Informat, 83 Usan Dong, Wonju 220702, South Korea
关键词
Saliency parameter; variable selection; clustering; normal mixture model; EM algorithm;
D O I
10.5351/KJAS.2013.26.5.821
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Law et al. (2004) proposed a normal distribution based salient mixture model for variable selection in clustering. However, this model has substantial problems such as the unidentifiability of components and the inaccurate selection of informative variables in the case of a small cluster size. We propose an alternative method to overcome problems and demonstrate a good performance through experiments on simulated data and real data.
引用
收藏
页码:821 / 834
页数:14
相关论文
共 50 条
  • [1] Variable Selection in Normal Mixture Model Based Clustering under Heteroscedasticity
    Kim, Seung-Gu
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (06) : 1213 - 1224
  • [2] Distribution-Based Trajectory Clustering
    Wang, Zi Jing
    Zhu, Ye
    Ting, Kai Ming
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1379 - 1384
  • [3] Finite mixture regression: A sparse variable selection by model selection for clustering
    Devijver, Emilie
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (02): : 2642 - 2674
  • [4] t distribution-based robust semiparametric mixture regression model
    Ge, Yan
    Xiang, Sijia
    Yao, Weixin
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [5] Fisher's z Distribution-Based Mixture Autoregressive Model
    Solikhah, Arifatus
    Kuswanto, Heri
    Iriawan, Nur
    Fithriasari, Kartika
    ECONOMETRICS, 2021, 9 (03)
  • [6] Variable selection for model-based clustering
    Raftery, AE
    Dean, N
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) : 168 - 178
  • [7] High-dimensional variable selection with the plaid mixture model for clustering
    Thierry Chekouo
    Alejandro Murua
    Computational Statistics, 2018, 33 : 1475 - 1496
  • [8] High-dimensional variable selection with the plaid mixture model for clustering
    Chekouo, Thierry
    Murua, Alejandro
    COMPUTATIONAL STATISTICS, 2018, 33 (03) : 1475 - 1496
  • [9] Distribution-Based Cluster Structure Selection
    Yu, Zhiwen
    Zhu, Xianjun
    Wong, Hau-San
    You, Jane
    Zhang, Jun
    Han, Guoqiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (11) : 3554 - 3567
  • [10] Variable Selection for Clustering with Gaussian Mixture Models
    Maugis, Cathy
    Celeux, Gilles
    Martin-Magniette, Marie-Laure
    BIOMETRICS, 2009, 65 (03) : 701 - 709