Variable Selection in Clustering by Recursive Fit of Normal Distribution-based Salient Mixture Model

被引:0
|
作者
Kim, Seung-Gu [1 ]
机构
[1] Sangji Univ, Dept Data & Informat, 83 Usan Dong, Wonju 220702, South Korea
关键词
Saliency parameter; variable selection; clustering; normal mixture model; EM algorithm;
D O I
10.5351/KJAS.2013.26.5.821
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Law et al. (2004) proposed a normal distribution based salient mixture model for variable selection in clustering. However, this model has substantial problems such as the unidentifiability of components and the inaccurate selection of informative variables in the case of a small cluster size. We propose an alternative method to overcome problems and demonstrate a good performance through experiments on simulated data and real data.
引用
收藏
页码:821 / 834
页数:14
相关论文
共 50 条
  • [21] Mixture distribution-based forecasting using stochastic volatility models
    Clements, A. E.
    Hurn, S.
    White, S. I.
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2006, 22 (5-6) : 547 - 557
  • [22] DDES: A Distribution-Based Dynamic Ensemble Selection Framework
    Choi, Ye-Rim
    Lim, Dong-Joon
    IEEE ACCESS, 2021, 9 : 40743 - 40754
  • [23] Road traffic estimation and distribution-based route selection
    Kamphuis, Rens
    Mandjes, Michel
    Serra, Paulo
    ELECTRONIC JOURNAL OF STATISTICS, 2025, 19 (01): : 865 - 920
  • [24] Variable selection in clustering via Dirichlet process mixture models
    Kim, Sinae
    Tadesse, Mahlet G.
    Vannucci, Marina
    BIOMETRIKA, 2006, 93 (04) : 877 - 893
  • [25] Bayesian regularization for normal mixture estimation and model-based clustering
    Fraley, Chris
    Raftery, Adrian E.
    JOURNAL OF CLASSIFICATION, 2007, 24 (02) : 155 - 181
  • [26] Bayesian Regularization for Normal Mixture Estimation and Model-Based Clustering
    Chris Fraley
    Adrian E. Raftery
    Journal of Classification, 2007, 24 : 155 - 181
  • [27] A distribution-based clustering algorithm for mining in large spatial databases
    Xu, XW
    Ester, M
    Kriegel, HP
    Sander, J
    14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1998, : 324 - 331
  • [28] Variable selection in finite mixture of median regression models using skew-normal distribution
    Zeng, Xin
    Ju, Yuanyuan
    Wu, Liucang
    STATISTICAL THEORY AND RELATED FIELDS, 2023, 7 (01) : 30 - 48
  • [29] Initialization of Recursive Mixture-based Clustering with Uniform Components
    Suzdaleva, Evgenia
    Nagy, Ivan
    Pecherkova, Pavla
    Likhonina, Raissa
    ICINCO: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS - VOL 1, 2017, : 449 - 458
  • [30] Variable selection for model-based high-dimensional clustering
    Wang, Sijian
    Zhu, Ji
    PREDICTION AND DISCOVERY, 2007, 443 : 177 - +