Variable Selection in Normal Mixture Model Based Clustering under Heteroscedasticity

Cited by: 1

Authors:

Kim, Seung-Gu [1]

Affiliations:

[1] Sangji Univ, Dept Data & Informat, 83 Usan Dong, Wonju 122807, South Korea

Source:

KOREAN JOURNAL OF APPLIED STATISTICS | 2011, Vol. 24, No. 6

Keywords:

Informative variables; variable selection; clustering; EM algorithm; microarray gene expression;

DOI:

10.5351/KJAS.2011.24.6.1213

Chinese Library Classification (CLC) number:

O21 [Probability theory and mathematical statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

In high-dimensional settings, where the number of variables far exceeds the number of observations, noninformative variables must be removed before the observations can be clustered. Most model-based approaches to variable selection assume homoscedasticity, and their models are mainly estimated by a penalized likelihood method. In this paper, a different approach is proposed that removes the noninformative variables effectively and simultaneously clusters the observations, based on a modified normal mixture model. The validity of the model is shown, and an EM algorithm is derived to estimate the parameters. Simulation studies and an experiment on a real microarray dataset show the effectiveness of the proposed method.


Pages: 1213-1224

Page count: 12

