Discriminative variable selection for clustering with the sparse Fisher-EM algorithm

被引：18

作者：

Bouveyron, Charles ^{[1
]}

Brunet-Saumard, Camille ^{[2
]}

机构：

[1] Univ Paris 01, EA 4543, Lab SAMM, F-75231 Paris 05, France

[2] Univ Angers, UMR CNRS 6093, Lab LAREMA, Angers, France

来源：

COMPUTATIONAL STATISTICS | 2014年 / 29卷 / 3-4期

关键词：

Model-based clustering; Variable selection; Discriminative subspace; Fisher-EM algorithm; l(1)-Type penalizations; HIGH-DIMENSIONAL DATA; FRAMEWORK; MIXTURES;

D O I：

10.1007/s00180-013-0433-6

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

The interest in variable selection for clustering has increased recently due to the growing need in clustering high-dimensional data. Variable selection allows in particular to ease both the clustering and the interpretation of the results. Existing approaches have demonstrated the importance of variable selection for clustering but turn out to be either very time consuming or not sparse enough in high-dimensional spaces. This work proposes to perform a selection of the discriminative variables by introducing sparsity in the loading matrix of the Fisher-EM algorithm. This clustering method has been recently proposed for the simultaneous visualization and clustering of high-dimensional data. It is based on a latent mixture model which fits the data into a low-dimensional discriminative subspace. Three different approaches are proposed in this work to introduce sparsity in the orientation matrix of the discriminative subspace through -type penalizations. Experimental comparisons with existing approaches on simulated and real-world data sets demonstrate the interest of the proposed methodology. An application to the segmentation of hyperspectral images of the planet Mars is also presented.

引用

页码：489 / 513

页数：25

共 50 条

[1] Discriminative variable selection for clustering with the sparse Fisher-EM algorithm
Charles Bouveyron
Camille Brunet-Saumard
Computational Statistics, 2014, 29 : 489 - 513
[2] A Bayesian Fisher-EM algorithm for discriminative Gaussian subspace clustering
Jouvin, Nicolas
Bouveyron, Charles
Latouche, Pierre
STATISTICS AND COMPUTING, 2021, 31 (04)
[3] On the estimation of the latent discriminative subspace in the Fisher-EM algorithm
Bouveyron, Charles
Brunet, Camille
JOURNAL OF THE SFDS, 2011, 152 (03): : 98 - 115
[4] Theoretical and practical considerations on the convergence properties of the Fisher-EM algorithm
Bouveyron, Charles
Brunet, Camille
JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 109 : 29 - 41
[5] Simultaneous model-based clustering and visualization in the Fisher discriminative subspace
Bouveyron, Charles
Brunet, Camille
STATISTICS AND COMPUTING, 2012, 22 (01) : 301 - 324
[6] Variable Selection for Clustering and Classification
Andrews, Jeffrey L.
McNicholas, Paul D.
JOURNAL OF CLASSIFICATION, 2014, 31 (02) : 136 - 153
[7] Variable Selection for Clustering and Classification
Jeffrey L. Andrews
Paul D. McNicholas
Journal of Classification, 2014, 31 : 136 - 153
[8] Simultaneous clustering and variable selection: A novel algorithm and model selection procedure
Shuai Yuan
Kim De Roover
Katrijn Van Deun
Behavior Research Methods, 2023, 55 : 2157 - 2174
[9] Simultaneous clustering and variable selection: A novel algorithm and model selection procedure
Yuan, Shuai
De Roover, Kim
Van Deun, Katrijn
BEHAVIOR RESEARCH METHODS, 2023, 55 (05) : 2157 - 2174
[10] Finite mixture regression: A sparse variable selection by model selection for clustering
Devijver, Emilie
ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (02): : 2642 - 2674

← 1 2 3 4 5 →