The Sparse Principal Component Analysis Problem: Optimality Conditions and Algorithms

被引:16
作者
Beck, Amir [1 ]
Vaisbourd, Yakov [1 ]
机构
[1] Technion, Fac Ind Engn & Management, Haifa, Israel
基金
以色列科学基金会;
关键词
Optimality conditions; Principal component analysis; Sparsity constrained problems; Stationarity; Numerical methods; SELECTION; LASSO;
D O I
10.1007/s10957-016-0934-x
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Sparse principal component analysis addresses the problem of finding a linear combination of the variables in a given dataset with a sparse coefficients vector that maximizes the variability of the data. This model enhances the ability to interpret the principal components and is applicable in a wide variety of fields including genetics and finance, just to name a few. We suggest a necessary coordinate-wise-based optimality condition and show its superiority over the stationarity-based condition that is commonly used in the literature, which is the basis for many of the algorithms designed to solve the problem. We devise algorithms that are based on the new optimality condition and provide numerical experiments that support our assertion that algorithms, which are guaranteed to converge to stronger optimality conditions, perform better than algorithms that converge to points satisfying weaker optimality conditions.
引用
收藏
页码:119 / 143
页数:25
相关论文
共 23 条
[1]  
[Anonymous], 2002, Principal components analysis
[2]  
[Anonymous], 1967, Applied Statistics, DOI DOI 10.1038/S41598-017-00047-5
[3]   MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia [J].
Armstrong, SA ;
Staunton, JE ;
Silverman, LB ;
Pieters, R ;
de Boer, ML ;
Minden, MD ;
Sallan, SE ;
Lander, ES ;
Golub, TR ;
Korsmeyer, SJ .
NATURE GENETICS, 2002, 30 (01) :41-47
[4]   On the Minimization Over Sparse Symmetric Sets: Projections, Optimality Conditions, and Algorithms [J].
Beck, Amir ;
Hallak, Nadav .
MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (01) :196-223
[5]   SPARSITY CONSTRAINED NONLINEAR OPTIMIZATION: OPTIMALITY CONDITIONS AND ALGORITHMS [J].
Beck, Amir ;
Eldar, Yonina C. .
SIAM JOURNAL ON OPTIMIZATION, 2013, 23 (03) :1480-1509
[6]   LOADINGS AND CORRELATIONS IN THE INTERPRETATION OF PRINCIPAL COMPONENTS [J].
CADIMA, J ;
JOLLIFFE, IT .
JOURNAL OF APPLIED STATISTICS, 1995, 22 (02) :203-214
[7]  
d'Aspremont A, 2008, J MACH LEARN RES, V9, P1269
[8]   A direct formulation for sparse PCA using semidefinite programming [J].
d'Aspremont, Alexandre ;
El Ghaoui, Laurent ;
Jordan, Michael I. ;
Lanckriet, Gert R. G. .
SIAM REVIEW, 2007, 49 (03) :434-448
[9]   Identifying small mean-reverting portfolios [J].
D'Aspremont, Alexandre .
QUANTITATIVE FINANCE, 2011, 11 (03) :351-364
[10]   A modified principal component technique based on the LASSO [J].
Jolliffe, IT ;
Trendafilov, NT ;
Uddin, M .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (03) :531-547