Sparse Variable PCA Using Geodesic Steepest Descent

被引:30
作者
Ulfarsson, Magnus O. [1 ]
Solo, Victor [2 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48104 USA
[2] Univ New S Wales, Sch Elect Engn, Sydney, NSW 2052, Australia
关键词
Geodesic; geometric; l(1); LASSO; principal component analysis (PCA); regularization; sparse;
D O I
10.1109/TSP.2008.2006587
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Principal component analysis (PCA) is a dimensionality reduction technique used in most fields of science and engineering. It aims to find linear combinations of the input variables that maximize variance. A problem with PCA is that it typically assigns nonzero loadings to all the variables, which in high dimensional problems can require a very large number of coefficients. But in many applications, the aim is to obtain a massive reduction in the number of coefficients. There are two very different types of sparse PCA problems: sparse loadings PCA (slPCA) which zeros out loadings (while generally keeping all of the variables) and sparse variable PCA which zeros out whole variables (typically leaving less than half of them). In this paper, we propose a new svPCA, which we call sparse variable noisy PCA (svnPCA). It is based on a statistical model, and this gives access to a range of modeling and inferential tools. Estimation is based on optimizing a novel penalized log-likelihood able to zero out whole variables rather than just some loadings. The estimation algorithm is based on the geodesic steepest descent algorithm. Finally, we develop a novel form of Bayesian information criterion (BIC) for tuning parameter selection. The svnPCA algorithm is applied to both simulated data and real functional magnetic resonance imaging (fMRI) data.
引用
收藏
页码:5823 / 5832
页数:10
相关论文
共 46 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]   AN ALGORITHM FOR THE MINIMIZATION OF MIXED L1 AND L2 NORMS WITH APPLICATION TO BAYESIAN-ESTIMATION [J].
ALLINEY, S ;
RUZINSKY, SA .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1994, 42 (03) :618-627
[3]   ASYMPTOTIC THEORY FOR PRINCIPAL COMPONENT ANALYSIS [J].
ANDERSON, TW .
ANNALS OF MATHEMATICAL STATISTICS, 1963, 34 (01) :122-&
[4]  
[Anonymous], 1996, MATRIX COMPUTATION
[5]  
Bertsekas D., 1999, NONLINEAR PROGRAMMIN
[6]  
CICHOCKI A, 2003, IEICE T FUND ELECTR, V86, P1
[7]  
DASPREMONT A, 2006, SIAM REV IN PRESS
[8]   DE-NOISING BY SOFT-THRESHOLDING [J].
DONOHO, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (03) :613-627
[9]   IDEAL SPATIAL ADAPTATION BY WAVELET SHRINKAGE [J].
DONOHO, DL ;
JOHNSTONE, IM .
BIOMETRIKA, 1994, 81 (03) :425-455
[10]   On gradient adaptation with unit-norm constraints [J].
Douglas, SC ;
Amari, S ;
Kune, SY .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2000, 48 (06) :1843-1847