Sparse estimation via nonconcave penalized likelihood in factor analysis model

被引：47

作者：

Hirose, Kei ^{[1
]}

Yamamoto, Michio ^{[2
]}

机构：

[1] Osaka Univ, Grad Sch Engn Sci, Div Math Sci, Toyonaka, Osaka 5608531, Japan

[2] Kyoto Univ, Dept Biomed Stat & Bioinformat, Grad Sch Med, Sakyo Ku, Kyoto 6068507, Japan

来源：

STATISTICS AND COMPUTING | 2015年 / 25卷 / 05期

关键词：

Coordinate descent algorithm; Factor analysis; Nonconvex penalty; Penalized likelihood; Rotation technique; COMPONENT LOSS FUNCTIONS; VARIABLE SELECTION; ROTATION; REGRESSION; ALGORITHMS; LASSO; ERROR;

D O I：

10.1007/s11222-014-9458-0

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We consider the problem of sparse estimation in a factor analysis model. A traditional estimation procedure in use is the following two-step approach: the model is estimated by maximum likelihood method and then a rotation technique is utilized to find sparse factor loadings. However, the maximum likelihood estimates cannot be obtained when the number of variables is much larger than the number of observations. Furthermore, even if the maximum likelihood estimates are available, the rotation technique does not often produce a sufficiently sparse solution. In order to handle these problems, this paper introduces a penalized likelihood procedure that imposes a nonconvex penalty on the factor loadings. We show that the penalized likelihood procedure can be viewed as a generalization of the traditional two-step approach, and the proposed methodology can produce sparser solutions than the rotation technique. A new algorithm via the EM algorithm along with coordinate descent is introduced to compute the entire solution path, which permits the application to a wide variety of convex and nonconvex penalties. Monte Carlo simulations are conducted to investigate the performance of our modeling strategy. A real data example is also given to illustrate our procedure.

引用

页码：863 / 875

页数：13

共 53 条

[1] FACTOR-ANALYSIS AND AIC
AKAIKE, H
[J]. PSYCHOMETRIKA, 1987, 52 (03) : 317 - 332
[2] Anderson T. W., 1956, P 3 BERK S MATH STAT, V5, P111
[3] [Anonymous], 2010, R LANG ENV STAT COMP
[4] [Anonymous], 1973, INT S INF THEOR BUD, DOI [10.1007/978-1-4612-1694-0, 10.1007/978-1-4612-0919-5_38]
[5] STATISTICAL ANALYSIS OF FACTOR MODELS OF HIGH DIMENSION
Bai, Jushan
Li, Kunpeng
[J]. ANNALS OF STATISTICS, 2012, 40 (01) : 436 - 465
[6] MODEL SELECTION AND AKAIKE INFORMATION CRITERION (AIC) - THE GENERAL-THEORY AND ITS ANALYTICAL EXTENSIONS
BOZDOGAN, H
[J]. PSYCHOMETRIKA, 1987, 52 (03) : 345 - 370
[7] COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION
Breheny, Patrick
Huang, Jian
[J]. ANNALS OF APPLIED STATISTICS, 2011, 5 (01) : 232 - 253
[8] High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics
Carvalho, Carlos M.
Chang, Jeffrey
Lucas, Joseph E.
Nevins, Joseph R.
Wang, Quanli
West, Mike
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) : 1438 - 1456
[9] Extended Bayesian information criteria for model selection with large model spaces
Chen, Jiahua
Chen, Zehua
[J]. BIOMETRIKA, 2008, 95 (03) : 759 - 771
[10] Choi J, 2010, STAT INTERFACE, V3, P429

← 1 2 3 4 5 6 →