Sparse estimation via nonconcave penalized likelihood in factor analysis model

被引:47
作者
Hirose, Kei [1 ]
Yamamoto, Michio [2 ]
机构
[1] Osaka Univ, Grad Sch Engn Sci, Div Math Sci, Toyonaka, Osaka 5608531, Japan
[2] Kyoto Univ, Dept Biomed Stat & Bioinformat, Grad Sch Med, Sakyo Ku, Kyoto 6068507, Japan
关键词
Coordinate descent algorithm; Factor analysis; Nonconvex penalty; Penalized likelihood; Rotation technique; COMPONENT LOSS FUNCTIONS; VARIABLE SELECTION; ROTATION; REGRESSION; ALGORITHMS; LASSO; ERROR;
D O I
10.1007/s11222-014-9458-0
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of sparse estimation in a factor analysis model. A traditional estimation procedure in use is the following two-step approach: the model is estimated by maximum likelihood method and then a rotation technique is utilized to find sparse factor loadings. However, the maximum likelihood estimates cannot be obtained when the number of variables is much larger than the number of observations. Furthermore, even if the maximum likelihood estimates are available, the rotation technique does not often produce a sufficiently sparse solution. In order to handle these problems, this paper introduces a penalized likelihood procedure that imposes a nonconvex penalty on the factor loadings. We show that the penalized likelihood procedure can be viewed as a generalization of the traditional two-step approach, and the proposed methodology can produce sparser solutions than the rotation technique. A new algorithm via the EM algorithm along with coordinate descent is introduced to compute the entire solution path, which permits the application to a wide variety of convex and nonconvex penalties. Monte Carlo simulations are conducted to investigate the performance of our modeling strategy. A real data example is also given to illustrate our procedure.
引用
收藏
页码:863 / 875
页数:13
相关论文
共 53 条
  • [1] FACTOR-ANALYSIS AND AIC
    AKAIKE, H
    [J]. PSYCHOMETRIKA, 1987, 52 (03) : 317 - 332
  • [2] Anderson T. W., 1956, P 3 BERK S MATH STAT, V5, P111
  • [3] [Anonymous], 2010, R LANG ENV STAT COMP
  • [4] [Anonymous], 1973, INT S INF THEOR BUD, DOI [10.1007/978-1-4612-1694-0, 10.1007/978-1-4612-0919-5_38]
  • [5] STATISTICAL ANALYSIS OF FACTOR MODELS OF HIGH DIMENSION
    Bai, Jushan
    Li, Kunpeng
    [J]. ANNALS OF STATISTICS, 2012, 40 (01) : 436 - 465
  • [7] COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION
    Breheny, Patrick
    Huang, Jian
    [J]. ANNALS OF APPLIED STATISTICS, 2011, 5 (01) : 232 - 253
  • [8] High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics
    Carvalho, Carlos M.
    Chang, Jeffrey
    Lucas, Joseph E.
    Nevins, Joseph R.
    Wang, Quanli
    West, Mike
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) : 1438 - 1456
  • [9] Extended Bayesian information criteria for model selection with large model spaces
    Chen, Jiahua
    Chen, Zehua
    [J]. BIOMETRIKA, 2008, 95 (03) : 759 - 771
  • [10] Choi J, 2010, STAT INTERFACE, V3, P429