Sparse factor regression via penalized maximum likelihood estimation

被引:6
作者
Hirose, Kei [1 ]
Imada, Miyuki [2 ]
机构
[1] Kyushu Univ, Inst Math Ind, Nishi Ku, 744 Motooka, Fukuoka 8190395, Japan
[2] NTT Network Innovat Labs, 3-9-11 Midori Cho, Musashino, Tokyo 1808585, Japan
关键词
Coordinate descent algorithm; Lasso; Multicollinearity; Penalized maximum likelihood estimation; Rotation technique; PREDICTION ERROR PROPERTY; VARIABLE SELECTION; IMPROPER SOLUTIONS; LASSO ESTIMATOR; REGULARIZATION; ALGORITHMS; ROTATION; MODELS;
D O I
10.1007/s00362-016-0781-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In factor regression model, the maximum likelihood estimation suffers from three disadvantages: (i) the maximum likelihood estimates are unavailable when the number of variables exceeds the number of observations, (ii) the rotation technique based on maximum likelihood estimates produces an insufficiently sparse loading matrix, and (iii) multicollinearity can occur when the estimates of unique variances (specific variances) are small because the regression coefficients are sensitive to the inverse of unique variances. To handle these problems, we propose a penalized maximum likelihood procedure. Specifically, we impose a lasso-type penalty on the factor loadings to improve the sparseness of the solution. We also introduce a penalty on unique variances, which (given the factor scores) corresponds to the ridge penalty on the regression coefficient. Theoretical properties from a prediction viewpoint of our procedure are discussed. The effectiveness of the procedure is investigated through Monte Carlo simulations. The utility of our procedure is demonstrated on real data collected by an online questionnaire.
引用
收藏
页码:633 / 662
页数:30
相关论文
共 36 条
[21]  
Little R. J., 2002, STAT ANAL MISSING DA, DOI [10.1002/9781119013563, DOI 10.1002/9781119013563]
[22]   BAYESIAN ESTIMATION IN UNRESTRICTED FACTOR-ANALYSIS - TREATMENT FOR HEYWOOD CASES [J].
MARTIN, JK ;
MCDONALD, RP .
PSYCHOMETRIKA, 1975, 40 (04) :505-517
[23]  
Maxwell A.E., 1977, MULTIVARIATE ANAL BE
[24]   SparseNet: Coordinate Descent With Nonconvex Penalties [J].
Mazumder, Rahul ;
Friedman, Jerome H. ;
Hastie, Trevor .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (495) :1125-1138
[25]  
Ning LP, 2011, IEEE DECIS CONTR P, P5188
[26]   MAXIMUM LIKELIHOOD ESTIMATION OF THE POLYCHLORIC CORRELATION-COEFFICIENT [J].
OLSSON, U .
PSYCHOMETRIKA, 1979, 44 (04) :443-460
[27]  
Rai Piyush., 2008, NIPS'08, P1321
[28]   Corrected proof of the result of 'A prediction error property of the Lasso estimator and its generalization' by Huang (2003) [J].
Rosset, S ;
Zhu, J .
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2004, 46 (03) :505-510
[29]   EM ALGORITHMS FOR ML FACTOR-ANALYSIS [J].
RUBIN, DB ;
THAYER, DT .
PSYCHOMETRIKA, 1982, 47 (01) :69-76
[30]   VARIOUS CAUSES OF IMPROPER SOLUTIONS IN MAXIMUM LIKELIHOOD FACTOR-ANALYSIS [J].
VANDRIEL, OP .
PSYCHOMETRIKA, 1978, 43 (02) :225-243