Sparsest factor analysis for clustering variables: a matrix decomposition approach

被引:7
作者
Adachi, Kohei [1 ]
Trendafilov, Nickolay T. [2 ]
机构
[1] Osaka Univ, Grad Sch Human Sci, 1-2 Yamadaoka, Suita, Osaka 5650871, Japan
[2] Open Univ, Dept Math & Stat, Walton Hall, Milton Keynes MK7 6AA, Bucks, England
基金
日本学术振兴会;
关键词
Exploratory factor analysis; Sparsest loadings; Matrix decomposition factor analysis; Variable clustering; QR re-parameterization; 62H25; 62H30; 15A23; EXPLORATORY FACTOR-ANALYSIS;
D O I
10.1007/s11634-017-0284-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a new procedure for sparse factor analysis (FA) such that each variable loads only onecommon factor. Thus, the loading matrix has a single nonzero element in each rowand zeros elsewhere. Such a loading matrix is the sparsest possible for certain number of variables and common factors. For this reason, the proposed method is named sparsest FA (SSFA). It may also be calledFA-based variable clustering, since the variables loading the same commonfactor can be classified into a cluster.InSSFA, all model parts of FA (common factors, their correlations, loadings, unique factors, and unique variances) are treated as fixed unknown parameter matrices and their least squares function is minimized through specific data matrix decomposition. A useful feature of the algorithm is thatthe matrix of common factor scores isre-parameterized using QR decomposition in order to efficiently estimate factorcorrelations. A simulation study shows that the proposed procedure can exactly identify the true sparsest models. Real data examples demonstrate the usefulness of the variable clustering performed by SSFA.
引用
收藏
页码:559 / 585
页数:27
相关论文
共 34 条
[1]  
Adachi K, 2014, JPN J STAT, V44, P363
[2]  
Adachi K, 2014, ADV LATENT VARIABLES, P227
[3]   SOME CONTRIBUTIONS TO DATA-FITTING FACTOR ANALYSIS WITH EMPIRICAL COMPARISONS TO COVARIANCE-FITTING FACTOR ANALYSIS [J].
Adachi, Kohei .
JOURNAL JAPANESE SOCIETY OF COMPUTATIONAL STATISTICS, 2012, 25 (01) :25-38
[4]  
Aggarwal C.C., 2015, Data Mining: The Textbook, DOI [10.1007/978-3-319-14142-8, DOI 10.1007/978-3-319-14142-8]
[5]  
[Anonymous], 1996, Applied Factor Analysis in the Natural Sciences
[6]  
Costa P. T., 1992, REVISED NEO PERSONAL
[7]  
De Leeuw J, 2004, MATH MODELL, V19, P121
[8]  
Eldén L, 2007, FUND ALGORITHMS, V4, pIX, DOI 10.1137/1.9780898718867
[9]  
Everitt BS, 1993, CLUSTER ANAL
[10]  
Gan G, 2007, ASA SIAM SER STAT AP, V20, P1, DOI 10.1137/1.9780898718348