Multiple factor analysis: principal component analysis for multitable and multiblock data sets

被引:321
作者
Abdi, Herve [1 ]
Williams, Lynne J. [2 ]
Valentin, Domininique [3 ]
机构
[1] Univ Texas Dallas, Dept Brain & Behav Sci, Richardson, TX 75080 USA
[2] Baycrest, Rotman Res Inst, Toronto, ON, Canada
[3] Univ Bourgogne, Dijon, France
关键词
multiple factor analysis (MFA); multiple factorial analysis; multiblock correspondence analysis; STATIS; INDSCAL; multiblock barycentric discriminant analysis (MUDICA); multiple factor analysis barycentric discriminant analysis (MUFABADA); barycentric discriminant analysis (BADA); generalized Procrustes analysis (GPA); generalized singular value decomposition; principal component analysis; consensus PCA; multitable PCA; multiblock PCA;
D O I
10.1002/wics.1246
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multiple factor analysis (MFA, also called multiple factorial analysis) is an extension of principal component analysis (PCA) tailored to handle multiple data tables that measure sets of variables collected on the same observations, or, alternatively, (in dual-MFA) multiple data tables where the same variables are measured on different sets of observations. MFA proceeds in two steps: First it computes a PCA of each data table and 'normalizes' each data table by dividing all its elements by the first singular value obtained from its PCA. Second, all the normalized data tables are aggregated into a grand data table that is analyzed via a (non-normalized) PCA that gives a set of factor scores for the observations and loadings for the variables. In addition, MFA provides for each data table a set of partial factor scores for the observations that reflects the specific 'view-point' of this data table. Interestingly, the common factor scores could be obtained by replacing the original normalized data tables by the normalized factor scores obtained from the PCA of each of these tables. In this article, we present MFA, review recent extensions, and illustrate it with a detailed example. (C) 2013 Wiley Periodicals, Inc.
引用
收藏
页码:149 / 179
页数:31
相关论文
共 157 条
  • [11] Abdi H., 2010, ENCY RES DESIGN, P64
  • [12] Abdi H., 2010, ENCY RES DESIGN, P222, DOI DOI 10.4135/9781412961288.N71
  • [13] Abdi H., 2007, ENCY MEASUREMENT STA, P849
  • [14] Abdi H., 2010, ENCY RES DESIGN, P267
  • [15] Abdi H., 2007, ENCY MEASUREMENT STA, P907
  • [16] Analyzing assessors and products in sorting tasks: DISTATIS, theory and applications
    Abdi, Herve
    Valentin, Dominique
    Chollet, Sylvie
    Chrea, Christelle
    [J]. FOOD QUALITY AND PREFERENCE, 2007, 18 (04) : 627 - 640
  • [17] Centroids
    Abdi, Herve
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2009, 1 (02): : 259 - 260
  • [18] Analysis of Regional Cerebral Blood Flow Data to Discriminate among Alzheimer's Disease, Frontotemporal Dementia, and Elderly Controls: A Multi-Block Barycentric Discriminant Analysis (MUBADA) Methodology
    Abdi, Herve
    Williams, Lynne J.
    Beaton, Derek
    Posamentier, Mette T.
    Harris, Thomas S.
    Krishnan, Anjali
    Devous, Michael D., Sr.
    [J]. JOURNAL OF ALZHEIMERS DISEASE, 2012, 31 : S189 - S201
  • [19] STATIS and DISTATIS: optimum multitable principal component analysis and three way metric multidimensional scaling
    Abdi, Herve
    Williams, Lynne J.
    Valentin, Domininique
    Bennani-Dosse, Mohammed
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (02): : 124 - 167
  • [20] Multiple Subject Barycentric Discriminant Analysis (MUSUBADA): How to Assign Scans to Categories without Using Spatial Normalization
    Abdi, Herve
    Williams, Lynne J.
    Connolly, Andrew C.
    Gobbini, M. Ida
    Dunlop, Joseph P.
    Haxby, James V.
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2012, 2012