Clusterwise analysis for multiblock component methods

被引:0
作者
Stéphanie Bougeard
Hervé Abdi
Gilbert Saporta
Ndèye Niang
机构
[1] Anses (French agency for food,Department of Epidemiology
[2] environmental and occupational health safety),undefined
[3] The University of Texas at Dallas,undefined
[4] CEDRIC CNAM,undefined
来源
Advances in Data Analysis and Classification | 2018年 / 12卷
关键词
Multiblock component method; Clusterwise regression; Typological regression; Cluster analysis; Dimension reduction; 62H30; 62H25; 91C20;
D O I
暂无
中图分类号
学科分类号
摘要
Multiblock component methods are applied to data sets for which several blocks of variables are measured on a same set of observations with the goal to analyze the relationships between these blocks of variables. In this article, we focus on multiblock component methods that integrate the information found in several blocks of explanatory variables in order to describe and explain one set of dependent variables. In the following, multiblock PLS and multiblock redundancy analysis are chosen, as particular cases of multiblock component methods when one set of variables is explained by a set of predictor variables that is organized into blocks. Because these multiblock techniques assume that the observations come from a homogeneous population they will provide suboptimal results when the observations actually come from different populations. A strategy to palliate this problem—presented in this article—is to use a technique such as clusterwise regression in order to identify homogeneous clusters of observations. This approach creates two new methods that provide clusters that have their own sets of regression coefficients. This combination of clustering and regression improves the overall quality of the prediction and facilitates the interpretation. In addition, the minimization of a well-defined criterion—by means of a sequential algorithm—ensures that the algorithm converges monotonously. Finally, the proposed method is distribution-free and can be used when the explanatory variables outnumber the observations within clusters. The proposed clusterwise multiblock methods are illustrated with of a simulation study and a (simulated) example from marketing.
引用
收藏
页码:285 / 313
页数:28
相关论文
共 50 条
  • [31] Principal component analysis and clustering on manifolds
    V. Mardia, Kanti
    Wiechers, Henrik
    Eltzner, Benjamin
    Huckemann, Stephan F.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 188
  • [32] Principal component analysis for α-stable vectors
    Mohammadi, Mohammad
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (09) : 5245 - 5263
  • [33] Evaluation of methods for adjusting population stratification in genome-wide association studies: Standard versus categorical principal component analysis
    Turkmen, Asuman S.
    Yuan, Yuan
    Billor, Nedret
    ANNALS OF HUMAN GENETICS, 2019, 83 (06) : 454 - 464
  • [34] Principal Component Analysis and Cluster Analysis for Development of Electrical System
    Iswan
    Garniwa, Iwa
    2017 15TH INTERNATIONAL CONFERENCE ON QUALITY IN RESEARCH (QIR) - INTERNATIONAL SYMPOSIUM ON ELECTRICAL AND COMPUTER ENGINEERING, 2017, : 439 - 443
  • [35] Dynamic Supervised Principal Component Analysis for Classification
    Ouyang, Wenbo
    Wu, Ruiyang
    Hao, Ning
    Zhang, Hao Helen
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2025,
  • [36] Dimension reduction in principal component analysis for trees
    Alfaro, Carlos A.
    Aydin, Burcu
    Valencia, Carlos E.
    Bullitt, Elizabeth
    Ladha, Alim
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 74 : 157 - 179
  • [37] SPARSE PRINCIPAL COMPONENT ANALYSIS AND ITERATIVE THRESHOLDING
    Ma, Zongming
    ANNALS OF STATISTICS, 2013, 41 (02) : 772 - 801
  • [38] Supervised kernel principal component analysis for forecasting
    Fang, Puyi
    Gao, Zhaoxing
    Tsay, Ruey S.
    FINANCE RESEARCH LETTERS, 2023, 58
  • [39] Principal component analysis of binary genomics data
    Song, Yipeng
    Westerhuis, Johan A.
    Aben, Nanne
    Michaut, Magali
    Wessels, Lodewyk F. A.
    Smilde, Age K.
    BRIEFINGS IN BIOINFORMATICS, 2019, 20 (01) : 317 - 329
  • [40] Sparse exponential family Principal Component Analysis
    Lu, Meng
    Huang, Jianhua Z.
    Qian, Xiaoning
    PATTERN RECOGNITION, 2016, 60 : 681 - 691