Variable selection in multi-block regression

被引:53
作者
Biancolillo, Alessandra [1 ,2 ]
Liland, Kristian Hovde [1 ,3 ]
Mage, Ingrid [1 ]
Naes, Tormod [1 ,2 ]
Bro, Rasmus [2 ]
机构
[1] Nofima AS, Osloveien 1,POB 210, N-1431 As, Norway
[2] Univ Copenhagen, Fac Life Sci, Dept Food Sci, Qual & Technol, Rolighedsvej 30, DK-1958 Frederiksberg C, Denmark
[3] Norwegian Univ Life Sci, Dept Chem Biotechnol & Food Sci, POB 5003, N-1432 As, Norway
关键词
Variable selection; Multi-block; SO-PLS; MB-PLS; Raman; Sensory; PARTIAL LEAST-SQUARES; NEAR-INFRARED SPECTROSCOPY; MODELS; PLS;
D O I
10.1016/j.chemolab.2016.05.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The focus of the present paper is to propose and discuss different procedures for performing variable selection in a multi-block regression context. In particular, the focus is on two multi-block regression methods: Multi-Block Partial Least Squares (MB-PLS) and Sequential and Orthogonalized Partial Least Squares (SO-PLS) regression. A small simulation study for regular PLS regression was conducted in order to select the most promising methods to investigate further in the multi-block context. The combinations of three variable selection methods with MB-PLS and SO-PLS are examined in detail. These methods are Variable Importance in Projection (VIP) Selectivity Ratio (SR) and forward selection. In this paper we focus on both prediction ability and interpretation. The different approaches are tested on three types of data: one sensory data set, one spectroscopic (Raman) data set and a number of simulated multi-block data sets. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:89 / 101
页数:13
相关论文
共 50 条
  • [21] Variable Selection Methods in Spectral Data Analysis
    Li Yan-kun
    Dong Ru-nan
    Zhang Jin
    Huang Ke-nan
    Mao Zhi-yi
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41 (11) : 3331 - 3338
  • [22] Multi-block SO-PLS approach based on infrared spectroscopy for anaerobic digestion process monitoring
    Awhangbo, L.
    Bendoula, R.
    Roger, J. M.
    Beline, F.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 196 (196)
  • [23] Componentwise variable selection in finite mixture regression
    Chen, Bin
    Ye, Keying
    STATISTICS AND ITS INTERFACE, 2015, 8 (02) : 239 - 254
  • [24] Variable Selection and Redundancy in Multivariate Regression Models
    Westad, Frank
    Marini, Federico
    FRONTIERS IN ANALYTICAL SCIENCE, 2022, 2
  • [25] Distribution based truncation for variable selection in subspace methods for multivariate regression
    Liland, Kristian Hovde
    Hoy, Martin
    Martens, Harald
    Saebo, Solve
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2013, 122 : 103 - 111
  • [26] Multi-block Poisson grid generator for cascade simulations
    Gross, Andreas
    Fasel, Hermann F.
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2008, 79 (03) : 416 - 428
  • [27] Penalized variable selection in multi-parameter regression survival modeling
    Jaouimaa, Fatima-Zahra
    Do Ha, Il
    Burke, Kevin
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2023, 32 (12) : 2455 - 2471
  • [28] Recent trends in multi-block data analysis in chemometrics for multi- source data integration
    Mishra, Puneet
    Roger, Jean-Michel
    Jouan-Rimbaud-Bouveresse, Delphine
    Biancolillo, Alessandra
    Marini, Federico
    Nordon, Alison
    Rutledge, Douglas N.
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2021, 137
  • [29] Variable selection in expectile regression
    Zhao, Jun
    Zhang, Yi
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (07) : 1731 - 1746
  • [30] Variable selection in linear regression
    Lindsey, Charles
    Sheather, Simon
    STATA JOURNAL, 2010, 10 (04) : 650 - 669