Variable selection in partial least squares with the weighted variable contribution to the first singular value of the covariance matrix

Cited by: 4
Authors
Lin, Weilu [1]
Hang, Haifeng [1]
Zhuang, Yingping [1]
Zhang, Siliang [1]
Affiliations
[1] East China Univ Sci & Technol, State Key Lab Bioreactor Engn, 130 Meilong Rd, Shanghai 200237, Peoples R China
Keywords
Informative variables; Interval variable selection; Partial least squares; Variable contribution; Maximal singular value; Spectroscopy; LATENT STRUCTURES; REGRESSION; PROJECTIONS; REGIONS; PLS;
DOI
10.1016/j.chemolab.2018.11.003
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The selection of informative variables in partial least squares (PLS) is important in process analytical technology (PAT) applications in the pharmaceutical industry, for example, in the calibration of spectrometers. Numerous approaches have been proposed in the past to select the variables in PLS. In this work, a new variable selection method for PLS is proposed, based on the weighted variable contribution (PLS-WVC) to the first singular value of the covariance matrix for each PLS component. Several variants of PLS-WVC with different weighting factors are proposed. One variant of PLS-WVC is equivalent to PLS with variable importance in projection (PLS-VIP). However, the variants that use the correlation between X(gamma)w(gamma) and Y(gamma)q(gamma) as the weighting factor are preferred, based on the results of the simulated case studies. The proposed PLS-WVC variants are further integrated with interval PLS (iPLS) to select the informative wavelength intervals for spectroscopic modelling. The utility of the proposed WVC-based variable selection methods in PLS is demonstrated with real spectral data sets.
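The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; it assumes that the index gamma denotes the PLS component, that the contribution of variable j to the first singular value sigma of the deflated covariance matrix X'Y is sigma * w_j^2 (which follows from X'Y q = sigma w with unit-norm w), and that the weighting factor is the absolute correlation between the scores X w and Y q. The function name `wvc_scores`, the deflation scheme, and the final normalisation are illustrative choices, not taken from the paper.

```python
import numpy as np


def wvc_scores(X, Y, n_components=2):
    """Hypothetical weighted-variable-contribution scores (illustrative only)."""
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    if Y.ndim == 1:
        Y = Y[:, None]
    # Centre both blocks, as is usual before PLS.
    Xa = X - X.mean(axis=0)
    Ya = Y - Y.mean(axis=0)

    scores = np.zeros(X.shape[1])
    total = 0.0
    for _ in range(n_components):
        # First singular triplet of the (deflated) covariance matrix X'Y.
        U, s, Vt = np.linalg.svd(Xa.T @ Ya, full_matrices=False)
        w, q, sigma = U[:, 0], Vt[0], s[0]
        t = Xa @ w  # X scores
        u = Ya @ q  # Y scores

        # Since X'Y q = sigma * w and ||w|| = 1, sigma = sum_j sigma * w_j**2,
        # so sigma * w_j**2 is read here as the contribution of variable j.
        contribution = sigma * w * w

        # Assumed weighting factor: |corr(X w, Y q)| for this component.
        weight = abs(np.corrcoef(t, u)[0, 1])
        scores += weight * contribution
        total += weight * sigma

        # Deflate both blocks with the X scores (one common PLS convention).
        p = Xa.T @ t / (t @ t)
        c = Ya.T @ t / (t @ t)
        Xa = Xa - np.outer(t, p)
        Ya = Ya - np.outer(t, c)

    # Normalise so that the scores sum to one.
    return scores / total


# Synthetic example: only variables 0 and 5 carry information about y.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 20))
y = 3.0 * X[:, 0] - 2.0 * X[:, 5] + 0.1 * rng.normal(size=200)
scores = wvc_scores(X, y, n_components=2)
top_two = set(np.argsort(scores)[-2:].tolist())
```

Replacing the correlation weight with the Y variance explained by each component would give a VIP-like ranking, which is in the spirit of the equivalence to PLS-VIP mentioned in the abstract; the exact form of that equivalence is not given there.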
Pages: 113-121
Number of pages: 9
Related papers
50 in total
  • [1] Recursive weighted partial least squares (rPLS): an efficient variable selection method using PLS
    Rinnan, Asmund
    Andersson, Martin
    Ridder, Carsten
    Engelsen, Soren Balling
    JOURNAL OF CHEMOMETRICS, 2014, 28 (05) : 439 - 447
  • [2] A Partial Least Squares based algorithm for parsimonious variable selection
    Mehmood, Tahir
    Martens, Harald
    Saebo, Solve
    Warringer, Jonas
    Snipen, Lars
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6
  • [3] Variable selection for partial least squares modeling by genetic algorithms
    Chu, XL
    Yuan, HF
    Wang, YB
    Lu, WZ
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2001, 29 (04) : 437 - 442
  • [4] Comparison of variable selection methods in partial least squares regression
    Mehmood, Tahir
    Saebo, Solve
    Liland, Kristian Hovde
    JOURNAL OF CHEMOMETRICS, 2020, 34 (06)
  • [5] Variable selection in discriminant partial least-squares analysis
    Alsberg, BK
    Kell, DB
    Goodacre, R
    ANALYTICAL CHEMISTRY, 1998, 70 (19) : 4126 - 4133
  • [6] A Partial Least Squares based algorithm for parsimonious variable selection
    Tahir Mehmood
    Harald Martens
    Solve Sæbø
    Jonas Warringer
    Lars Snipen
    Algorithms for Molecular Biology, 6
  • [7] A review of variable selection methods in Partial Least Squares Regression
    Mehmood, Tahir
    Liland, Kristian Hovde
    Snipen, Lars
    Saebo, Solve
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 118 : 62 - 69
  • [8] Application of Variable Selection in Hydrological Forecasting Based on Partial Least Squares
    Ma Tengfei
    Wang Chuanhai
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1990 - 1994
  • [9] A partition-based variable selection in partial least squares regression
    Li, Chuan-Quan
    Fang, Zhaoyu
    Xu, Qing-Song
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 198
  • [10] A variant of sparse partial least squares for variable selection and data exploration
    Hunt, Megan J. Olson
    Weissfeld, Lisa
    Boudreau, Robert M.
    Aizenstein, Howard
    Newman, Anne B.
    Simonsick, Eleanor M.
    Van Domelen, Dane R.
    Thomas, Fridtjof
    Yaffe, Kristine
    Rosano, Caterina
    FRONTIERS IN NEUROINFORMATICS, 2014, 8