Improved modelling for low-correlated multiple responses by common-subset-of-independent-variables partial-least-squares

被引:10
作者
Andries, Jan P. M. [1 ]
Tinnevelt, Gerjen H. [2 ]
Vander Heyden, Yvan [3 ]
机构
[1] Univ Profess Educ, Res Grp Anal Tech Life Sci, Avans Hogesch, POB 90116, NL-4800 RA Breda, Netherlands
[2] Radboud Univ Nijmegen, Inst Mol & Mat, POB 9010, NL-6500 GL Nijmegen, Netherlands
[3] Vrije Univ Brussel VUB, Dept Analyt Chem, Appl Chemometr & Mol Modelling, Laarbeeklaan 103, B-1090 Brussels, Belgium
关键词
PLS1; PLS2; CSIV-PLS1; FCAM-REG variable Selection; Paired t -test; COMPLEXITY ADAPTED MODELS; WAVELENGTH SELECTION; REGRESSION; PLS; CALIBRATION; REDUCTION; NIR;
D O I
10.1016/j.talanta.2021.123140
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this study, a new approach for PLS modelling for low-correlated multiple responses, called Common-Subset-ofIndependent-Variables Partial-Least-Squares, denoted as CSIV-PLS1, is proposed and evaluated. In CSIV-PLS1, for each response vector, individual PLS1 models with individual model complexities are developed, based on one common set of independent variables, obtained after variable selection by the Final Complexity Adapted Models method, using the absolute values of the PLS regression coefficients, denoted as FCAM-REG. CSIV-PLS1 combines a common variable set for all response vectors, which is a characteristic of PLS2, with the individual model complexity for each response, which is a characteristic of PLS1. These characteristics make CSIV-PLS1 more flexible than PLS2. The selective and predictive abilities of the proposed CSIV-PLS1 method are investigated using one simulated and four real data sets with low-correlated multiple responses from different sources. The simulated data set is used to test the general applicability of the CSIV-PLS1 method. The predictive abilities, measured by the RMSEP values, resulting from CSIV-PLS1 models, are statistically compared with those of the corresponding PLS1 and PLS2 models, using one-tailed paired t-tests. The selective ability of the CSIV-PLS1 method is good, because mostly variables with an informative meaning to the responses are selected. The RMSEP values resulting from the CSIV-PLS1 method are (i) significantly lower at the 95% confidence level than those of the corresponding PLS2 method, and (ii) borderline significantly lower at the 90-95% confidence level than those of the corresponding PLS1 methods. In case of low-correlated multiple responses, the predictive ability of the CSIV-PLS1 method is significantly better than that of the PLS2 method, and borderline significantly better than those of the corresponding PLS1 methods. Therefore, CSIV-PLS1 modelling may be an alternative for PLS1 or PLS2.
引用
收藏
页数:8
相关论文
共 34 条
[1]   Variable selection in regression-a tutorial [J].
Andersen, C. M. ;
Bro, R. .
JOURNAL OF CHEMOMETRICS, 2010, 24 (11-12) :728-737
[2]   Improved multi-class discrimination by Common-Subset-of-Independent-Variables Partial-Least-Squares Discriminant Analysis [J].
Andries, Jan P. M. ;
Vander Heyden, Yvan .
TALANTA, 2021, 234
[3]   Predictive-Property-Ranked Variable Reduction with Final Complexity Adapted Models in Partial Least Squares Modeling for Multiple Responses [J].
Andries, Jan P. M. ;
Vander Heyden, Yvan ;
Buydens, Lutgarde M. C. .
ANALYTICAL CHEMISTRY, 2013, 85 (11) :5444-5453
[4]   Predictive-property-ranked variable reduction in partial least squares modelling with final complexity adapted models: Comparison of properties for ranking [J].
Andries, Jan P. M. ;
Vander Heyden, Yvan ;
Buydens, Lutgarde M. C. .
ANALYTICA CHIMICA ACTA, 2013, 760 :34-45
[5]   Improved variable reduction in partial least squares modelling based on Predictive-Property-Ranked Variables and adaptation of partial least squares complexity [J].
Andries, Jan P. M. ;
Vander Heyden, Yvan ;
Buydens, Lutgarde M. C. .
ANALYTICA CHIMICA ACTA, 2011, 705 (1-2) :292-305
[6]   Variable selection in near-infrared spectroscopy: Benchmarking of feature selection methods on biodiesel data [J].
Balabin, Roman M. ;
Smirnov, Sergey V. .
ANALYTICA CHIMICA ACTA, 2011, 692 (1-2) :63-72
[7]   WAVELENGTH SELECTION IN MULTICOMPONENT NEAR-INFRARED CALIBRATION [J].
BROWN, PJ .
JOURNAL OF CHEMOMETRICS, 1992, 6 (03) :151-161
[8]   Bayesian wavelet regression on curves with application to a spectroscopic calibration problem [J].
Brown, PJ ;
Fearn, T ;
Vannucci, M .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (454) :398-408
[9]  
Brown PJ, 1998, J CHEMOMETR, V12, P173, DOI 10.1002/(SICI)1099-128X(199805/06)12:3<173::AID-CEM505>3.3.CO
[10]  
2-S