Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation

被引:433
|
作者
Farres, Mireia [1 ]
Platikanov, Stefan [1 ]
Tsakovski, Stefan [2 ]
Tauler, Roma [1 ]
机构
[1] CSIC, IDAEA, Dept Environm Chem, ES-08034 Barcelona, Spain
[2] Univ Sofia, Fac Chem, Dept Analyt Chem, Sofia 1164, Bulgaria
基金
欧洲研究理事会;
关键词
variable importance in projection; selectivity ratio; variable selection; partial least squares; PARTIAL LEAST-SQUARES; MASS-SPECTRAL PROFILES; MICROARRAY DATA; REGRESSION; CLASSIFICATION; IDENTIFICATION; PERFORMANCE; INDEX; WATER; PANEL;
D O I
10.1002/cem.2736
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study compares the application of two variable selection methods in partial least squares regression (PLSR), the variable importance in projection (VIP) method and the selectivity ratio (SR) method. For this purpose, three different data sets were analysed: (a) physiochemical water quality parameters related to sensorial data, (b) gas chromatography-mass spectrometry (GC-MS) chemical (organic compound) profiles from fossil sea sediment samples related to sea surface temperature (SST) changes, and (c) exposed genes of Daphnia magna female samples related to their total offspring production. Correlation coefficients (r), levels of significance (p-value) and interpretation of the underlying experimental phenomena allowed the discussion about the best approach for variable selection in each case. The comparison of the two variable selection methods in the first water quality data set showed that the SR method is more accurate for sensorial prediction. For the climate data set, when raw total ion current (TIC) GC-MS chromatograms were considered, variables selected using the VIP method were easier to interpret compared with those selected by the SR method. However, when only some chromatographic peak areas (concentrations) were considered, the SR method was more efficient for prediction, and the VIP method selected the most relevant variables for the interpretation of SST changes. Finally, for the transcriptomic data set, the SR method was found again to be more reliable for prediction purposes. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:528 / 536
页数:9
相关论文
共 50 条
  • [2] Industrial PLS model variable selection using moving window variable importance in projection
    Lu, Bo
    Castillo, Ivan
    Chiang, Leo
    Edgar, Thomas F.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 135 : 90 - 109
  • [3] Discriminating Variable Test and Selectivity Ratio Plot: Quantitative Tools for Interpretation and Variable (Biomarker) Selection in Complex Spectral or Chromatographic Profiles
    Rajalahti, Tarja
    Arneberg, Reidar
    Kroksveen, Ann C.
    Berle, Magnus
    Myhr, Kjell-Morten
    Kvalheim, Olav M.
    ANALYTICAL CHEMISTRY, 2009, 81 (07) : 2581 - 2590
  • [4] A variable importance criterion for variable selection in near-infrared spectral analysis
    Zhang, Jin
    Cui, Xiaoyu
    Cai, Wensheng
    Shao, Xueguang
    SCIENCE CHINA-CHEMISTRY, 2019, 62 (02) : 271 - 279
  • [5] Variable Selection Methods in Spectral Data Analysis
    Li Yan-kun
    Dong Ru-nan
    Zhang Jin
    Huang Ke-nan
    Mao Zhi-yi
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41 (11) : 3331 - 3338
  • [6] Variable influence on projection (VIP) for orthogonal projections to latent structures (OPLS)
    Galindo-Prieto, Beatriz
    Eriksson, Lennart
    Trygg, Johan
    JOURNAL OF CHEMOMETRICS, 2014, 28 (08) : 623 - 632
  • [7] Boosting model performance and interpretation by entangling preprocessing selection and variable selection
    Gerretzen, Jan
    Szymanska, Ewa
    Bart, Jacob
    Davies, Antony N.
    van Manen, Henk-Jan
    van den Heuvel, Edwin R.
    Jansen, Jeroen J.
    Buydens, Lutgarde M. C.
    ANALYTICA CHIMICA ACTA, 2016, 938 : 44 - 52
  • [8] A Spatial Model for Repairing of the Dam Safety Monitoring Data Combining the Variable Importance for Projection (VIP) and Cokriging Methods
    Li, Shiwan
    Li, Yanling
    Lu, Xiang
    Wu, Zhenyu
    Pei, Liang
    Liu, Kexin
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [9] Review of Variable Selection Methods for Discriminant-Type Problems in Chemometrics
    Sorochan Armstrong, Michael D.
    de la Mata, A. Paulina
    Harynuk, James J.
    FRONTIERS IN ANALYTICAL SCIENCE, 2022, 2
  • [10] Variable influence on projection (VIP) for OPLS models and its applicability in multivariate time series analysis
    Galindo-Prieto, Beatriz
    Eriksson, Lennart
    Trygg, Johan
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 146 : 297 - 304