Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data

被引:0
作者
Krongchai, Chanida [1 ]
Wongsaipun, Sakunna [1 ]
Funsueb, Sujitra [1 ]
Theanjumpol, Parichat [2 ,3 ]
Jakmunee, Jaroon [1 ,4 ]
Kittiwachana, Sila [1 ,5 ]
机构
[1] Chiang Mai Univ, Fac Sci, Dept Chem, Chiang Mai 50200, Thailand
[2] Chiang Mai Univ, Fac Agr, Postharvest Technol Res Ctr, Chiang Mai 50200, Thailand
[3] Postharvest Technol Innovat Ctr, Off Higher Educ Commiss, Bangkok 10400, Thailand
[4] Chiang Mai Univ, Inst Sci & Technol Res & Dev, Chiang Mai 50200, Thailand
[5] Chiang Mai Univ, Fac Sci, Environm Sci Res Ctr, Chiang Mai 50200, Thailand
来源
CHIANG MAI JOURNAL OF SCIENCE | 2020年 / 47卷 / 01期
关键词
variable selection; multivariate calibration; partial least squares (PLS); self organizing map (SOM); spectral data analysis; PARTIAL LEAST-SQUARES; NIR SPECTROSCOPY; REGRESSION; PREDICTION; VIP;
D O I
暂无
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Variable selection aims to identify important parameters in relation to predicted responses. Selection outcomes of the important variables could be different depending on the methods used. In this research, the important variables identified using linear and non-linear variable selection methods based on partial least squares-variable important in prediction (PLS-VIP) and self organizing map-discrimination index (SOM-DI) were compared. Two datasets, near-infrared (NIR) spectra of adulterated Thai Jasmine rice and ultraviolet-visible (UV-Vis) spectra of food colorant mixtures were used for the demonstration. The advantages and disadvantages for the use of the different algorithms were compared and discussed. For the NIR data, the calibration model using supervised self organizing map (SSOM) offered better prediction results and the SOM-DI variable selection method identified the spectral changes in NIR overtone regions as significance. On the other hand, PLS calibration model resulted in higher predictive errors while the PLS-VIP variable selection captured variation from the visible region between 664 nm and 884 nm. Using the UV-Vis data, PLS appeared to put attention on only the highest absorbance region of the peak maximum absorbance. In contrast, SSOM model highlighted the variation around the isosbestic spectral regions between the mixture components. The drawback for the use of a mixture design to construct the calibration models, leading to wrong interpretation of the important variables, was also discussed.
引用
收藏
页码:160 / 174
页数:15
相关论文
共 26 条
  • [1] Variable selection in regression-a tutorial
    Andersen, C. M.
    Bro, R.
    [J]. JOURNAL OF CHEMOMETRICS, 2010, 24 (11-12) : 728 - 737
  • [2] Predictive-property-ranked variable reduction in partial least squares modelling with final complexity adapted models: Comparison of properties for ranking
    Andries, Jan P. M.
    Vander Heyden, Yvan
    Buydens, Lutgarde M. C.
    [J]. ANALYTICA CHIMICA ACTA, 2013, 760 : 34 - 45
  • [3] [Anonymous], 2009, CHEMOMETRICS PATTERN
  • [4] A bootstrap-based strategy for spectral interval selection in PLS regression
    Bras, Ligia P.
    Lopes, Marta
    Ferreira, Ana P.
    Menezes, Jose C.
    [J]. JOURNAL OF CHEMOMETRICS, 2008, 22 (11-12) : 695 - 700
  • [5] Brereton R.G., 2005, CHEMOMETRICS DATA AN
  • [6] Advances in quantitative UV-visible spectroscopy for clinical and pre-clinical application in cancer
    Brown, J. Quincy
    Vishwanath, Karthik
    Palmer, Gregory M.
    Ramanujam, Nirmala
    [J]. CURRENT OPINION IN BIOTECHNOLOGY, 2009, 20 (01) : 119 - 131
  • [7] Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation
    Farres, Mireia
    Platikanov, Stefan
    Tsakovski, Stefan
    Tauler, Roma
    [J]. JOURNAL OF CHEMOMETRICS, 2015, 29 (10) : 528 - 536
  • [8] Applications of NIR spectroscopy to monitoring and analyzing the solid state during industrial crystallization processes
    Févotte, G
    Calas, J
    Puel, F
    Hoff, C
    [J]. INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2004, 273 (1-2) : 159 - 169
  • [9] PARTIAL LEAST-SQUARES REGRESSION - A TUTORIAL
    GELADI, P
    KOWALSKI, BR
    [J]. ANALYTICA CHIMICA ACTA, 1986, 185 : 1 - 17
  • [10] Use of Partial Least Squares Regression for Variable Selection and Quality Prediction
    Jun, Chi-Hyuck
    Lee, Sang-Ho
    Park, Hae-Sang
    Lee, Jeong-Hwa
    [J]. CIE: 2009 INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2009, : 1302 - 1307