Analysing spectroscopy data using two-step group penalized partial least squares regression

被引:2
|
作者
Chang, Le [1 ]
Wang, Jiali [2 ]
Woodgate, William [3 ,4 ]
机构
[1] Australia Natl Univ, Coll Business & Econ, Res Sch Finance Acturial Studies & Stat, Canberra, ACT, Australia
[2] Commonwealth Sci & Ind Res Org, Data61, Canberra, ACT, Australia
[3] Commonwealth Sci & Ind Res Org, Land & Water, Canberra, ACT, Australia
[4] Univ Queensland, Sch Earth & Environm Sci, Brisbane, Qld, Australia
基金
澳大利亚研究理事会;
关键词
Dimension reduction; Group lasso; Partial least squares regression; Reflectance spectrum; Spectroscopy; PRINCIPAL COMPONENT; GROUP LASSO; CLASSIFICATION; INDEX;
D O I
10.1007/s10651-021-00496-2
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
A statistical challenge to analyse hyperspectral data is the multicollinearity between spectral bands. Partial least squares (PLS) has been extensively used as a dimensionality reduction technique through constructing lower dimensional latent variables from the spectral bands that correlate with the response variables. However, it does not take into account the grouping structure of the full spectrum where spectral subsets may exhibit distinct relationships with the response variables. We propose a two-step group penalized PLS regression approach by performing a PLS regression on each group of predictors identified from a clustering approach in the first step. In the second step, a group penalty is imposed on the latent components to select the group with the highest predictive power. Our proposed method demonstrated a superior prediction performance, higher R-squared value and faster computation time over other PLS variations when applied to simulations and a real-world observational data set. Interpretations of the model performance are illustrated using the real-world data example of leaf spectra to indirectly quantify leaf traits. The method is implemented in an R package called "groupPLS", which is accessible from github.com/jialiwang1211/groupPLS.
引用
收藏
页码:445 / 467
页数:23
相关论文
共 50 条
  • [21] Detection of DNA copy number alterations using penalized least squares regression
    Huang, T
    Wu, BL
    Lizardi, P
    Zhao, HY
    BIOINFORMATICS, 2005, 21 (20) : 3811 - 3817
  • [22] Sensory profiling data studied by partial least squares regression
    Martens, M
    Bredie, WLP
    Martens, H
    FOOD QUALITY AND PREFERENCE, 2000, 11 (1-2) : 147 - 149
  • [23] SAS® partial least squares regression for analysis of spectroscopic data
    Reeves, JB
    Delwiche, SR
    JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2003, 11 (06) : 415 - 431
  • [24] Multivariate modelling of the pharmaceutical two-step process of wet granulation and tableting with multiblock partial least squares
    Westerhuis, JA
    Coenegracht, PMJ
    JOURNAL OF CHEMOMETRICS, 1997, 11 (05) : 379 - 392
  • [25] An improved partial least-squares regression method for Raman spectroscopy
    Monfared, Ali Momenpour Tehran
    Anis, Hanan
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2017, 185 : 98 - 103
  • [26] Quantification of brain lipids by FTIR spectroscopy and partial least squares regression
    Dreissig, Isabell
    Machill, Susanne
    Salzer, Reiner
    Krafft, Christoph
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2009, 71 (05) : 2069 - 2075
  • [27] Penalized Partial Least Squares with applications to B-spline transformations and functional data
    Kraemer, Nicole
    Boulesteix, Anne-Laure
    Tutz, Gerhard
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2008, 94 (01) : 60 - 69
  • [28] Near-Infrared Spectroscopy Analytical Model Using Ensemble Partial Least Squares Regression
    Luo, Na
    Han, Ping
    Wang, Shifang
    Wang, Dong
    Zhao, Chunjiang
    ANALYTICAL LETTERS, 2019, 52 (11) : 1732 - 1756
  • [29] A Two-step Constrained Least Squares Localization in Wireless Sensor Networks
    Liu, Guangzhe
    Hua, Jingyu
    Li, Feng
    Zhang, Yu
    Xu, Zhijiang
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS (ICCCAS 2018), 2018, : 296 - 300
  • [30] LASSO-penalized clusterwise linear regression modelling: a two-step approach
    Di Mari, Roberto
    Rocci, Roberto
    Gattone, Stefano Antonio
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (18) : 3235 - 3258