Analysing spectroscopy data using two-step group penalized partial least squares regression

被引:2
|
作者
Chang, Le [1 ]
Wang, Jiali [2 ]
Woodgate, William [3 ,4 ]
机构
[1] Australia Natl Univ, Coll Business & Econ, Res Sch Finance Acturial Studies & Stat, Canberra, ACT, Australia
[2] Commonwealth Sci & Ind Res Org, Data61, Canberra, ACT, Australia
[3] Commonwealth Sci & Ind Res Org, Land & Water, Canberra, ACT, Australia
[4] Univ Queensland, Sch Earth & Environm Sci, Brisbane, Qld, Australia
基金
澳大利亚研究理事会;
关键词
Dimension reduction; Group lasso; Partial least squares regression; Reflectance spectrum; Spectroscopy; PRINCIPAL COMPONENT; GROUP LASSO; CLASSIFICATION; INDEX;
D O I
10.1007/s10651-021-00496-2
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
A statistical challenge to analyse hyperspectral data is the multicollinearity between spectral bands. Partial least squares (PLS) has been extensively used as a dimensionality reduction technique through constructing lower dimensional latent variables from the spectral bands that correlate with the response variables. However, it does not take into account the grouping structure of the full spectrum where spectral subsets may exhibit distinct relationships with the response variables. We propose a two-step group penalized PLS regression approach by performing a PLS regression on each group of predictors identified from a clustering approach in the first step. In the second step, a group penalty is imposed on the latent components to select the group with the highest predictive power. Our proposed method demonstrated a superior prediction performance, higher R-squared value and faster computation time over other PLS variations when applied to simulations and a real-world observational data set. Interpretations of the model performance are illustrated using the real-world data example of leaf spectra to indirectly quantify leaf traits. The method is implemented in an R package called "groupPLS", which is accessible from github.com/jialiwang1211/groupPLS.
引用
收藏
页码:445 / 467
页数:23
相关论文
共 50 条
  • [1] Analysing spectroscopy data using two-step group penalized partial least squares regression
    Le Chang
    Jiali Wang
    William Woodgate
    Environmental and Ecological Statistics, 2021, 28 : 445 - 467
  • [2] Classification using partial least squares with penalized logistic regression
    Fort, G
    Lambert-Lacroix, S
    BIOINFORMATICS, 2005, 21 (07) : 1104 - 1111
  • [3] Two-Step Partial Least Squares for Monitoring Dynamic Processes
    Yuan, Zeyi
    Ma, Xin
    Qin, Yihao
    Wang, Youqing
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2808 - 2813
  • [4] A two-step estimation method for grouped data with connections to the extended growth curve model and partial least squares regression
    Li, Ying
    Uden, Peter
    von Rosen, Dietrich
    JOURNAL OF MULTIVARIATE ANALYSIS, 2015, 139 : 347 - 359
  • [5] Nonparametric regression estimation using penalized least squares
    Kohler, M
    Krzyzak, A
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (07) : 3054 - 3058
  • [6] Penalized Partial Least Squares for Multi-label Data
    Liu, Huawen
    Ma, Zongjie
    Zhao, Jianmin
    Zheng, Zhonglong
    2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), 2014, : 515 - 520
  • [7] A Two-Step Penalized Regression Method with Networked Predictors
    Chong Luo
    Wei Pan
    Xiaotong Shen
    Statistics in Biosciences, 2012, 4 (1) : 27 - 46
  • [8] A Two-Step Penalized Regression Method with Networked Predictors
    Luo, Chong
    Pan, Wei
    Shen, Xiaotong
    STATISTICS IN BIOSCIENCES, 2012, 4 (01) : 27 - 46
  • [9] Partial Least Squares Regression for Binary Data
    Vicente-Gonzalez, Laura
    Frutos-Bernal, Elisa
    Vicente-Villardon, Jose Luis
    MATHEMATICS, 2025, 13 (03)
  • [10] Generalized partial least squares regression based on the penalized minimum norm projection
    Li, BB
    Morris, AJ
    Martin, EB
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2004, 72 (01) : 21 - 26