Review of sparse methods in regression and classification with application to chemometrics

被引:93
|
作者
Filzmoser, Peter [1 ]
Gschwandtner, Moritz [1 ]
Todorov, Valentin [2 ]
机构
[1] Vienna Univ Technol, Inst Stat & Probabil Theory, A-1040 Vienna, Austria
[2] Vienna Int Ctr, UNIDO, A-1400 Vienna, Austria
关键词
sparse methods; high-dimensional data; partial least squares regression; discriminant analysis; principal component analysis; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; DIMENSION REDUCTION; PLS;
D O I
10.1002/cem.1418
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-dimensional data often contain many variables that are irrelevant for predicting a response or for an accurate group assignment. The inclusion of such variables in a regression or classification model leads to a loss in performance, even if the contribution of the variables to the model is small. Sparse methods for regression and classification are able to suppress these variables. This is possible by adding an appropriate penalty term to the objective function of the method. An overview of recent sparse methods for regression and classification is provided. The methods are applied to several high-dimensional data sets from chemometrics. A comparison with the non-sparse counterparts allows us to acquire an insight into their performance. Copyright (C) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:42 / 51
页数:10
相关论文
共 50 条
  • [31] Stacked interval sparse partial least squares regression analysis
    Poerio, Dominic V.
    Brown, Steven D.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 166 : 49 - 60
  • [32] Review of Modern Logistic Regression Methods with Application to Small and Medium Sample Size Problems
    Makalic, Enes
    Schmidt, Daniel Francis
    AI 2010: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2010, 6464 : 213 - 222
  • [33] Robust and sparse multigroup classification by the optimal scoring approach
    Ortner, Irene
    Filzmoser, Peter
    Croux, Christophe
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (03) : 723 - 741
  • [34] Application of chemometrics methods for the estimation of heavy metals contamination in river sediments
    Wang, YW
    Yuan, CG
    Jin, XL
    Jiang, GB
    JOURNAL OF ENVIRONMENTAL SCIENCES, 2005, 17 (04) : 540 - 544
  • [35] Sparse relative risk regression models
    Wit, Ernst C.
    Augugliaro, Luigi
    Pazira, Hassan
    Gonzalez, Javier
    Abegaz, Fentaw
    BIOSTATISTICS, 2020, 21 (02) : E131 - E147
  • [36] Sequential Scaled Sparse Factor Regression
    Zheng, Zemin
    Li, Yang
    Wu, Jie
    Wang, Yuchen
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2022, 40 (02) : 595 - 604
  • [37] Structured sparse logistic regression with application to lung cancer prediction using breath volatile biomarkers
    Zhang, Xiaochen
    Zhang, Qingzhao
    Wang, Xiaofeng
    Ma, Shuangge
    Fang, Kuangnan
    STATISTICS IN MEDICINE, 2020, 39 (07) : 955 - 967
  • [39] Variable selection in gamma regression model using chaotic firefly algorithm with application in chemometrics
    Alkhateeb, Ahmed Naziyah
    Algamar, Zakariya Yahya
    ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS, 2021, 14 (01) : 266 - 276
  • [40] Incorporating Symmetric Smooth Regularizations into Sparse Logistic Regression for Classification and Feature Extraction
    Wang, Jing
    Xie, Xiao
    Wang, Pengwei
    Sun, Jian
    Liu, Yaochen
    Zhang, Li
    SYMMETRY-BASEL, 2025, 17 (02):