Review of sparse methods in regression and classification with application to chemometrics

被引:93
|
作者
Filzmoser, Peter [1 ]
Gschwandtner, Moritz [1 ]
Todorov, Valentin [2 ]
机构
[1] Vienna Univ Technol, Inst Stat & Probabil Theory, A-1040 Vienna, Austria
[2] Vienna Int Ctr, UNIDO, A-1400 Vienna, Austria
关键词
sparse methods; high-dimensional data; partial least squares regression; discriminant analysis; principal component analysis; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; DIMENSION REDUCTION; PLS;
D O I
10.1002/cem.1418
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-dimensional data often contain many variables that are irrelevant for predicting a response or for an accurate group assignment. The inclusion of such variables in a regression or classification model leads to a loss in performance, even if the contribution of the variables to the model is small. Sparse methods for regression and classification are able to suppress these variables. This is possible by adding an appropriate penalty term to the objective function of the method. An overview of recent sparse methods for regression and classification is provided. The methods are applied to several high-dimensional data sets from chemometrics. A comparison with the non-sparse counterparts allows us to acquire an insight into their performance. Copyright (C) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:42 / 51
页数:10
相关论文
共 50 条
  • [21] Obtaining insights from high-dimensional data: sparse principal covariates regression
    Van Deun, Katrijn
    Crompvoets, Elise A. V.
    Ceulemans, Eva
    BMC BIOINFORMATICS, 2018, 19
  • [22] Chemometrics on microchips: Towards the classification of wines
    Scampicchio, M
    Mannino, S
    Zima, J
    Wang, J
    ELECTROANALYSIS, 2005, 17 (13) : 1215 - 1221
  • [23] High-dimensional sparse vine copula regression with application to genomic prediction
    Sahin, Oezge
    Czado, Claudia
    BIOMETRICS, 2024, 80 (01)
  • [24] Model-based methods of classification: Using the mclust software in chemometrics
    Fraley, Chris
    Raftery, Adrian E.
    JOURNAL OF STATISTICAL SOFTWARE, 2007, 18 (06):
  • [25] STRUCTURED, SPARSE REGRESSION WITH APPLICATION TO HIV DRUG RESISTANCE
    Percival, Daniel
    Roeder, Kathryn
    Rosenfeld, Roni
    Wasserman, Larry
    ANNALS OF APPLIED STATISTICS, 2011, 5 (2A): : 628 - 644
  • [26] Rapid classification of rice in Northern Vietnam by usingFTIRspectroscopy combined with chemometrics methods
    Le Truong Giang
    Tran Lam Thanh Thien
    Dao Hai Yen
    VIETNAM JOURNAL OF CHEMISTRY, 2020, 58 (03) : 372 - 379
  • [27] Classification of local diesel fuels and simultaneous prediction of their physicochemical parameters using FTIR-ATR data and chemometrics
    Msimanga, Huggins Z.
    Dockery, Christopher R.
    Vandenbos, Deidre D.
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2022, 279
  • [28] Globally Sparse PLS Regression
    Liu, Tzu-Yu
    Trinchera, Laura
    Tenenhaus, Arthur
    Wei, Dennis
    Hero, Alfred O.
    NEW PERSPECTIVES IN PARTIAL LEAST SQUARES AND RELATED METHODS, 2013, 56 : 117 - 127
  • [29] Application and impact of Lasso regression in gastroenterology: A systematic review
    Hassam Ali
    Maria Shahzad
    Shiza Sarfraz
    Kerry B. Sewell
    Shehabaldin Alqalyoobi
    Babu P. Mohan
    Indian Journal of Gastroenterology, 2023, 42 : 780 - 790
  • [30] Application and impact of Lasso regression in gastroenterology: A systematic review
    Ali, Hassam
    Shahzad, Maria
    Sarfraz, Shiza
    Sewell, Kerry B. B.
    Alqalyoobi, Shehabaldin
    Mohan, Babu P. P.
    INDIAN JOURNAL OF GASTROENTEROLOGY, 2023, 42 (06) : 780 - 790