Robust adaptive LASSO in high-dimensional logistic regression

被引:0
|
作者
Basu, Ayanendranath [1 ]
Ghosh, Abhik [1 ]
Jaenada, Maria [2 ]
Pardo, Leandro [2 ]
机构
[1] Indian Stat Inst, Interdisciplinary Stat Res Unit, 203 BT Rd, Kolkata 700108, India
[2] Univ Complutense Madrid, Stat & OR, Plaza Ciencias 3, Madrid 28040, Spain
关键词
Density power divergence; High-dimensional data; Logistic regression; Oracle properties; Variable selection; VARIABLE SELECTION; GENE SELECTION; SPARSE REGRESSION; CLASSIFICATION; CANCER; MICROARRAYS; LIKELIHOOD; ALGORITHM; MODELS;
D O I
10.1007/s10260-024-00760-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Penalized logistic regression is extremely useful for binary classification with large number of covariates (higher than the sample size), having several real life applications, including genomic disease classification. However, the existing methods based on the likelihood loss function are sensitive to data contamination and other noise and, hence, robust methods are needed for stable and more accurate inference. In this paper, we propose a family of robust estimators for sparse logistic models utilizing the popular density power divergence based loss function and the general adaptively weighted LASSO penalties. We study the local robustness of the proposed estimators through its influence function and also derive its oracle properties and asymptotic distribution. With extensive empirical illustrations, we demonstrate the significantly improved performance of our proposed estimators over the existing ones with particular gain in robustness. Our proposal is finally applied to analyse four different real datasets for cancer classification, obtaining robust and accurate models, that simultaneously performs gene selection and patient classification.
引用
收藏
页码:1217 / 1249
页数:33
相关论文
共 50 条
  • [1] Robust adaptive LASSO in high-dimensional logistic regressionRobust adaptive LASSO in high-dimensional logistic regressionA. Basu et al.
    Ayanendranath Basu
    Abhik Ghosh
    Maria Jaenada
    Leandro Pardo
    Statistical Methods & Applications, 2024, 33 (5) : 1217 - 1249
  • [2] Penalized logistic regression with the adaptive LASSO for gene selection in high-dimensional cancer classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (23) : 9326 - 9332
  • [3] ADAPTIVE LASSO FOR SPARSE HIGH-DIMENSIONAL REGRESSION MODELS
    Huang, Jian
    Ma, Shuangge
    Zhang, Cun-Hui
    STATISTICA SINICA, 2008, 18 (04) : 1603 - 1618
  • [4] Fully Bayesian logistic regression with hyper-LASSO priors for high-dimensional feature selection
    Li, Longhai
    Yao, Weixin
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (14) : 2827 - 2851
  • [5] Minimum Distance Lasso for robust high-dimensional regression
    Lozano, Aurelie C.
    Meinshausen, Nicolai
    Yang, Eunho
    ELECTRONIC JOURNAL OF STATISTICS, 2016, 10 (01): : 1296 - 1340
  • [6] GFLASSO-LR: Logistic Regression with Generalized Fused LASSO for Gene Selection in High-Dimensional Cancer Classification
    Bir-Jmel, Ahmed
    Douiri, Sidi Mohamed
    Bernoussi, Souad El
    Maafiri, Ayyad
    Himeur, Yassine
    Atalla, Shadi
    Mansoor, Wathiq
    Al-Ahmad, Hussain
    COMPUTERS, 2024, 13 (04)
  • [7] Adaptive Lasso in high-dimensional settings
    Lin, Zhengyan
    Xiang, Yanbiao
    Zhang, Caiya
    JOURNAL OF NONPARAMETRIC STATISTICS, 2009, 21 (06) : 683 - 696
  • [8] Robust high-dimensional regression for data with anomalous responses
    Ren, Mingyang
    Zhang, Sanguo
    Zhang, Qingzhao
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (04) : 703 - 736
  • [9] Robust and sparse estimation methods for high-dimensional linear and logistic regression
    Kurnaz, Fatma Sevinc
    Hoffmann, Irene
    Filzmoser, Peter
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 172 : 211 - 222
  • [10] Robust Variable Selection with Optimality Guarantees for High-Dimensional Logistic Regression
    Insolia, Luca
    Kenney, Ana
    Calovi, Martina
    Chiaromonte, Francesca
    STATS, 2021, 4 (03): : 665 - 681