Robust and sparse logistic regression

被引:0
|
作者
Cornilly, Dries [1 ,3 ]
Tubex, Lise [2 ]
Van Aelst, Stefan [1 ]
Verdonck, Tim [1 ,2 ]
机构
[1] Katholieke Univ Leuven, Dept Math, Celestijnenlaan 200B, B-3001 Leuven, Belgium
[2] Univ Antwerp, imec, Dept Math, Middelheimlaan 1, B-2020 Antwerp, Belgium
[3] Asteria IM, Rue Lausanne 15, CH-1202 Geneva, Switzerland
关键词
Elastic net; gamma-divergence; Logistic regression; Robustness; Sparsity; VARIABLE SELECTION; REGULARIZATION; MODEL;
D O I
10.1007/s11634-023-00572-4
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Logistic regression is one of the most popular statistical techniques for solving (binary) classification problems in various applications (e.g. credit scoring, cancer detection, ad click predictions and churn classification). Typically, the maximum likelihood estimator is used, which is very sensitive to outlying observations. In this paper, we propose a robust and sparse logistic regression estimator where robustness is achieved by means of the gamma-divergence. An elastic net penalty ensures sparsity in the regression coefficients such that the model is more stable and interpretable. We show that the influence function is bounded and demonstrate its robustness properties in simulations. The good performance of the proposed estimator is also illustrated in an empirical application that deals with classifying the type of fuel used by cars.
引用
收藏
页码:663 / 679
页数:17
相关论文
共 50 条
  • [41] Robust sparse regression by modeling noise as a mixture of gaussians
    Xu, Shuang
    Zhang, Chun-Xia
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (10) : 1738 - 1755
  • [42] Joint sparse principal component regression with robust property
    Qi, Kai
    Tu, Jingwen
    Yang, Hu
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [43] Sparse logistic regression for whole-brain classification of fMRI data
    Ryali, Srikanth
    Supekar, Kaustubh
    Abrams, Daniel A.
    Menon, Vinod
    NEUROIMAGE, 2010, 51 (02) : 752 - 764
  • [44] High dimensional classification with combined adaptive sparse PLS and logistic regression
    Durif, Ghislain
    Modolo, Laurent
    Michaelsson, Jakob
    Mold, Jeff E.
    Lambert-Lacroix, Sophie
    Picard, Franck
    BIOINFORMATICS, 2018, 34 (03) : 485 - 493
  • [45] Robust adaptive LASSO in high-dimensional logistic regression
    Basu, Ayanendranath
    Ghosh, Abhik
    Jaenada, Maria
    Pardo, Leandro
    STATISTICAL METHODS AND APPLICATIONS, 2024, : 1217 - 1249
  • [46] Robust inference for generalized linear models with application to logistic regression
    Adimari, G
    Ventura, L
    STATISTICS & PROBABILITY LETTERS, 2001, 55 (04) : 413 - 419
  • [47] Robust and Sparse Regression via γ-Divergence
    Kawashima, Takayuki
    Fujisawa, Hironori
    ENTROPY, 2017, 19 (11):
  • [48] The evidence framework applied to sparse kernel logistic regression
    Cawley, GC
    Talbot, NLC
    NEUROCOMPUTING, 2005, 64 (64) : 119 - 135
  • [49] Clinical Risk Prediction with Multilinear Sparse Logistic Regression
    Wang, Fei
    Zhang, Ping
    Qian, Buyue
    Wang, Xiang
    Davidson, Ian
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 145 - 154
  • [50] Sparse partial robust M regression
    Hoffmann, Irene
    Serneels, Sven
    Filzmoser, Peter
    Croux, Christophe
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 149 : 50 - 59