Robust and sparse logistic regression

Times cited: 0
Authors
Cornilly, Dries [1 ,3 ]
Tubex, Lise [2 ]
Van Aelst, Stefan [1 ]
Verdonck, Tim [1 ,2 ]
Affiliations
[1] Katholieke Univ Leuven, Dept Math, Celestijnenlaan 200B, B-3001 Leuven, Belgium
[2] Univ Antwerp, imec, Dept Math, Middelheimlaan 1, B-2020 Antwerp, Belgium
[3] Asteria IM, Rue Lausanne 15, CH-1202 Geneva, Switzerland
Keywords
Elastic net; gamma-divergence; Logistic regression; Robustness; Sparsity; Variable selection; Regularization; Model
DOI
10.1007/s11634-023-00572-4
Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification code
020208 ; 070103 ; 0714 ;
Abstract
Logistic regression is one of the most popular statistical techniques for solving (binary) classification problems in various applications (e.g. credit scoring, cancer detection, ad click predictions and churn classification). Typically, the maximum likelihood estimator is used, which is very sensitive to outlying observations. In this paper, we propose a robust and sparse logistic regression estimator where robustness is achieved by means of the gamma-divergence. An elastic net penalty ensures sparsity in the regression coefficients such that the model is more stable and interpretable. We show that the influence function is bounded and demonstrate its robustness properties in simulations. The good performance of the proposed estimator is also illustrated in an empirical application that deals with classifying the type of fuel used by cars.
Pages: 663-679
Number of pages: 17
Related papers
50 in total
  • [31] Large-Scale Sparse Logistic Regression
    Liu, Jun
    Chen, Jianhui
    Ye, Jieping
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 547 - 555
  • [32] Stochastic DCA for Sparse Multiclass Logistic Regression
    Hoai An Le Thi
    Hoai Minh Le
    Duy Nhat Phan
    Bach Tran
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, ICCSAMA 2017, 2018, 629 : 1 - 12
  • [33] Logistic regression with sparse common and distinctive covariates
    S. Park
    E. Ceulemans
    K. Van Deun
    Behavior Research Methods, 2023, 55 : 4143 - 4174
  • [34] Multiclass Classification by Sparse Multinomial Logistic Regression
    Abramovich, Felix
    Grinshtein, Vadim
    Levy, Tomer
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (07) : 4637 - 4646
  • [35] Differentially Private Logistic Regression with Sparse Solutions
    Khanna, Amol
    Lu, Fred
    Raff, Edward
    Testa, Brian
    PROCEEDINGS OF THE 16TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2023, 2023, : 1 - 9
  • [36] A Safe Screening Rule for Sparse Logistic Regression
    Wang, Jie
    Zhou, Jiayu
    Liu, Jun
    Wonka, Peter
    Ye, Jieping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [37] Logistic Regression Under Sparse Data Conditions
    Walker, David A.
    Smith, Thomas J.
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2019, 18 (02)
  • [38] Algorithm for the Robust Estimation in Logistic Regression
    Kim, Bu-Yong
    Kahng, Myung Wook
    Choi, Mi-Ae
    KOREAN JOURNAL OF APPLIED STATISTICS, 2007, 20 (03) : 551 - 559
  • [39] Robust estimation in the logistic regression model
    Kordzakhia, N
    Mishra, GD
    Reiersolmoen, L
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2001, 98 (1-2) : 211 - 223
  • [40] Greedy Projected Gradient-Newton Method for Sparse Logistic Regression
    Wang, Rui
    Xiu, Naihua
    Zhang, Chao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 527 - 538