Comparison of penalized logistic regression models for rare event case

被引:3
作者
Olmus, Hulya [1 ]
Nazman, Ezgi [1 ]
Erbas, Semra [2 ]
机构
[1] Gazi Univ, Stat, Ankara, Turkey
[2] Univ Kyrenia, Fac Arts & Sci, Karakum, Northern Cyprus, Turkey
关键词
Firth LR; FLIC; FLAC; Rare event; Predicted probability bias; BIAS; ESTIMATORS; REDUCTION; RATIO;
D O I
10.1080/03610918.2019.1676438
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The occurrence rate of the event of interest might be quite small (rare) in some cases, although sample size is large enough for Binary Logistic Regression (LR) model. In studies where the sample size is not large enough, the parameters to be estimated might be biased because of rare event case. Parameter estimations of LR model are usually obtained using Newton?Raphson (NR) algorithm for Maximum Likelihood Estimation (MLE). It is known that these estimations are usually biased in small samples but asymptotically unbiased. On the other hand, initial parameter values are sensitive for parameter estimation in NR for MLE. Our aim of the study is to present an approach on parameter estimation bias using inverse conditional distributions based on distribution assumption giving true parameter values and to compare this approach on different penalized LR methods. With this aim, LR, Firth LR, FLIC and FLAC methods were compared in terms of parameter estimation bias, predicted probability bias and Root Mean Squared Error (RMSE) for different sample sizes, event and correlation rates conducting a detailed Monte Carlo simulation study. Findings suggest that FLIC method should be preferred in rare event and small sample cases.
引用
收藏
页码:1578 / 1590
页数:13
相关论文
共 50 条
[41]   Block Gibbs samplers for logistic mixed models: Convergence properties and a comparison with full Gibbs samplers [J].
Rao, Yalin ;
Roy, Vivekananda .
ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02) :5598-5625
[42]   Bias-corrected estimates for logistic regression models for complex surveys with application to the United States' Nationwide Inpatient Sample [J].
Rader, Kevin A. ;
Lipsitz, Stuart R. ;
Fitzmaurice, Garrett M. ;
Harrington, David P. ;
Parzen, Michael ;
Sinha, Debajyoti .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (05) :2257-2269
[43]   Bayesian generalized linear low rank regression models for the detection of vaccine-adverse event associations [J].
Hauser, Paloma ;
Tan, Xianming ;
Chen, Fang ;
Ibrahim, Joseph G. .
STATISTICS IN MEDICINE, 2023, :2009-2026
[44]   Impact of Missing Data on the Detection of Differential Item Functioning The Case of Mantel-Haenszel and Logistic Regression Analysis [J].
Robitzsch, Alexander ;
Rupp, Andre A. .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2009, 69 (01) :18-34
[45]   BIAS ADJUSTMENT WITH POLYCHOTOMOUS LOGISTIC-REGRESSION IN MATCHED CASE-CONTROL STUDIES WITH 2 CONTROL-GROUPS [J].
BECHER, H ;
JOCKEL, KH .
BIOMETRICAL JOURNAL, 1990, 32 (07) :801-816
[46]   Dynamic churn prediction framework with more effective use of rare event data: The case of private banking [J].
Ali, Ozden Gur ;
Ariturk, Umut .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (17) :7889-7903
[47]   Logistic Regression and Artificial Neural Network Models for Mapping of Regional-scale Landslide Susceptibility in Volcanic Mountains of West Java']Java (Indonesia) [J].
Ngadisih ;
Bhandary, Netra P. ;
Yatabe, Ryuichi ;
Dahal, Ranjan K. .
5TH INTERNATIONAL SYMPOSIUM ON EARTHHAZARD AND DISASTER MITIGATION, 2016, 1730
[48]   Handling Imbalanced Data With Weighted Logistic Regression and Propensity Score Matching methods: The Case of P2P Money Transfers [J].
Agrawal, Lavlin ;
Mulgund, Pavankumar ;
Sharman, Raj .
JOURNAL OF DATABASE MANAGEMENT, 2024, 35 (01) :1-37
[49]   Evaluation of Cox's model and logistic regression for matched case-control data with time-dependent covariates:: a simulation study [J].
Leffondré, K ;
Abrahamowicz, M ;
Siemiatycki, J .
STATISTICS IN MEDICINE, 2003, 22 (24) :3781-3794
[50]   Prediction of river damming susceptibility by landslides based on a logistic regression model and InSAR techniques: A case study of the Bailong River Basin, China [J].
Jin, Jiacheng ;
Chen, Guan ;
Meng, Xingmin ;
Zhang, Yi ;
Shi, Wei ;
Li, Yuanxi ;
Yang, Yunpeng ;
Jiang, Wanyu .
ENGINEERING GEOLOGY, 2022, 299