Improving Penalized Logistic Regression Model with Missing Values in High-Dimensional Data

被引:2
|
作者
Alharthi, Aiedh Mrisi [1 ,2 ]
Lee, Muhammad Hisyam [1 ]
Algamal, Zakariya Yahya [3 ]
机构
[1] Univ Teknol Malaysia, Dept Math Sci, Skudai, Malaysia
[2] Taif Univ, Dept Math, At Taif, Saudi Arabia
[3] Univ Mosul, Dept Stat & Informat, Mosul, Iraq
关键词
high-dimensional data; feature selection; missing data; multiple imputations; penalized regression; MULTIPLE IMPUTATION; VARIABLE SELECTION; ALGORITHM; REGULARIZATION;
D O I
10.3991/ijoe.v18i02.25047
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
without adequate handling of missing values may lead to inconsistent and biased estimates. Despite multiple imputations becoming a widely used approach in handling missing data, manuscript researchers generally encounter missing data in their respective studies. In high-dimensional data, penalized regression is a popular technique for performing feature selection and coefficient estimation simultaneously. However, one of the most vital issues with high-dimensional data is that it often contains large quantities of missing data that common multiple imputation approaches may not work correctly. Therefore, this study uses imputations penalized regression models as an extension of the penalized methods to improve the performance and impute missing values in high-dimensional data. The method was applied to real-life high dimensional datasets for the different number of features, sample sizes, and missing dataset rates to evaluate its efficiency. The method was also compared with other existing imputation penalized methods for high-dimensional data. The comparative experimental results indicate that the proposed method outperforms its competitors by achieving higher sensitivity, specificity, and classification accuracy values.
引用
收藏
页码:40 / 54
页数:15
相关论文
共 50 条
  • [21] Debiased inference for heterogeneous subpopulations in a high-dimensional logistic regression model
    Kim, Hyunjin
    Lee, Eun Ryung
    Park, Seyoung
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [22] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Shi, Yue Yong
    Jiao, Yu Ling
    Cao, Yong Xiu
    Liu, Yan Yan
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2018, 34 (12) : 1892 - 1906
  • [23] SCAD-penalized quantile regression for high-dimensional data analysis and variable selection
    Amin, Muhammad
    Song, Lixin
    Thorlie, Milton Abdul
    Wang, Xiaoguang
    STATISTICA NEERLANDICA, 2015, 69 (03) : 212 - 235
  • [24] Performance Comparison of Penalized Regression Methods in Poisson Regression under High-Dimensional Sparse Data with Multicollinearity
    Choosawat, Chutikarn
    Reangsephet, Orawan
    Srisuradetchai, Patchanok
    Lisawadi, Supranee
    THAILAND STATISTICIAN, 2020, 18 (03): : 306 - 318
  • [25] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Yue Yong SHI
    Yu Ling JIAO
    Yong Xiu CAO
    Yan Yan LIU
    Acta Mathematica Sinica,English Series, 2018, 34 (12) : 1892 - 1906
  • [26] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Yue Yong SHI
    Yu Ling JIAO
    Yong Xiu CAO
    Yan Yan LIU
    ActaMathematicaSinica, 2018, 34 (12) : 1892 - 1906
  • [27] An Alternating Direction Method of Multipliers for MCP-penalized Regression with High-dimensional Data
    Yue Yong Shi
    Yu Ling Jiao
    Yong Xiu Cao
    Yan Yan Liu
    Acta Mathematica Sinica, English Series, 2018, 34 : 1892 - 1906
  • [28] Vanishing deviance problem in high-dimensional penalized Cox regression
    Yao, Sijie
    Li, Tingyi
    Cao, Biwei
    Wang, Xuefeng
    CANCER RESEARCH, 2023, 83 (07)
  • [29] High-Dimensional Censored Regression via the Penalized Tobit Likelihood
    Jacobson, Tate
    Zou, Hui
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2024, 42 (01) : 286 - 297
  • [30] Semi-Supervised Factored Logistic Regression for High-Dimensional Neuroimaging Data
    Bzdok, Danilo
    Eickenberg, Michael
    Grisel, Olivier
    Thirion, Bertrand
    Varoquaux, Gael
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28