When the binary response variable contains an excess of zero counts, the data are imbalanced. Imbalanced data cause trouble for binary classification. To simplify the numerical computation to obtain the maximum likelihood estimators of the zero-inflated Bernoulli (ZIBer) model parameters with imbalanced data, an expectation-maximization (EM) algorithm is proposed to derive the maximum likelihood estimates of the model parameters. The logistic regression model links the Bernoulli probabilities with the covariates in the ZIBer model, and the prediction performance among the ZIBer model, LightGBM, and artificial neural network (ANN) procedures is compared by Monte Carlo simulation. The results show that no method can dominate the other methods regarding predictive performance under the imbalanced data. The LightGBM and ZIBer models are more competitive than the ANN model for zero-inflated-imbalanced data sets.
机构:
Polytech Inst Porto, ISEP, Rua Dr Antonio Bernardino Almeida, P-4249015 Porto, PortugalPolytech Inst Porto, ISEP, Rua Dr Antonio Bernardino Almeida, P-4249015 Porto, Portugal
Vieira, Pedro Marques
Rodrigues, Fatima
论文数: 0引用数: 0
h-index: 0
机构:
Polytech Inst Porto, ISEP, Rua Dr Antonio Bernardino Almeida, P-4249015 Porto, Portugal
Interdisciplinary Studies Res Ctr ISRC, Porto, PortugalPolytech Inst Porto, ISEP, Rua Dr Antonio Bernardino Almeida, P-4249015 Porto, Portugal
机构:
Chongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R ChinaChongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R China
Zheng, Jian
Ren, Shumiao
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Polytech Inst, Coll Big Data, Chongqing 401320, Peoples R ChinaChongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R China
Ren, Shumiao
Zhang, Jingyue
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Polytech Inst, Coll Big Data, Chongqing 401320, Peoples R ChinaChongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R China
Zhang, Jingyue
Wang, Shiyan
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R ChinaChongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R China
Wang, Shiyan
Li, Lin
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Three Gorges Univ, Coll Math & Stat, Chongqing 404100, Peoples R ChinaChongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R China
机构:
Chongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R ChinaChongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R China
Zheng, Jian
Hu, Xin
论文数: 0引用数: 0
h-index: 0
机构:
Yangtze Normal Univ, Coll Big Data & Intelligent Engn, Chongqing 408100, Peoples R ChinaChongqing Technol & Business Univ, Sch Artificial Intelligence, Chongqing 400067, Peoples R China
机构:
Chongqing Technol & Business Inst, Chongqing 404000, Peoples R ChinaChongqing Technol & Business Inst, Chongqing 404000, Peoples R China
Wang, Qingling
Zheng, Jian
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Technol & Business Univ, Coll Artificial Intelligence, Chongqing, Peoples R ChinaChongqing Technol & Business Inst, Chongqing 404000, Peoples R China
Zheng, Jian
Zhang, Wenjing
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Ind Polytech Coll, Chongqing, Peoples R ChinaChongqing Technol & Business Inst, Chongqing 404000, Peoples R China
机构:
Yonsei Univ, Div Biostat, Dept Biomed Syst Informat, Coll Med, 50-1 Yonsei Ro, Seoul 03722, South KoreaYonsei Univ, Div Biostat, Dept Biomed Syst Informat, Coll Med, 50-1 Yonsei Ro, Seoul 03722, South Korea
Park, Geun U.
Jun, Inkyun G.
论文数: 0引用数: 0
h-index: 0
机构:
Yonsei Univ, Div Biostat, Dept Biomed Syst Informat, Coll Med, 50-1 Yonsei Ro, Seoul 03722, South KoreaYonsei Univ, Div Biostat, Dept Biomed Syst Informat, Coll Med, 50-1 Yonsei Ro, Seoul 03722, South Korea