Development of decision tree classification algorithms in predicting mortality of COVID-19 patients

Cited by: 1
Authors
Mohammadi-Pirouz, Zahra [1]
Hajian-Tilaki, Karimollah [2, 3]
Sadeghi Haddat-Zavareh, Mahmoud [4]
Amoozadeh, Abazar [3]
Bahrami, Shabnam [1]
Affiliations
[1] Babol Univ Med Sci, Res Inst, Student Res Ctr, Babol, Iran
[2] Babol Univ Med Sci, Sch Publ Hlth, Dept Biostat & Epidemiol, Babol, Iran
[3] Babol Univ Med Sci, Res Inst, Social Determinants Hlth Res Ctr, Babol, Iran
[4] Babol Univ Med Sci, Ayatollah Rohani Hosp, Dept Infect Dis, Babol, Iran
Keywords
Decision tree; CART; C5.0; CHAID; Logistic regression; COVID-19; Mortality; Predictive factors
DOI
10.1186/s12245-024-00681-7
Chinese Library Classification number
R4 [Clinical Medicine]
Subject classification code
1002; 100602
Abstract
Introduction: Accurate prediction of COVID-19 mortality risk, taking influencing factors into account, is crucial for guiding public policies that alleviate the strain on the healthcare system. This study aimed to assess how well decision tree algorithms (CART, C5.0, and CHAID) predict COVID-19 mortality risk and to compare their performance with that of a logistic regression model.
Methods: This retrospective cohort study examined 5080 COVID-19 cases in Babol, a city in northern Iran, who tested positive for the virus by PCR between March 2020 and March 2022. To check the validity of the findings, the data were randomly divided into an 80% training set and a 20% testing set. The prediction models (logistic regression and the decision tree algorithms) were trained on the training set and evaluated on the testing set. Performance on the test samples was assessed using the ROC curve, AUC, sensitivity, and specificity.
Results: The mortality rate among COVID-19 patients admitted to hospital was 7.7%. Cross-validation showed that the CHAID algorithm outperformed the other decision tree algorithms and logistic regression in specificity and precision, but not in sensitivity, when predicting the risk of COVID-19 mortality. CHAID achieved a specificity, precision, accuracy, and F-score of 0.98, 0.70, 0.95, and 0.52, respectively. All models identified ICU hospitalization, intubation, age, kidney disease, BUN, CRP, WBC, NLR, O2 saturation, and hemoglobin among the factors that influenced mortality in COVID-19 patients.
Conclusions: The CART and C5.0 models performed better in sensitivity, whereas CHAID performed better than the other decision tree algorithms in specificity, precision, and accuracy, and showed a slight improvement over logistic regression in predicting the risk of COVID-19 mortality in the study population.
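The evaluation setup described in the abstract (a random 80/20 split, then sensitivity, specificity, precision, accuracy, and F-score on the held-out set) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the seed, and the confusion-matrix counts in the test are hypothetical, and the F-score is assumed to be the standard F1 (harmonic mean of precision and sensitivity), which the abstract does not state explicitly.

```python
import random


def split_train_test(records, test_fraction=0.2, seed=0):
    """Shuffle a copy of `records` and return (train, test) lists.

    Hypothetical helper; the abstract only says the split was random 80/20.
    """
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def classification_metrics(tp, fp, tn, fn):
    """Standard metrics from confusion-matrix counts (death = positive class).

    Assumes every denominator is non-zero.
    """
    sensitivity = tp / (tp + fn)   # recall: deaths correctly flagged
    specificity = tn / (tn + fp)   # survivors correctly flagged
    precision = tp / (tp + fp)     # flagged deaths that were real
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "accuracy": accuracy,
        "f1": f1,
    }


def implied_sensitivity(precision, f1):
    """Solve F1 = 2PR / (P + R) for R, given precision P and F1."""
    return f1 * precision / (2 * precision - f1)


# 5080 cases split 80/20, as in the abstract (indices stand in for patients).
train, test = split_train_test(range(5080))
print(len(train), len(test))  # 4064 1016

# Sensitivity implied by the reported CHAID precision and F-score,
# under the F1 assumption and ignoring rounding of the reported values.
print(round(implied_sensitivity(0.70, 0.52), 2))  # 0.41
```

Under those assumptions, the reported CHAID precision of 0.70 and F-score of 0.52 correspond to a sensitivity of roughly 0.41, which is consistent with the abstract's statement that CHAID did not lead on sensitivity.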
Pages: 18