Prediction of death status on the course of treatment in SARS-COV-2 patients with deep learning and machine learning methods

被引:27
作者
Kivrak, Mehmet [1 ]
Guldogan, Emek [1 ]
Colak, Cemil [1 ]
机构
[1] Inonu Univ, Fac Med, Dept Biostat & Med Informat, Malatya, Turkey
关键词
SARS-COV-2; Data Mining; Deep Learning; Extreme Gradient Boosting; Machine Learning; WUHAN;
D O I
10.1016/j.cmpb.2021.105951
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: The new type of Coronavirus (2019-nCov) epidemic spread rapidly, causing more than 250 thousand deaths worldwide. The virus, which first appeared as a sign of pneumonia, was later called the SARS-COV-2 with Severe Acute Respiratory Syndrome by the World Health Organization. The SARS-COV-2 virus is triggered by binding to the Angiotensin-Converting Enzyme 2 (ACE 2) inhibitor, which is vital in cardiovascular diseases and the immune system, especially in conditions such as cerebrovascular, hypertension, and diabetes. This study aims to evaluate the prediction performance of death status based on the demographic/clinical factors (including COVID-19 severity) by data mining methods. Methods: The dataset consists of 1603 SARS-COV-2 patients and 13 variables obtained from an open source web address. The current dataset contains age, gender, chronic disease (hypertension, diabetes, renal, cardiovascular, etc.), some enzymes (ACE, angiotensin II receptor blockers), and COVID-19 severity, which are used to predict death status using deep learning and machine learning approaches (random forest, k-nearest neighbor, extreme gradient boosting [XGBoost]). A grid search algorithm tunes hyperparameters of the models, and predictions are assessed through performance metrics. Steps of knowledge discovery in databases are applied to obtain the relevant information. Results: The accuracy rate of deep learning (97.15%) was more successful than the accuracy rate based on classical machine learning (92.15% for RF and 93.4% for k-NN), but the ensemble classifier XGBoost method gave the highest accuracy (99.7%). While COVID-19 severity and age calculated from XGBoost were the two most important factors associated with death status, the most determining variables for death status estimated from deep learning were COVID-19 severity and hypertension. Conclusions: The proposed model (XGBoost) achieved the best prediction of death status based on the factors as compared to the other algorithms. The results of this study can guide patients with certain variables to take early measures and access preventive health care services before they become infected with the virus. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 30 条
  • [11] Campbell M., 2019, RSTUDIO PROJECTS LEA, P39, DOI 10.1007/978-1-4842-4511-8_4
  • [12] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [13] Common cardiovascular risk factors and in-hospital mortality in 3,894 patients with COVID-19: survival analysis and machine learning-based findings from the multicentre Italian CORIST Study
    Di Castelnuovo, Augusto
    Bonaccio, Marialaura
    Costanzo, Simona
    Gialluisi, Alessandro
    Antinori, Andrea
    Berselli, Nausicaa
    Blandi, Lorenzo
    Bruno, Raffaele
    Cauda, Roberto
    Guaraldi, Giovanni
    My, Ilaria
    Menicanti, Lorenzo
    Parruti, Giustino
    Patti, Giuseppe
    Perlini, Stefano
    Santilli, Francesca
    Signorelli, Carlo
    Stefanini, Giulio G.
    Vergori, Alessandra
    Abdeddaim, Amina
    Ageno, Walter
    Agodi, Antonella
    Agostoni, Piergiuseppe
    Aiello, Luca
    Al Moghazi, Samir
    Aucella, Filippo
    Barbieri, Greta
    Bartoloni, Alessandro
    Bologna, Carolina
    Bonfanti, Paolo
    Brancati, Serena
    Cacciatore, Francesco
    Caiano, Lucia
    Cannata, Francesco
    Carrozzi, Laura
    Cascio, Antonio
    Cingolani, Antonella
    Cipollone, Francesco
    Colomba, Claudia
    Crisetti, Annalisa
    Crosta, Francesca
    Danzi, Gian B.
    D'Ardes, Damiano
    Donati, Katleen de Gaetano
    Di Gennaro, Francesco
    Di Palma, Gisella
    Di Tano, Giuseppe
    Fantoni, Massimo
    Filippini, Tommaso
    Fioretto, Paola
    [J]. NUTRITION METABOLISM AND CARDIOVASCULAR DISEASES, 2020, 30 (11) : 1899 - 1913
  • [14] Fayyad U, 2001, RELATIONAL DATA MINI, P28, DOI DOI 10.1007/978-3-662-04599-2_2
  • [15] Recent advances in convolutional neural networks
    Gu, Jiuxiang
    Wang, Zhenhua
    Kuen, Jason
    Ma, Lianyang
    Shahroudy, Amir
    Shuai, Bing
    Liu, Ting
    Wang, Xingxing
    Wang, Gang
    Cai, Jianfei
    Chen, Tsuhan
    [J]. PATTERN RECOGNITION, 2018, 77 : 354 - 377
  • [16] Hofmann M., 2016, RAPIDMINER DATA MINI
  • [17] Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China
    Huang, Chaolin
    Wang, Yeming
    Li, Xingwang
    Ren, Lili
    Zhao, Jianping
    Hu, Yi
    Zhang, Li
    Fan, Guohui
    Xu, Jiuyang
    Gu, Xiaoying
    Cheng, Zhenshun
    Yu, Ting
    Xia, Jiaan
    Wei, Yuan
    Wu, Wenjuan
    Xie, Xuelei
    Yin, Wen
    Li, Hui
    Liu, Min
    Xiao, Yan
    Gao, Hong
    Guo, Li
    Xie, Jungang
    Wang, Guangfa
    Jiang, Rongmeng
    Gao, Zhancheng
    Jin, Qi
    Wang, Jianwei
    Cao, Bin
    [J]. LANCET, 2020, 395 (10223) : 497 - 506
  • [18] The continuing 2019-nCoV epidemic threat of novel coronaviruses to global health - The latest 2019 novel coronavirus outbreak in Wuhan, China
    Hui, David S.
    Azhar, Esam I.
    Madani, Tariq A.
    Ntoumi, Francine
    Kock, Richard
    Dar, Osman
    Ippolito, Giuseppe
    Mchugh, Timothy D.
    Memish, Ziad A.
    Drosten, Christian
    Zumla, Alimuddin
    Petersen, Eskild
    [J]. INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2020, 91 : 264 - 266
  • [19] Web spam classification method based on deep belief networks
    Li, Yuancheng
    Nie, Xiangqian
    Huang, Rong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 96 : 261 - 270
  • [20] Association between weather data and COVID-19 pandemic predicting mortality rate: Machine learning approaches
    Malki, Zohair
    Atlam, El-Sayed
    Hassanien, Aboul Ella
    Dagnew, Guesh
    Elhosseini, Mostafa A.
    Gad, Ibrahim
    [J]. CHAOS SOLITONS & FRACTALS, 2020, 138