Prediction of death status on the course of treatment in SARS-COV-2 patients with deep learning and machine learning methods

被引:27
作者
Kivrak, Mehmet [1 ]
Guldogan, Emek [1 ]
Colak, Cemil [1 ]
机构
[1] Inonu Univ, Fac Med, Dept Biostat & Med Informat, Malatya, Turkey
关键词
SARS-COV-2; Data Mining; Deep Learning; Extreme Gradient Boosting; Machine Learning; WUHAN;
D O I
10.1016/j.cmpb.2021.105951
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: The new type of Coronavirus (2019-nCov) epidemic spread rapidly, causing more than 250 thousand deaths worldwide. The virus, which first appeared as a sign of pneumonia, was later called the SARS-COV-2 with Severe Acute Respiratory Syndrome by the World Health Organization. The SARS-COV-2 virus is triggered by binding to the Angiotensin-Converting Enzyme 2 (ACE 2) inhibitor, which is vital in cardiovascular diseases and the immune system, especially in conditions such as cerebrovascular, hypertension, and diabetes. This study aims to evaluate the prediction performance of death status based on the demographic/clinical factors (including COVID-19 severity) by data mining methods. Methods: The dataset consists of 1603 SARS-COV-2 patients and 13 variables obtained from an open source web address. The current dataset contains age, gender, chronic disease (hypertension, diabetes, renal, cardiovascular, etc.), some enzymes (ACE, angiotensin II receptor blockers), and COVID-19 severity, which are used to predict death status using deep learning and machine learning approaches (random forest, k-nearest neighbor, extreme gradient boosting [XGBoost]). A grid search algorithm tunes hyperparameters of the models, and predictions are assessed through performance metrics. Steps of knowledge discovery in databases are applied to obtain the relevant information. Results: The accuracy rate of deep learning (97.15%) was more successful than the accuracy rate based on classical machine learning (92.15% for RF and 93.4% for k-NN), but the ensemble classifier XGBoost method gave the highest accuracy (99.7%). While COVID-19 severity and age calculated from XGBoost were the two most important factors associated with death status, the most determining variables for death status estimated from deep learning were COVID-19 severity and hypertension. Conclusions: The proposed model (XGBoost) achieved the best prediction of death status based on the factors as compared to the other algorithms. The results of this study can guide patients with certain variables to take early measures and access preventive health care services before they become infected with the virus. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 30 条
  • [1] A machine learning model to identify early stage symptoms of SARS-Cov-2 infected patients
    Ahamad, Md Martuza
    Aktar, Sakifa
    Rashed-Al-Mahfuz, Md
    Uddin, Shahadat
    Lio, Pietro
    Xu, Haoming
    Summers, Matthew A.
    Quinn, Julian M. W.
    Moni, Mohammad Ali
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
  • [2] Aktukun A., 2005, EKONOMETRI ISTATISTI, V1, P1
  • [3] Helping doctors hasten COVID-19 treatment: Towards a rescue framework for the transfusion of best convalescent plasma to the most critical patients based on biological requirements via ml and novel MCDM methods
    Albahri, O. S.
    Al-Obaidi, Jameel R.
    Zaidan, A. A.
    Albahri, A. S.
    Zaidan, B. B.
    Salih, Mahmood M.
    Qays, Abdulhadi
    Dawood, K. A.
    Mohammed, R. T.
    Abdulkareem, Karrar Hameed
    Aleesa, A. M.
    Alamoodi, A. H.
    Chyad, M. A.
    Zulkifli, Che Zalina
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 196
  • [4] A survey of cross-validation procedures for model selection
    Arlot, Sylvain
    Celisse, Alain
    [J]. STATISTICS SURVEYS, 2010, 4 : 40 - 79
  • [5] Presumed Asymptomatic Carrier Transmission of COVID-19
    Bai, Yan
    Yao, Lingsheng
    Wei, Tao
    Tian, Fei
    Jin, Dong-Yan
    Chen, Lijuan
    Wang, Meiyun
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2020, 323 (14): : 1406 - 1407
  • [6] Use of Machine Learning and Artificial Intelligence to predict SARS-CoV-2 infection from Full Blood Counts in a population
    Banerjee, Abhirup
    Ray, Surajit
    Vorselaars, Bart
    Kitson, Joanne
    Mamalakis, Michail
    Weeks, Simonne
    Baker, Mark
    Mackenzie, Louise S.
    [J]. INTERNATIONAL IMMUNOPHARMACOLOGY, 2020, 86
  • [7] Benavides C., 2020, EVALUATION COUNTRY D, P371
  • [8] Predictors of severe or lethal COVID-19, including Angiotensin Converting Enzyme inhibitors and Angiotensin II Receptor Blockers, in a sample of infected Italian citizens
    Bravi, Francesca
    Flacco, Maria Elena
    Carradori, Tiziano
    Volta, Carlo Alberto
    Cosenza, Giuseppe
    De Togni, Aldo
    Acuti Martellucci, Cecilia
    Parruti, Giustino
    Mantovani, Lorenzo
    Manzoli, Lamberto
    [J]. PLOS ONE, 2020, 15 (06):
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Explainable Deep Learning for Pulmonary Disease and Coronavirus COVID-19 Detection from X-rays
    Brunese, Luca
    Mercaldo, Francesco
    Reginelli, Alfonso
    Santone, Antonella
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 196