Prediction of COVID-19 disease severity using synthetic data oversampling and machine learning methods on data at first hospitalization

被引:0
作者
Koksal, Kubra [1 ]
Dogan, Buket [1 ]
Altikardes, Zehra Aysun [2 ,3 ]
机构
[1] Marmara Univ, Fac Technol, Dept Comp Engn, TR-34722 Istanbul, Turkiye
[2] Marmara Univ, Vocat Sch Tech Sci, Dept Comp Technol, TR-34722 Istanbul, Turkiye
[3] Marmara Univ, Hypertens & Atherosclerosis Res Ctr HIPAM, Istanbul, Turkiye
来源
JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY | 2025年 / 40卷 / 01期
关键词
COVID-19; machine learning; laboratory data; prognosis;
D O I
10.17341/gazimmfd.1348341
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
COVID-19, originating in Wuhan, China, in December 2019 declared a pandemic by the World Health Organization on March 11, 2020, rapidly spread worldwide, significantly impacting human life and the healthsector. This study aims to develop a WHO (World Health Organization) oriented disease severity prediction modelusing laboratory and demographic data from COVID-19 patients upon admission to Marmara University Hospital.The relationship between oxygen and intensive care needs with laboratory results on the data set was analyzedusing K-nearest neighbor, Bagging, Random Forest and Decision Tree machine learning methods. The dataset'sunbalanced class distribution was balanced using the SMOTE method, and the impact of data multiplication onclassification performance was evaluated. In the data set without SMOTE, the patient's oxygen requirement during the first hospitalization was estimated with 16 features at 91.67% accuracy, the oxygen requirement athospitalization with 18 features at 91.96%, and the intensive care need at hospitalization with 12 features at 92.17%accuracy. After SMOTE data multiplication, an increase of 6%, 24% and 21% was observed in F1-Score values, respectively. This study significantly contributes to the field by utilizing machine learning methods on patient data,essential for COVID-19 diagnosis, monitoring, and clinical management through required laboratory tests.
引用
收藏
页码:413 / 427
页数:16
相关论文
共 43 条
  • [1] Akarsu E, 2022, Avrupa Bilim ve Teknoloji Dergisi, P13
  • [2] Random forest method for the recognition of susceptibility and resistance patterns in antibiograms
    Ayala-Aldana, Nicolas
    Gonzalez-Valdes, Leticia
    [J]. REVISTA CHILENA DE INFECTOLOGIA, 2023, 40 (01): : 76 - 77
  • [3] Aljameel SS, 2021, SCI PROGRAMMING-NETH, V2021, DOI [10.1155/2021/6494889, 10.1155/2021/5587188]
  • [4] Development of a machine learning algorithm to predict intubation among hospitalized patients with COVID-19
    Arvind, Varun
    Kim, Jun S.
    Cho, Brian H.
    Geng, Eric
    Cho, Samuel K.
    [J]. JOURNAL OF CRITICAL CARE, 2021, 62 : 25 - 30
  • [5] Aydilek I.B, 2018, Pamukkale Universitesi Muhendislik Bilimleri Dergisi, V24, pn31
  • [6] Use of Machine Learning and Artificial Intelligence to predict SARS-CoV-2 infection from Full Blood Counts in a population
    Banerjee, Abhirup
    Ray, Surajit
    Vorselaars, Bart
    Kitson, Joanne
    Mamalakis, Michail
    Weeks, Simonne
    Baker, Mark
    Mackenzie, Louise S.
    [J]. INTERNATIONAL IMMUNOPHARMACOLOGY, 2020, 86
  • [7] Bhargava N., 2017, 2017 2 INT C COMM EL, P32
  • [8] Breiman L, 1996, Machine learning, V24, P36
  • [9] Prediction of respiratory decompensation in Covid-19 patients using machine learning: The READY trial
    Burdick, Hoyt
    Lam, Carson
    Mataraso, Samson
    Siefkas, Anna
    Braden, Gregory
    Dellinger, R. Phillip
    McCoy, Andrea
    Vincent, Jean-Louis
    Green-Saxena, Abigail
    Barnes, Gina
    Hoffman, Jana
    Calvert, Jacob
    Pellegrini, Emily
    Das, Ritankar
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 124
  • [10] Development, evaluation, and validation of machine learning models for COVID-19 detection based on routine blood tests
    Cabitza, Federico
    Campagner, Andrea
    Ferrari, Davide
    Di Resta, Chiara
    Ceriotti, Daniele
    Sabetta, Eleonora
    Colombini, Alessandra
    De Vecchi, Elena
    Banfi, Giuseppe
    Locatelli, Massimo
    Carobene, Anna
    [J]. CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2021, 59 (02) : 421 - 431