A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data

被引:1
|
作者
Sethi, Sahil [1 ]
Shakyawar, Sushil [1 ]
Reddy, Athreya S. [2 ]
Patel, Jai Chand [1 ]
Guda, Chittibabu [1 ]
机构
[1] Univ Nebraska Med Ctr, Dept Genet Cell Biol & Anat, Omaha, NE 68105 USA
[2] Univ Missouri, Bond Life Sci Ctr, Columbia, MO 65211 USA
关键词
COVID-19; severity prediction; machine learning; feature selection; PROTEINS;
D O I
10.3390/diagnostics14121284
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The premise for this study emanated from the need to understand SARS-CoV-2 infections at the molecular level and to develop predictive tools for managing COVID-19 severity. With the varied clinical outcomes observed among infected individuals, creating a reliable machine learning (ML) model for predicting the severity of COVID-19 became paramount. Despite the availability of large-scale genomic and clinical data, previous studies have not effectively utilized multi-modality data for disease severity prediction using data-driven approaches. Our primary goal is to predict COVID-19 severity using a machine-learning model trained on a combination of patients' gene expression, clinical features, and co-morbidity data. Employing various ML algorithms, including Logistic Regression (LR), XGBoost (XG), Na & iuml;ve Bayes (NB), and Support Vector Machine (SVM), alongside feature selection methods, we sought to identify the best-performing model for disease severity prediction. The results highlighted XG as the superior classifier, with 95% accuracy and a 0.99 AUC (Area Under the Curve), for distinguishing severity groups. Additionally, the SHAP analysis revealed vital features contributing to prediction, including several genes such as COX14, LAMB2, DOLK, SDCBP2, RHBDL1, and IER3-AS1. Notably, two clinical features, the absolute neutrophil count and Viremia Categories, emerged as top contributors. Integrating multiple data modalities has significantly improved the accuracy of disease severity prediction compared to using any single modality. The identified features could serve as biomarkers for COVID-19 prognosis and patient care, allowing clinicians to optimize treatment strategies and refine clinical decision-making processes for enhanced patient outcomes.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] COVID-19 Data Analysis and Appropriate Vaccine Prediction using Machine Learning
    Ullah, Md. Oli
    Nobel, S. M. Nuruzzaman
    2022 IEEE 13TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2022, : 496 - 504
  • [42] Machine and Deep Learning Algorithms for COVID-19 Mortality Prediction Using Clinical and Radiomic Features
    Verzellesi, Laura
    Botti, Andrea
    Bertolini, Marco
    Trojani, Valeria
    Carlini, Gianluca
    Nitrosi, Andrea
    Monelli, Filippo
    Besutti, Giulia
    Castellani, Gastone
    Remondini, Daniel
    Milanese, Gianluca
    Croci, Stefania
    Sverzellati, Nicola
    Salvarani, Carlo
    Iori, Mauro
    ELECTRONICS, 2023, 12 (18)
  • [43] Building Machine Learning Models on Limited Transcriptomic RNA-seq Data
    Jin Shuai
    Li Yaoyu
    Peng Jiawu
    2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 358 - 363
  • [44] Rapid Thrombogenesis Prediction in Covid-19 Patients Using Machine Learning
    Lee, Joong-Lyul
    Alwajidi, Safaa
    Tree, Mike
    Cristobal, Angelo
    Zhao, Haitao
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT II, 2023, 676 : 373 - 384
  • [45] Prediction of COVID-19 Patient using Supervised Machine Learning Algorithm
    Buvana, M.
    Muthumayil, K.
    SAINS MALAYSIANA, 2021, 50 (08): : 2479 - 2497
  • [46] Automatic COVID-19 prediction using explainable machine learning techniques
    Solayman S.
    Aumi S.A.
    Mery C.S.
    Mubassir M.
    Khan R.
    International Journal of Cognitive Computing in Engineering, 2023, 4 : 36 - 46
  • [47] Prediction of COVID-19 Severity Using Chest Computed Tomography and Laboratory Measurements: Evaluation Using a Machine Learning Approach
    Li, Daowei
    Zhang, Qiang
    Tan, Yue
    Feng, Xinghuo
    Yue, Yuanyi
    Bai, Yuhan
    Li, Jimeng
    Li, Jiahang
    Xu, Youjun
    Chen, Shiyu
    Xiao, Si-Yu
    Sun, Muyan
    Li, Xiaona
    Zhu, Fang
    JMIR MEDICAL INFORMATICS, 2020, 8 (11)
  • [48] Machine Learning Techniques for Extracting Relevant Features from Clinical Data for COVID-19 Mortality Prediction
    Fraccaroli, Michele
    Mazzuchelli, Giulia
    Bizzarri, Alice
    26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,
  • [49] COVID-19: Symptoms Clustering and Severity Classification Using Machine Learning Approach
    Mohamand, Nurul Fathia
    Sylvestro, Herold
    Ahmad, Norulhusna
    Annanurov, Bayram
    Mohd Noor, Norliza
    INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2023, 15 (03): : 1 - 14
  • [50] Comprehensive analysis of clinical data for COVID-19 outcome estimation with machine learning models
    Moris, Daniel I.
    de Moura, Joaquim
    Marcos, Pedro J.
    Rey, Enrique Miguez
    Novo, Jorge
    Ortega, Marcos
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84