A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data

被引:1
|
作者
Sethi, Sahil [1 ]
Shakyawar, Sushil [1 ]
Reddy, Athreya S. [2 ]
Patel, Jai Chand [1 ]
Guda, Chittibabu [1 ]
机构
[1] Univ Nebraska Med Ctr, Dept Genet Cell Biol & Anat, Omaha, NE 68105 USA
[2] Univ Missouri, Bond Life Sci Ctr, Columbia, MO 65211 USA
关键词
COVID-19; severity prediction; machine learning; feature selection; PROTEINS;
D O I
10.3390/diagnostics14121284
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The premise for this study emanated from the need to understand SARS-CoV-2 infections at the molecular level and to develop predictive tools for managing COVID-19 severity. With the varied clinical outcomes observed among infected individuals, creating a reliable machine learning (ML) model for predicting the severity of COVID-19 became paramount. Despite the availability of large-scale genomic and clinical data, previous studies have not effectively utilized multi-modality data for disease severity prediction using data-driven approaches. Our primary goal is to predict COVID-19 severity using a machine-learning model trained on a combination of patients' gene expression, clinical features, and co-morbidity data. Employing various ML algorithms, including Logistic Regression (LR), XGBoost (XG), Na & iuml;ve Bayes (NB), and Support Vector Machine (SVM), alongside feature selection methods, we sought to identify the best-performing model for disease severity prediction. The results highlighted XG as the superior classifier, with 95% accuracy and a 0.99 AUC (Area Under the Curve), for distinguishing severity groups. Additionally, the SHAP analysis revealed vital features contributing to prediction, including several genes such as COX14, LAMB2, DOLK, SDCBP2, RHBDL1, and IER3-AS1. Notably, two clinical features, the absolute neutrophil count and Viremia Categories, emerged as top contributors. Integrating multiple data modalities has significantly improved the accuracy of disease severity prediction compared to using any single modality. The identified features could serve as biomarkers for COVID-19 prognosis and patient care, allowing clinicians to optimize treatment strategies and refine clinical decision-making processes for enhanced patient outcomes.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Machine learning model for predicting Major Depressive Disorder using RNA-Seq data: optimization of classification approach
    Pragya Verma
    Madhvi Shakya
    Cognitive Neurodynamics, 2022, 16 : 443 - 453
  • [22] Cancer, more than a "COVID-19 co-morbidity"
    Jani, Chinmay T.
    Schooley, Robert T.
    Mckay, Rana R.
    Lippman, Scott M.
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [23] Dynamics of novel COVID-19 in the presence of Co-morbidity
    Saha, Amit Kumar
    Podder, Chandra Nath
    Niger, Ashrafi Meher
    INFECTIOUS DISEASE MODELLING, 2022, 7 (02) : 138 - 160
  • [24] Efficient analysis of COVID-19 clinical data using machine learning models
    Sarwan Ali
    Yijing Zhou
    Murray Patterson
    Medical & Biological Engineering & Computing, 2022, 60 : 1881 - 1896
  • [25] Machine Learning Analysis of RNA-seq Data for Diagnostic and Prognostic Prediction of Colon Cancer
    Bostanci, Erkan
    Kocak, Engin
    Unal, Metehan
    Guzel, Mehmet Serdar
    Acici, Koray
    Asuroglu, Tunc
    SENSORS, 2023, 23 (06)
  • [26] Efficient analysis of COVID-19 clinical data using machine learning models
    Ali, Sarwan
    Zhou, Yijing
    Patterson, Murray
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (07) : 1881 - 1896
  • [27] Machine Learning Applied to Clinical Laboratory Data in Spain for COVID-19 Outcome Prediction: Model Development and Validation
    Dominguez-Olmedo, Juan L.
    Gragera-Martinez, Alvaro
    Mata, Jacinto
    Pachon Alvarez, Victoria
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (04)
  • [28] Discovering common pathogenic processes between COVID-19 and HFRS by integrating RNA-seq differential expression analysis with machine learning
    Noor, Fatima
    Ashfaq, Usman Ali
    Bakar, Abu
    ul Haq, Waqar
    Allemailem, Khaled S. S.
    Alharbi, Basmah F. F.
    Al-Megrin, Wafa Abdullah I.
    ul Qamar, Muhammad Tahir
    FRONTIERS IN MICROBIOLOGY, 2023, 14
  • [29] Machine Learning-Based Prediction of COVID-19 Prognosis Using Clinical and Hematologic Data
    Kamel, Fatemah O.
    Magadmi, Rania
    Qutub, Sulafah
    Badawi, Maha
    Badawi, Mazen
    Madani, Tariq A.
    Alhothali, Areej
    Abozinadah, Ehab A.
    Bakhshwin, Duaa M.
    Jamal, Maha H.
    Burzangi, Abdulhadi S.
    Bazuhair, Mohammed
    Alqutub, Hussamaldin
    Alqutub, Abdulaziz
    Felemban, Sameera M.
    Al-Sayes, Fatin
    Adam, Soheir
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (12)
  • [30] Clinical Decision Making and Outcome Prediction for COVID-19 Patients Using Machine Learning
    Maria, Adamopoulou
    Dimitrios, Velissaris
    Ioanna, Michou
    Charalampos, Matzaroglou
    Gerasimos, Messaris
    Constantinos, Koutsojannis
    PERVASIVE COMPUTING TECHNOLOGIES FOR HEALTHCARE, PERVASIVE HEALTH 2021, 2022, 431 : 3 - 14