A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data

被引:1
|
作者
Sethi, Sahil [1 ]
Shakyawar, Sushil [1 ]
Reddy, Athreya S. [2 ]
Patel, Jai Chand [1 ]
Guda, Chittibabu [1 ]
机构
[1] Univ Nebraska Med Ctr, Dept Genet Cell Biol & Anat, Omaha, NE 68105 USA
[2] Univ Missouri, Bond Life Sci Ctr, Columbia, MO 65211 USA
关键词
COVID-19; severity prediction; machine learning; feature selection; PROTEINS;
D O I
10.3390/diagnostics14121284
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The premise for this study emanated from the need to understand SARS-CoV-2 infections at the molecular level and to develop predictive tools for managing COVID-19 severity. With the varied clinical outcomes observed among infected individuals, creating a reliable machine learning (ML) model for predicting the severity of COVID-19 became paramount. Despite the availability of large-scale genomic and clinical data, previous studies have not effectively utilized multi-modality data for disease severity prediction using data-driven approaches. Our primary goal is to predict COVID-19 severity using a machine-learning model trained on a combination of patients' gene expression, clinical features, and co-morbidity data. Employing various ML algorithms, including Logistic Regression (LR), XGBoost (XG), Na & iuml;ve Bayes (NB), and Support Vector Machine (SVM), alongside feature selection methods, we sought to identify the best-performing model for disease severity prediction. The results highlighted XG as the superior classifier, with 95% accuracy and a 0.99 AUC (Area Under the Curve), for distinguishing severity groups. Additionally, the SHAP analysis revealed vital features contributing to prediction, including several genes such as COX14, LAMB2, DOLK, SDCBP2, RHBDL1, and IER3-AS1. Notably, two clinical features, the absolute neutrophil count and Viremia Categories, emerged as top contributors. Integrating multiple data modalities has significantly improved the accuracy of disease severity prediction compared to using any single modality. The identified features could serve as biomarkers for COVID-19 prognosis and patient care, allowing clinicians to optimize treatment strategies and refine clinical decision-making processes for enhanced patient outcomes.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] COVID-19 Mortality Prediction Using Machine Learning Techniques
    Schirato, Lindsay
    Makina, Kennedy
    Flanders, Dwayne
    Pouriyeh, Seyedamin
    Shahriar, Hossain
    2021 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (ICDH 2021), 2021, : 197 - 202
  • [32] COVID-19 Outbreak Prediction by Using Machine Learning Algorithms
    Sher, Tahir
    Rehman, Abdul
    Kim, Dongsun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1561 - 1574
  • [33] Covid-19 Mortality Risk Prediction Model Using Machine Learning
    Sanchez-Galvez, Alba Maribel
    Sanchez-Galvez, Sully
    Alvarez-Gonzalez, Ricardo
    Rojas-Alarcon, Frida
    COMPUTACION Y SISTEMAS, 2023, 27 (04): : 881 - 888
  • [34] Severity prediction in COVID-19 patients using clinical markers and explainable artificial intelligence: A stacked ensemble machine learning approach
    Chadaga, Krishnaraj
    Prabhu, Srikanth
    Sampathila, Niranjana
    Chadaga, Rajagopala
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (04): : 959 - 982
  • [35] Mortality Prediction Utilizing Blood Biomarkers to Predict the Severity of COVID-19 Using Machine Learning Technique
    Rahman, Tawsifur
    Al-Ishaq, Fajer A.
    Al-Mohannadi, Fatima S.
    Mubarak, Reem S.
    Al-Hitmi, Maryam H.
    Islam, Khandaker Reajul
    Khandakar, Amith
    Hssain, Ali Ait
    Al-Madeed, Somaya
    Zughaier, Susu M.
    Chowdhury, Muhammad E. H.
    DIAGNOSTICS, 2021, 11 (09)
  • [36] Machine learning based approaches for detecting COVID-19 using clinical text data
    Khanday A.M.U.D.
    Rabani S.T.
    Khan Q.R.
    Rouf N.
    Mohi Ud Din M.
    International Journal of Information Technology, 2020, 12 (3) : 731 - 739
  • [37] Epidemic Prediction using Machine Learning and Deep Learning Models on COVID-19 Data
    Mohanraj, G.
    Mohanraj, V
    Marimuthu, M.
    Sathiyamoorthi, V
    Luhach, Ashish Kr
    Kumar, Sandeep
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (03) : 377 - 393
  • [38] COVID-19 Outbreak Prediction with Machine Learning
    Ardabili, Sina F.
    Mosavi, Amir
    Ghamisi, Pedram
    Ferdinand, Filip
    Varkonyi-Koczy, Annamaria R.
    Reuter, Uwe
    Rabczuk, Timon
    Atkinson, Peter M.
    ALGORITHMS, 2020, 13 (10)
  • [39] Analysis and Prediction of COVID-19 using Machine Learning
    Parthiban, M.
    Alphy, Anna
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [40] Microbiome characteristics description of COVID-19 patients based on bulk RNA-seq and scRNA-Seq data
    Zhang, Sainan
    Liu, Xingwang
    Zhao, Yue
    Wang, Ping
    Yu, Rui
    Xu, Peigang
    Jiang, Yue
    Cheng, Liang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165