A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data

被引:1
|
作者
Sethi, Sahil [1 ]
Shakyawar, Sushil [1 ]
Reddy, Athreya S. [2 ]
Patel, Jai Chand [1 ]
Guda, Chittibabu [1 ]
机构
[1] Univ Nebraska Med Ctr, Dept Genet Cell Biol & Anat, Omaha, NE 68105 USA
[2] Univ Missouri, Bond Life Sci Ctr, Columbia, MO 65211 USA
关键词
COVID-19; severity prediction; machine learning; feature selection; PROTEINS;
D O I
10.3390/diagnostics14121284
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The premise for this study emanated from the need to understand SARS-CoV-2 infections at the molecular level and to develop predictive tools for managing COVID-19 severity. With the varied clinical outcomes observed among infected individuals, creating a reliable machine learning (ML) model for predicting the severity of COVID-19 became paramount. Despite the availability of large-scale genomic and clinical data, previous studies have not effectively utilized multi-modality data for disease severity prediction using data-driven approaches. Our primary goal is to predict COVID-19 severity using a machine-learning model trained on a combination of patients' gene expression, clinical features, and co-morbidity data. Employing various ML algorithms, including Logistic Regression (LR), XGBoost (XG), Na & iuml;ve Bayes (NB), and Support Vector Machine (SVM), alongside feature selection methods, we sought to identify the best-performing model for disease severity prediction. The results highlighted XG as the superior classifier, with 95% accuracy and a 0.99 AUC (Area Under the Curve), for distinguishing severity groups. Additionally, the SHAP analysis revealed vital features contributing to prediction, including several genes such as COX14, LAMB2, DOLK, SDCBP2, RHBDL1, and IER3-AS1. Notably, two clinical features, the absolute neutrophil count and Viremia Categories, emerged as top contributors. Integrating multiple data modalities has significantly improved the accuracy of disease severity prediction compared to using any single modality. The identified features could serve as biomarkers for COVID-19 prognosis and patient care, allowing clinicians to optimize treatment strategies and refine clinical decision-making processes for enhanced patient outcomes.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Predicting COVID-19 Severity Integrating RNA-Seq Data Using Machine Learning Techniques
    Bajo-Morales, Javier
    Castillo-Secilla, Daniel
    Herrera, Luis Javier
    Caba, Octavio
    Prados, Jose Carlos
    Rojas, Ignacio
    CURRENT BIOINFORMATICS, 2023, 18 (03) : 221 - 231
  • [2] COVID-19 Prediction model using Machine Learning
    Jadi, Amr
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (08): : 247 - 253
  • [3] Exploring Prediction of COVID-19 and its Severity using Machine Learning
    Asif, Sumaira
    Saba, Tanzila
    Alghanim, Amerah
    2022 FIFTH INTERNATIONAL CONFERENCE OF WOMEN IN DATA SCIENCE AT PRINCE SULTAN UNIVERSITY (WIDS-PSU 2022), 2022, : 117 - 122
  • [4] Machine learning approaches in Covid-19 severity risk prediction in Morocco
    Laatifi, Mariam
    Douzi, Samira
    Bouklouz, Abdelaziz
    Ezzine, Hind
    Jaafari, Jaafar
    Zaid, Younes
    El Ouahidi, Bouabid
    Naciri, Mariam
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [5] Machine learning approaches in Covid-19 severity risk prediction in Morocco
    Mariam Laatifi
    Samira Douzi
    Abdelaziz Bouklouz
    Hind Ezzine
    Jaafar Jaafari
    Younes Zaid
    Bouabid El Ouahidi
    Mariam Naciri
    Journal of Big Data, 9
  • [6] Applying Different Machine Learning Techniques for Prediction of COVID-19 Severity
    Sayed, Safynaz Abdel-Fattah
    Elkorany, Abeer Mohamed
    Mohammad, Sabah Sayed
    IEEE ACCESS, 2021, 9 : 135697 - 135707
  • [7] Machine Learning Assisted Prediction of Prognostic Biomarkers Associated With COVID-19, Using Clinical and Proteomics Data
    Sardar, Rahila
    Sharma, Arun
    Gupta, Dinesh
    FRONTIERS IN GENETICS, 2021, 12
  • [8] COVID-19 Severity Prediction Using Combined Machine Learning and Transfer Learning Approaches
    Rambola, Ame Rayan
    Andavar, Suruliandi
    Raj, Raja Soosaimarian Peter
    BRAZILIAN ARCHIVES OF BIOLOGY AND TECHNOLOGY, 2024, 67
  • [9] Prediction of Severity of COVID-19-Infected Patients Using Machine Learning Techniques
    Alotaibi, Aziz
    Shiblee, Mohammad
    Alshahrani, Adel
    COMPUTERS, 2021, 10 (03)
  • [10] Machine Learning-Based Prediction of COVID-19 Severity and Progression to Critical Illness Using CT Imaging and Clinical Data
    Purkayastha, Subhanik
    Xiao, Yanhe
    Jiao, Zhicheng
    Thepumnoeysuk, Rujapa
    Halsey, Kasey
    Wu, Jing
    Thi My Linh Tran
    Ben Hsieh
    Choi, Ji Whae
    Wang, Dongcui
    Vallieres, Martin
    Wang, Robin
    Collins, Scott
    Feng, Xue
    Feldman, Michael
    Zhang, Paul J.
    Atalay, Michael
    Sebro, Ronnie
    Yang, Li
    Fan, Yong
    Liao, Wei-hua
    Bai, Harrison X.
    KOREAN JOURNAL OF RADIOLOGY, 2021, 22 (07) : 1213 - 1224