Hybrid Predictive Machine Learning Model for the Prediction of Immunodominant Peptides of Respiratory Syncytial Virus

被引:0
作者
Bukhari, Syed Nisar Hussain [1 ]
Ogudo, Kingsley A. [2 ]
机构
[1] Govt India, Minist Elect & Informat Technol MeitY, Natl Inst Elect & Informat Technol NIELIT, Srinagar 191132, India
[2] Univ Johannesburg, Fac Engn & Built Environm, Dept Elect & Elect Engn, Johannesburg, South Africa
来源
BIOENGINEERING-BASEL | 2024年 / 11卷 / 08期
关键词
respiratory syncytial virus; immunodominant peptides; T-cell epitope; peptide-based vaccine; hybrid; predictive model; machine learning;
D O I
10.3390/bioengineering11080791
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Respiratory syncytial virus (RSV) is a common respiratory pathogen that infects the human lungs and respiratory tract, often causing symptoms similar to the common cold. Vaccination is the most effective strategy for managing viral outbreaks. Currently, extensive efforts are focused on developing a vaccine for RSV. Traditional vaccine design typically involves using an attenuated form of the pathogen to elicit an immune response. In contrast, peptide-based vaccines (PBVs) aim to identify and chemically synthesize specific immunodominant peptides (IPs), known as T-cell epitopes (TCEs), to induce a targeted immune response. Despite their potential for enhancing vaccine safety and immunogenicity, PBVs have received comparatively less attention. Identifying IPs for PBV design through conventional wet-lab experiments is challenging, costly, and time-consuming. Machine learning (ML) techniques offer a promising alternative, accurately predicting TCEs and significantly reducing the time and cost of vaccine development. This study proposes the development and evaluation of eight hybrid ML predictive models created through the permutations and combinations of two classification methods, two feature weighting techniques, and two feature selection algorithms, all aimed at predicting the TCEs of RSV. The models were trained using the experimentally determined TCEs and non-TCE sequences acquired from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) repository. The hybrid model composed of the XGBoost (XGB) classifier, chi-squared (ChST) weighting technique, and backward search (BST) as the optimal feature selection algorithm (ChST-BST-XGB) was identified as the best model, achieving an accuracy, sensitivity, specificity, F1 score, AUC, precision, and MCC of 97.10%, 0.98, 0.97, 0.98, 0.99, 0.99, and 0.96, respectively. Additionally, K-fold cross-validation (KFCV) was performed to ensure the model's reliability and an average accuracy of 97.21% was recorded for the ChST-BST-XGB model. The results indicate that the hybrid XGBoost model consistently outperforms other hybrid approaches. The epitopes predicted by the proposed model may serve as promising vaccine candidates for RSV, subject to in vitro and in vivo scientific assessments. This model can assist the scientific community in expediting the screening of active TCE candidates for RSV, ultimately saving time and resources in vaccine development.
引用
收藏
页数:16
相关论文
共 72 条
  • [1] Adiga Rama, 2021, Avicenna Journal of Medical Biotechnology, V13, P87, DOI 10.18502/ajmb.v13i2.5527
  • [2] Alpaydin E., 2021, Machine learning
  • [3] Immunoinformatics aided approach for predicting potent cytotoxic T cell epitopes of respiratory syncytial virus
    Anandhan, Gayathri
    Narkhede, Yogesh B.
    Mohan, Manikandan
    Paramasivam, Premasudha
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2023, 41 (21) : 12093 - 12105
  • [4] BACHI T, 1973, J VIROL, V12, P1173
  • [5] Respiratory syncytial virus entry and how to block it
    Battles, Michael B.
    McLellan, Jason S.
    [J]. NATURE REVIEWS MICROBIOLOGY, 2019, 17 (04) : 233 - 245
  • [6] Berger C.M., 2004, Handbook of Cancer Vaccines. Cancer Drug Discovery and Development, DOI [10.1007/978-1-59259-680-510, DOI 10.1007/978-1-59259-680-510]
  • [7] Prediction of CTL epitopes using QM, SVM and ANN techniques
    Bhasin, M
    Raghava, GPS
    [J]. VACCINE, 2004, 22 (23-24) : 3195 - 3204
  • [8] The use of the area under the roc curve in the evaluation of machine learning algorithms
    Bradley, AP
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1145 - 1159
  • [9] Development and use of machine learning algorithms in vaccine target selection
    Bravi, Barbara
    [J]. NPJ VACCINES, 2024, 9 (01)
  • [10] Bukhari S.N.H., 2021, Lecture Notes on Data Engineering and Communications Technologies, VVolume 91, P275, DOI [10.1007/978-981-16-6285-023, DOI 10.1007/978-981-16-6285-023]