Vaxi-DL: A web-based deep learning server to identify potential vaccine candidates

被引:10
作者
Rawal, Kamal [1 ]
Sinha, Robin [1 ]
Nath, Swarsat Kaushik [1 ]
Preeti, P. [1 ]
Kumari, Priya [1 ]
Gupta, Srijanee [1 ]
Sharma, Trapti [1 ]
Strych, Ulrich [2 ,3 ]
Hotez, Peter [2 ,3 ,4 ]
Bottazzi, Maria Elena [2 ,3 ,4 ]
机构
[1] Amity Univ, Amity Inst Biotechnol, Noida, Uttar Pradesh, India
[2] Baylor Coll Med, Natl Sch Trop Med, Texas Childrens Ctr Vaccine Dev, Dept Pediat, Houston, TX 77030 USA
[3] Baylor Coll Med, Natl Sch Trop Med, Texas Childrens Ctr Vaccine Dev, Dept Mol Virol & Microbiol, Houston, TX 77030 USA
[4] Baylor Univ, Dept Biol, Waco, TX 76798 USA
关键词
Vaccine; Deep learning; In silico vaccine development; Antigen prediction; Machine learning; Artificial intelligence; Vaxi-DL server; Vaccine design; COVID-19; SARS-CoV-2; Coronavirus; mRNA vaccines; REVERSE VACCINOLOGY; PREDICTION; AAINDEX; IDENTIFICATION; INFORMATION;
D O I
10.1016/j.compbiomed.2022.105401
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The development of a new vaccine is a challenging exercise involving several steps including computational studies, experimental work, and animal studies followed by clinical studies. To accelerate the process, in silico screening is frequently used for antigen identification. Here, we present Vaxi-DL, web-based deep learning (DL) software that evaluates the potential of protein sequences to serve as vaccine target antigens. Four different DL pathogen models were trained to predict target antigens in bacteria, protozoa, fungi, and viruses that cause infectious diseases in humans. Datasets containing antigenic and non-antigenic sequences were derived from known vaccine candidates and the Protegen database. Biological and physicochemical properties were computed for the datasets using publicly available bioinformatics tools. For each of the four pathogen models, the datasets were divided into training, validation, and testing subsets and then scaled and normalised. The models were constructed using Fully Connected Layers (FCLs), hyper-tuned, and trained using the training subset. Accuracy, sensitivity, specificity, precision, recall, and AUC (Area under the Curve) were used as metrics to assess the performance of these models. The models were benchmarked using independent datasets of known target antigens against other prediction tools such as VaxiJen and Vaxign-ML. We also tested Vaxi-DL on 219 known potential vaccine candidates (PVC) from 37 different pathogens. Our tool predicted 175 PVCs correctly out of 219 sequences. We also tested Vaxi-DL on different datasets obtained from multiple resources. Our tool has demonstrated an average sensitivity of 93% and will thus be a useful tool for prioritising PVCs for preclinical studies.
引用
收藏
页数:67
相关论文
共 47 条
[1]  
[Anonymous], 2022, JOHNS HOPK U MED COR
[2]  
[Anonymous], 2020, WHO TOP 10 CAUS DEAT
[3]   Prediction of new vaccine targets in the core genome of Corynebacterium pseudotuberculosis through omits approaches and reverse vaccinology [J].
Araujo, Carlos Leonardo ;
Alves, Jorianne ;
Nogueira, Wylerson ;
Pereira, Lino Cesar ;
Gomide, Anne Cybelle ;
Ramos, Rommel ;
Azevedo, Vasco ;
Silva, Artur ;
Folador, Adriana .
GENE, 2019, 702 :36-45
[4]   UniProt: a hub for protein information [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Apweiler, Rolf ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Cas-tro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightin-gale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Cowley, Andrew ;
Figueira, Luis ;
Li, Weizhong ;
McWilliam, Hamish .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D204-D212
[5]   Improving reverse vaccinology with a machine learning approach [J].
Bowman, Brett N. ;
McAdam, Paul R. ;
Vivona, Sandro ;
Zhang, Jin X. ;
Luong, Tiffany ;
Belew, Richard K. ;
Sahota, Harpal ;
Guiney, Donald ;
Valafar, Faramarz ;
Fierer, Joshua ;
Woelk, Christopher H. .
VACCINE, 2011, 29 (45) :8156-8164
[6]   Supervised Learning for Detection of Duplicates in Genomic Sequence Databases [J].
Chen, Qingyu ;
Zobel, Justin ;
Zhang, Xiuzhen ;
Verspoor, Karin .
PLOS ONE, 2016, 11 (08)
[7]   Prediction of protein subcellular locations by incorporating quasi-sequence-order effect [J].
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2000, 278 (02) :477-483
[8]   Prediction of protein cellular attributes using pseudo-amino acid composition [J].
Chou, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) :246-255
[9]   Comparison of Open-Source Reverse Vaccinology Programs for Bacterial Vaccine Antigen Discovery [J].
Dalsass, Mattia ;
Brozzi, Alessandro ;
Medini, Duccio ;
Rappuoli, Rino .
FRONTIERS IN IMMUNOLOGY, 2019, 10
[10]   Novel in silico tools for designing peptide-based subunit vaccines and immunotherapeutics [J].
Dhanda, Sandeep Kumar ;
Usmani, Salman Sadullah ;
Agrawal, Piyush ;
Nagpal, Gandharva ;
Gautam, Ankur ;
Raghava, Gajendra P. S. .
BRIEFINGS IN BIOINFORMATICS, 2017, 18 (03) :467-478