StaBle-ABPpred: a stacked ensemble predictor based on biLSTM and attention mechanism for accelerated discovery of antibacterial peptides

被引:44
作者
Singh, Vishakha [1 ]
Shrivastava, Sameer [2 ]
Singh, Sanjay Kumar [1 ]
Kumar, Abhinav [1 ,3 ]
Saxena, Sonal [4 ]
机构
[1] IIT BHU, Dept Comp Sci & Engn, Varanasi, Uttar Pradesh, India
[2] IVRI, Div Vet Biotechnol, Izatnagar, Uttar Pradesh, India
[3] IIT BHU, Varanasi, Uttar Pradesh, India
[4] ICAR IVRI, Div Vet Biotechnol, Izatnagar, Uttar Pradesh, India
关键词
Antimicrobial peptides; bacteriophage; stacked ensemble; deep learning; LSTM; T12; phage; AMR; ANTIMICROBIAL PEPTIDES; TOOL; CLASSIFICATION; DATABASE; PROTEIN; SET;
D O I
10.1093/bib/bbab439
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Due to the rapid emergence of multi-drug resistant (MDR) bacteria, existing antibiotics are becoming ineffective. So, researchers are looking for alternatives in the form of antibacterial peptides (ABPs) based medicines. The discovery of novel ABPs using wet-lab experiments is time-consuming and expensive. Many machine learning models have been proposed to search for new ABPs, but there is still scope to develop a robust model that has high accuracy and precision. In this work, we present StaBle-ABPpred, a stacked ensemble technique-based deep learning classifier that uses bidirectional long-short term memory (biLSTM) and attention mechanism at base-level and an ensemble of random forest, gradient boosting and logistic regression at meta-level to classify peptides as antibacterial or otherwise. The performance of our model has been compared with several state-of-the-art classifiers, and results were subjected to analysis of variance (ANOVA) test and its post hoc analysis, which proves that our model performs better than existing classifiers. Furthermore, a web app has been developed and deployed at https://stable-abppred.anvil.app to identify novel ABPs in protein sequences. Using this app, we identified novel ABPs in all the proteins of the Streptococcus phage T12 genome. These ABPs have shown amino acid similarities with experimentally tested antimicrobial peptides (AMPs) of other organisms. Hence, they could be chemically synthesized and experimentally validated for their activity against different bacteria. The model and app developed in this work can be further utilized to explore the protein diversity for identifying novel ABPs with broad-spectrum activity, especially against MDR bacterial pathogens.
引用
收藏
页数:17
相关论文
共 60 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Automatic construction of molecular similarity networks for visual graph mining in chemical space of bioactive peptides: an unsupervised learning approach [J].
Aguilera-Mendoza, Longendri ;
Marrero-Ponce, Yovani ;
Garcia-Jacas, Cesar R. ;
Chavez, Edgar ;
Beltran, Jesus A. ;
Guillen-Ramirez, Hugo A. ;
Brizuela, Carlos A. .
SCIENTIFIC REPORTS, 2020, 10 (01)
[3]   Graph-based data integration from bioactive peptide databases of pharmaceutical interest: toward an organized collection enabling visual network analysis [J].
Aguilera-Mendoza, Longendri ;
Marrero-Ponce, Yovani ;
Beltran, Jesus A. ;
Ibarra, Roberto Tellez ;
Guillen-Ramirez, Hugo A. ;
Brizuela, Carlos A. .
BIOINFORMATICS, 2019, 35 (22) :4739-4747
[4]   Overlap and diversity in antimicrobial peptide databases: compiling a non-redundant set of sequences [J].
Aguilera-Mendoza, Longendri ;
Marrero-Ponce, Yovani ;
Tellez-Ibarra, Roberto ;
Llorente-Quesada, Monica T. ;
Salgado, Jesus ;
Barigye, Stephen J. ;
Liu, Jun .
BIOINFORMATICS, 2015, 31 (15) :2553-2559
[5]   BIPEP: Sequence-based Prediction of Biofilm Inhibitory Peptides Using a Combination of NMR and Physicochemical Descriptors [J].
Atanaki, Fereshteh Fallah ;
Behrouzi, Saman ;
Ariaeenejad, Shohreh ;
Boroomand, Amin ;
Kavousi, Kaveh .
ACS OMEGA, 2020, 5 (13) :7290-7297
[6]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[7]   UniProt: a worldwide hub of protein knowledge [J].
Bateman, Alex ;
Martin, Maria-Jesus ;
Orchard, Sandra ;
Magrane, Michele ;
Alpi, Emanuele ;
Bely, Benoit ;
Bingley, Mark ;
Britto, Ramona ;
Bursteinas, Borisas ;
Busiello, Gianluca ;
Bye-A-Jee, Hema ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzales, Daniel ;
Gonzales, Leonardo ;
Hatton-Ellis, Emma ;
Ignatchenko, Alexandr ;
Ishtiaq, Rizwan ;
Jokinen, Petteri ;
Joshi, Vishal ;
Jyothi, Dushyanth ;
Lopez, Rodrigo ;
Luo, Jie ;
Lussi, Yvonne ;
MacDougall, Alistair ;
Madeira, Fabio ;
Mahmoudy, Mahdi ;
Menchi, Manuela ;
Nightingale, Andrew ;
Onwubiko, Joseph ;
Palka, Barbara ;
Pichler, Klemens ;
Pundir, Sangya ;
Qi, Guoying ;
Raj, Shriya ;
Renaux, Alexandre ;
Lopez, Milagros Rodriguez ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Speretta, Elena ;
Turner, Edward ;
Tyagi, Nidhi ;
Vasudev, Preethi ;
Volynkin, Vladimir ;
Wardell, Tony .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D506-D515
[8]   AmPEP: Sequence-based prediction of antimicrobial peptides using distribution patterns of amino acid properties and random forest [J].
Bhadra, Pratiti ;
Yan, Jielu ;
Li, Jinyan ;
Fong, Simon ;
Siu, Shirley W. I. .
SCIENTIFIC REPORTS, 2018, 8
[9]   A Novel Multiobjective GDWCN-PSO Algorithm and Its Application to Medical Data Security [J].
Bharti, Vandana ;
Biswas, Bhaskar ;
Shukla, Kaushal Kumar .
ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2021, 21 (02)
[10]  
Bharti V, 2020, PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, P294, DOI [10.1109/Confluence47617.2020.9057841, 10.1109/confluence47617.2020.9057841]