EnAMP: A novel deep learning ensemble antibacterial peptide recognition algorithm based on multi-features

Cited by: 4
Authors
Zhuang, Jujuan [1]
Gao, Wanquan [1]
Su, Rui [1]
Affiliations
[1] Dalian Maritime University, School of Science, Dalian, Liaoning, People's Republic of China
Keywords
Antimicrobial peptides prediction; word embedding; deep learning; machine learning; ensemble learning; antimicrobial peptides; prediction
DOI
10.1142/S021972002450001X
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Antimicrobial peptides (AMPs), as preferred alternatives to antibiotics, have wide application and good prospects. Identifying AMPs through wet lab experiments remains expensive, time-consuming and challenging. Many machine learning methods have been proposed to predict AMPs and have achieved good results. In this work, we combine two kinds of word embedding features with statistical features of peptide sequences to develop an ensemble classifier named EnAMP. Two deep neural networks are trained on the Word2vec and GloVe word embedding features of peptide sequences, respectively, while random forest and support vector machine classifiers are trained on statistical features of the peptide sequences. The final prediction is the average of the four classifiers' outputs. Compared with other state-of-the-art algorithms on six datasets, EnAMP outperforms most existing models with similar computational costs. Even when compared with computationally expensive algorithms based on Bidirectional Encoder Representations from Transformers (BERT), its performance is comparable. The EnAMP source code and data are available at https://github.com/ruisue/EnAMP.
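The averaging step described in the abstract can be illustrated with a minimal sketch. It is not taken from the EnAMP repository. The function name `ensemble_average`, the 0.5 decision threshold, and the example probabilities are all assumptions made for illustration. The abstract states only that the final result is the average of the four classifiers.

```python
from statistics import mean
from typing import Dict, Tuple


def ensemble_average(probabilities: Dict[str, float],
                     threshold: float = 0.5) -> Tuple[float, bool]:
    """Average per-classifier AMP probabilities for one peptide.

    `threshold` is an assumed cutoff; the abstract does not specify one.
    """
    if not probabilities:
        raise ValueError("need at least one classifier output")
    score = mean(probabilities.values())
    return score, score >= threshold


# Hypothetical outputs for a single peptide (illustrative values only).
outputs = {
    "dnn_word2vec": 0.9,   # deep network on Word2vec embeddings
    "dnn_glove": 0.7,      # deep network on GloVe embeddings
    "random_forest": 0.6,  # random forest on statistical features
    "svm": 0.4,            # SVM on statistical features
}
score, is_amp = ensemble_average(outputs)
print(f"ensemble score = {score:.2f}, predicted AMP = {is_amp}")
```

In this sketch each classifier gets equal weight, so one dissenting model (here the SVM) lowers the score without overturning the decision.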
Pages: 16