iAMP-Attenpred: a novel antimicrobial peptide predictor based on BERT feature extraction method and CNN-BiLSTM-Attention combination model

被引:17
|
作者
Xing, Wenxuan [2 ]
Zhang, Jie [3 ]
Li, Chen [4 ]
Huo, Yujia [3 ]
Dong, Gaifang [1 ,3 ]
机构
[1] Inner Mongolia Agr Univ, Coll Comp & Informat Engn, 29 Erdos East St, Hohhot 010011, Peoples R China
[2] Northeastern Univ, Sch Comp Sci & Engn, Comp Technol, Shenyang, Peoples R China
[3] Inner Mongolia Agr Univ, Coll Comp & Informat Engn, Hohhot, Peoples R China
[4] Northeastern Univ, Sch Comp Sci & Engn, Artificial Intelligence, Shenyang, Peoples R China
关键词
antimicrobial peptides; BERT; CNN-BiLSTM-Attention; AMP predictor; ANTICANCER PEPTIDES; DATABASE;
D O I
10.1093/bib/bbad443
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
As a kind of small molecule protein that can fight against various microorganisms in nature, antimicrobial peptides (AMPs) play an indispensable role in maintaining the health of organisms and fortifying defenses against diseases. Nevertheless, experimental approaches for AMP identification still demand substantial allocation of human resources and material inputs. Alternatively, computing approaches can assist researchers effectively and promptly predict AMPs. In this study, we present a novel AMP predictor called iAMP-Attenpred. As far as we know, this is the first work that not only employs the popular BERT model in the field of natural language processing (NLP) for AMPs feature encoding, but also utilizes the idea of combining multiple models to discover AMPs. Firstly, we treat each amino acid from preprocessed AMPs and non-AMP sequences as a word, and then input it into BERT pre-training model for feature extraction. Moreover, the features obtained from BERT method are fed to a composite model composed of one-dimensional CNN, BiLSTM and attention mechanism for better discriminating features. Finally, a flatten layer and various fully connected layers are utilized for the final classification of AMPs. Experimental results reveal that, compared with the existing predictors, our iAMP-Attenpred predictor achieves better performance indicators, such as accuracy, precision and so on. This further demonstrates that using the BERT approach to capture effective feature information of peptide sequences and combining multiple deep learning models are effective and meaningful for predicting AMPs.
引用
收藏
页数:9
相关论文
共 1 条
  • [1] A Novel Industrial Intrusion Detection Method based on Threshold-optimized CNN-BiLSTM-Attention using ROC Curve
    Lan, Mindi
    Luo, Jun
    Chai, Senchun
    Chai, Ruiqi
    Zhang, Chen
    Zhang, Baihai
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7384 - 7389