AMP-BERT: Prediction of antimicrobial peptide function based on a BERT model

Cited by: 51
Authors
Lee, Hansol [1]
Lee, Songyeon [1]
Lee, Ingoo [1]
Nam, Hojung [1,2]
Affiliations
[1] Gwangju Inst Sci & Technol GIST, Sch Elect Engn & Comp Sci, Gwangju, South Korea
[2] Gwangju Inst Sci & Technol, AI Grad Sch, 123 Cheomdangwagi Ro, Gwangju 61005, South Korea
Funding
National Research Foundation, Singapore;
Keywords
antimicrobial peptides; antimicrobial resistance; BERT; deep learning; drug discovery; machine learning; sequence classification; transformer; CD-HIT; PROTEIN; THIONINS;
DOI
10.1002/pro.4529
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010; 081704;
Abstract
Antimicrobial resistance is a growing health concern. Antimicrobial peptides (AMPs) disrupt harmful microorganisms by nonspecific mechanisms, making it difficult for microbes to develop resistance. Accordingly, they are promising alternatives to traditional antimicrobial drugs. In this study, we developed an improved AMP classification model, called AMP-BERT. We propose a deep learning model with a fine-tuned bidirectional encoder representations from transformers (BERT) architecture designed to extract structural/functional information from input peptides and identify each input as AMP or non-AMP. We compared the performance of our proposed model with that of other machine/deep learning-based methods. Our model, AMP-BERT, yielded the best prediction results among all models evaluated with our curated external dataset. In addition, we utilized the attention mechanism in BERT to implement an interpretable feature analysis and determine the specific residues in known AMPs that contribute to peptide structure and antimicrobial function. The results show that AMP-BERT can capture the structural properties of peptides for model learning, enabling the prediction of AMPs or non-AMPs from input sequences. AMP-BERT is expected to contribute to the identification of candidate AMPs for functional validation and drug development. The code and dataset for the fine-tuning of AMP-BERT are publicly available at .
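The attention-based residue analysis mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how attention weights were aggregated, so the averaging scheme below, the function names `residue_attention_scores` and `top_residues`, the toy peptide, and the toy attention values are all assumptions made for illustration, not the authors' method.

```python
from typing import List, Tuple


def residue_attention_scores(attention: List[List[List[float]]]) -> List[float]:
    """Average attention each position receives, over heads and query positions.

    attention[h][i][j] is the weight that head h assigns from query
    position i to key position j (each row is assumed to sum to 1).
    """
    n_heads = len(attention)
    seq_len = len(attention[0])
    scores = [0.0] * seq_len
    for head in attention:
        for row in head:
            for j, weight in enumerate(row):
                scores[j] += weight
    return [s / (n_heads * seq_len) for s in scores]


def top_residues(sequence: str, scores: List[float], k: int = 3) -> List[Tuple[int, str, float]]:
    """Return the k positions with the highest scores as (index, residue, score)."""
    ranked = sorted(range(len(sequence)), key=lambda i: scores[i], reverse=True)
    return [(i, sequence[i], scores[i]) for i in ranked[:k]]


# Toy input (hypothetical): a 4-residue peptide and two attention heads.
peptide = "GLFK"
toy_attention = [
    [[0.1, 0.2, 0.3, 0.4]] * 4,      # head 0 favours later residues
    [[0.25, 0.25, 0.25, 0.25]] * 4,  # head 1 attends uniformly
]

scores = residue_attention_scores(toy_attention)
for index, residue, score in top_residues(peptide, scores, k=2):
    print(index, residue, round(score, 3))
```

In practice the attention tensor would come from the fine-tuned model rather than hand-written values, and special tokens would need to be excluded before ranking residues.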
Pages: 13