A BERT-based approach for identifying anti-inflammatory peptides using sequence information

被引:4
作者
Xu, Teng [1 ]
Wang, Qian [2 ]
Yang, Zhigang [1 ]
Ying, Jianchao [3 ,4 ]
机构
[1] Baotou Cent Hosp, Inst Translat Med, Baotou, Peoples R China
[2] Wenzhou Med Univ, Wenzhou Peoples Hosp, Clin Inst 3, Dept Clin Lab, Wenzhou, Peoples R China
[3] Wenzhou Med Univ, Affiliated Hosp 1, Dept Emergency, Wenzhou Key Lab Emergency Crit Care & Disaster Med, Wenzhou, Peoples R China
[4] Wenzhou Med Univ, Affiliated Hosp 1, Cent Lab, Wenzhou, Peoples R China
关键词
Anti-inflammatory peptide; Protein function prediction; Feature extraction; Model development; Deep learning; AMINO-ACID-COMPOSITION; INFLAMMATION;
D O I
10.1016/j.heliyon.2024.e32951
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The use of anti-inflammatory peptides (AIPs) as an alternative therapeutic approach for inflammatory diseases holds great research significance. Due to the high cost and difficulty in identifying AIPs with experimental methods, the discovery and design of peptides by computational methods before the experimental stage have become promising technology. In this study, we present BertAIP, a bidirectional encoder representation from transformers (BERT)-based method for predicting AIPs directly from their amino acid sequence without using any other information. BertAIP implements a BERT model to extract features of a protein, and uses a fully connected feed-forward network for AIP classification. It was constructed and evaluated using the AIP datasets that were reconstructed from the latest Immune Epitope Database. The experimental results showed that BertAIP achieved an accuracy of 0.751 and a Matthews correlation coefficient of 0.451, which were higher than other commonly used methods. The results of the independent test suggested that BertAIP outperformed the existing AIP predictors. In addition, to enhance the interpretability of BertAIP, we explored and visualized the amino acids that the model considered important for AIP prediction. We believe that the BertAIP proposed herein will be a useful tool for large-scale screening and identifying novel AIPs for drug development and therapeutic research related to inflammatory diseases.
引用
收藏
页数:10
相关论文
共 45 条
[1]   Classification of nuclear receptors based on amino acid composition and dipeptide composition [J].
Bhasin, M ;
Raghava, GPS .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (22) :23262-23266
[2]   BERT4Bitter: a bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides [J].
Charoenkwan, Phasit ;
Nantasenamat, Chanin ;
Hasan, Md Mehedi ;
Manavalan, Balachandran ;
Shoombuatong, Watshara .
BIOINFORMATICS, 2021, 37 (17) :2556-2562
[3]   iFeature: a Python']Python package and web server for features extraction and selection from protein and peptide sequences [J].
Chen, Zhen ;
Zhao, Pei ;
Li, Fuyi ;
Leier, Andre ;
Marquez-Lago, Tatiana T. ;
Wang, Yanan ;
Webb, Geoffrey I. ;
Smith, A. Ian ;
Daly, Roger J. ;
Chou, Kuo-Chen ;
Song, Jiangning .
BIOINFORMATICS, 2018, 34 (14) :2499-2502
[4]   Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks [J].
Choi, Hyunjin ;
Kim, Judong ;
Joe, Seongho ;
Gwon, Youngjune .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :5482-5487
[5]   Antimicrobial peptides: Role in human disease and potential as immunotherapies [J].
de la Fuente-Nunez, Cesar ;
Silva, Osmar N. ;
Lu, Timothy K. ;
Franco, Octavio Luiz .
PHARMACOLOGY & THERAPEUTICS, 2017, 178 :132-140
[6]   Glucocorticoid resistance as a major drive in sepsis pathology [J].
Dendoncker, Karen ;
Libert, Claude .
CYTOKINE & GROWTH FACTOR REVIEWS, 2017, 35 :85-96
[7]   Prediction of anti-inflammatory peptides by a sequence-based stacking ensemble model named AIPStack [J].
Deng, Hua ;
Lou, Chaofeng ;
Wu, Zengrui ;
Li, Weihua ;
Liu, Guixia ;
Tang, Yun .
ISCIENCE, 2022, 25 (09)
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]  
Dhamdhere K., 2018, ARXIV
[10]   ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning [J].
Elnaggar, Ahmed ;
Heinzinger, Michael ;
Dallago, Christian ;
Rehawi, Ghalia ;
Wang, Yu ;
Jones, Llion ;
Gibbs, Tom ;
Feher, Tamas ;
Angerer, Christoph ;
Steinegger, Martin ;
Bhowmik, Debsindhu ;
Rost, Burkhard .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :7112-7127