AGRAMP: machine learning models for predicting antimicrobial peptides against phytopathogenic bacteria

被引:5
|
作者
Shao, Jonathan [1 ,2 ]
Zhao, Yan [3 ]
Wei, Wei [3 ]
Vaisman, Iosif I. [2 ]
机构
[1] ARS, USDA, Stat & Bioinformat Grp Northeast Area, Beltsville, MD USA
[2] George Mason Univ, Sch Syst Biol, Manassas, VA 20110 USA
[3] ARS, USDA, Mol Plant Pathol Lab, Beltsville, MD USA
关键词
antimicrobial peptide; AGRAMP; Spiroplasma; N-gram; random forest; AMP; PROTEIN; SEQUENCE;
D O I
10.3389/fmicb.2024.1304044
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Introduction Antimicrobial peptides (AMPs) are promising alternatives to traditional antibiotics for combating plant pathogenic bacteria in agriculture and the environment. However, identifying potent AMPs through laborious experimental assays is resource-intensive and time-consuming. To address these limitations, this study presents a bioinformatics approach utilizing machine learning models for predicting and selecting AMPs active against plant pathogenic bacteria.Methods N-gram representations of peptide sequences with 3-letter and 9-letter reduced amino acid alphabets were used to capture the sequence patterns and motifs that contribute to the antimicrobial activity of AMPs. A 5-fold cross-validation technique was used to train the machine learning models and to evaluate their predictive accuracy and robustness.Results The models were applied to predict putative AMPs encoded by intergenic regions and small open reading frames (ORFs) of the citrus genome. Approximately 7% of the 10,000-peptide dataset from the intergenic region and 7% of the 685,924-peptide dataset from the whole genome were predicted as probable AMPs. The prediction accuracy of the reported models range from 0.72 to 0.91. A subset of the predicted AMPs was selected for experimental test against Spiroplasma citri, the causative agent of citrus stubborn disease. The experimental results confirm the antimicrobial activity of the selected AMPs against the target bacterium, demonstrating the predictive capability of the machine learning models.Discussion Hydrophobic amino acid residues and positively charged amino acid residues are among the key features in predicting AMPs by the Random Forest Algorithm. Aggregation propensity appears to be correlated with the effectiveness of the AMPs. The described models would contribute to the development of effective AMP-based strategies for plant disease management in agricultural and environmental settings. To facilitate broader accessibility, our model is publicly available on the AGRAMP (Agricultural Ngrams Antimicrobial Peptides) server.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Predicting Web Survey Breakoffs Using Machine Learning Models
    Chen, Zeming
    Cernat, Alexandru
    Shlomo, Natalie
    SOCIAL SCIENCE COMPUTER REVIEW, 2023, 41 (02) : 573 - 591
  • [22] THPep: A machine learning-based approach for predicting tumor homing peptides
    Shoombuatong, Watshara
    Schaduangrat, Nalini
    Pratiwi, Reny
    Nantasenamat, Chanin
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 80 : 441 - 451
  • [23] Marine Antimicrobial Peptides: Nature Provides Templates for the Design of Novel Compounds against Pathogenic Bacteria
    Falanga, Annarita
    Lombardi, Lucia
    Franci, Gianluigi
    Vitiello, Mariateresa
    Iovene, Maria Rosaria
    Morelli, Giancarlo
    Galdiero, Massimiliano
    Galdiero, Stefania
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (05)
  • [24] Classification and Prediction of Antimicrobial Peptides Using N-gram Representation and Machine Learning
    Othman, Manal
    Ratna, Sujay
    Tewari, Anant
    Kang, Anthony M.
    Du, Katherine
    Vaisman, Iosif I.
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 605 - 605
  • [25] Waste to resource: Mining antimicrobial peptides in sludge from metagenomes using machine learning
    Xu, Jiaqi
    Xu, Xin
    Jiang, Yunhan
    Fu, Yulong
    Shen, Chaofeng
    ENVIRONMENT INTERNATIONAL, 2024, 186
  • [26] Transgenic expression of antimicrobial peptides from black soldier fly enhance resistance against entomopathogenic bacteria in the silkworm, Bombyx mori
    Xu, Jian
    Luo, Xingyu
    Fang, Gangqi
    Zhan, Shuai
    Wu, Jun
    Wang, Dun
    Huang, Yongping
    INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY, 2020, 127
  • [27] Hybrid Machine Learning Models for Predicting Types of Human T-cell Lymphotropic Virus
    Sharma, Gaurav
    Rana, Prashant Singh
    Bawa, Seema
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (04) : 1524 - 1534
  • [28] Machine learning models for predicting steroid-resistant of nephrotic syndrome
    Ye, Qing
    Li, Yuzhou
    Liu, Huihui
    Mao, Jianhua
    Jiang, Hangjin
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [29] Predicting GPA of University Students with Supervised Regression Machine Learning Models
    Falat, Lukas
    Piscova, Terezia
    APPLIED SCIENCES-BASEL, 2022, 12 (17):
  • [30] Predicting element concentrations by machine learning models in neutron activation analysis
    Huu Nghia Nguyen
    Quang Thien Tran
    Tuan Anh Tran
    Quang Trung Phan
    Minh Dao Nguyen
    Thi Thu Huong Tuong
    Thi Nhu Quynh Chau
    Journal of Radioanalytical and Nuclear Chemistry, 2024, 333 : 1759 - 1768