AGRAMP: machine learning models for predicting antimicrobial peptides against phytopathogenic bacteria

被引:5
|
作者
Shao, Jonathan [1 ,2 ]
Zhao, Yan [3 ]
Wei, Wei [3 ]
Vaisman, Iosif I. [2 ]
机构
[1] ARS, USDA, Stat & Bioinformat Grp Northeast Area, Beltsville, MD USA
[2] George Mason Univ, Sch Syst Biol, Manassas, VA 20110 USA
[3] ARS, USDA, Mol Plant Pathol Lab, Beltsville, MD USA
关键词
antimicrobial peptide; AGRAMP; Spiroplasma; N-gram; random forest; AMP; PROTEIN; SEQUENCE;
D O I
10.3389/fmicb.2024.1304044
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Introduction Antimicrobial peptides (AMPs) are promising alternatives to traditional antibiotics for combating plant pathogenic bacteria in agriculture and the environment. However, identifying potent AMPs through laborious experimental assays is resource-intensive and time-consuming. To address these limitations, this study presents a bioinformatics approach utilizing machine learning models for predicting and selecting AMPs active against plant pathogenic bacteria.Methods N-gram representations of peptide sequences with 3-letter and 9-letter reduced amino acid alphabets were used to capture the sequence patterns and motifs that contribute to the antimicrobial activity of AMPs. A 5-fold cross-validation technique was used to train the machine learning models and to evaluate their predictive accuracy and robustness.Results The models were applied to predict putative AMPs encoded by intergenic regions and small open reading frames (ORFs) of the citrus genome. Approximately 7% of the 10,000-peptide dataset from the intergenic region and 7% of the 685,924-peptide dataset from the whole genome were predicted as probable AMPs. The prediction accuracy of the reported models range from 0.72 to 0.91. A subset of the predicted AMPs was selected for experimental test against Spiroplasma citri, the causative agent of citrus stubborn disease. The experimental results confirm the antimicrobial activity of the selected AMPs against the target bacterium, demonstrating the predictive capability of the machine learning models.Discussion Hydrophobic amino acid residues and positively charged amino acid residues are among the key features in predicting AMPs by the Random Forest Algorithm. Aggregation propensity appears to be correlated with the effectiveness of the AMPs. The described models would contribute to the development of effective AMP-based strategies for plant disease management in agricultural and environmental settings. To facilitate broader accessibility, our model is publicly available on the AGRAMP (Agricultural Ngrams Antimicrobial Peptides) server.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Predicting student success in MOOCs: a comprehensive analysis using machine learning models
    Althibyani, Hosam A.
    PeerJ Computer Science, 2024, 10
  • [42] A systematic review of machine learning models for predicting outcomes of stroke with structured data
    Wang, Wenjuan
    Kiik, Martin
    Peek, Niels
    Curcin, Vasa
    Marshall, Iain J.
    Rudd, Anthony G.
    Wang, Yanzhong
    Douiri, Abdel
    Wolfe, Charles D.
    Bray, Benjamin
    PLOS ONE, 2020, 15 (06):
  • [43] Modifications on amphiphilicity and cationicity of unnatural amino acid containing peptides for the improvement of antimicrobial activity against pathogenic bacteria
    Taira, Junichi
    Kida, Yutaka
    Yamaguchi, Hiroshi
    Kuwano, Koichi
    Higashimoto, Yuichiro
    Kodama, Hiroaki
    JOURNAL OF PEPTIDE SCIENCE, 2010, 16 (11) : 607 - 612
  • [44] Machine learning-based models for genomic predicting neoadjuvant Machine learning-based models for genomic predicting neoadjuvant chemotherapeutic sensitivity in cervical cancer chemotherapeutic sensitivity in cervical cancer
    Guo, Lu
    Wang, Wei
    Xie, Xiaodong
    Wang, Shuihua
    Zhang, Yudong
    BIOMEDICINE & PHARMACOTHERAPY, 2023, 159
  • [45] Prediction of Therapeutic Peptides Using Machine Learning: Computational Models, Datasets, and Feature Encodings
    Attique, Muhammad
    Farooq, Muhammad Shoaib
    Khelifi, Adel
    Abid, Adnan
    IEEE ACCESS, 2020, 8 (08): : 148570 - 148594
  • [46] Combining statistical, machine learning and experimental approaches for screening of novel antimicrobial peptides of calf cruor hydrolysates
    Sanchez-Reinoso, Zain
    Garcia-Vela, Sara
    Clement, Jean-Pierre
    Bazinet, Laurent
    FOOD BIOSCIENCE, 2025, 65
  • [47] Development of Predictive Models using Machine Learning Algorithms for Food Adulterants Bacteria Detection
    Amado, Timothy M.
    Burman, Ma Rica
    Chicote, Relamae F.
    Espenida, Sheila May C.
    Masangcay, Honeyleth L.
    Ventura, Camille H.
    Tolentino, Lean Karlo S.
    Padilla, Maria Victoria C.
    Madrigal, Gilfred Allen M.
    Enriquez, Lejan Alfred C.
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT, AND MANAGEMENT (HNICEM), 2019,
  • [48] Pharmaceutical nanotechnology: Antimicrobial peptides as potential new drugs against WHO list of critical, high, and medium priority bacteria
    Roque-Borda, Cesar Augusto
    Da Silva, Patricia Bento
    Rodrigues, Mosar Correa
    Filippo, Leonardo Delello Di
    Duarte, Jonatas L.
    Chorilli, Marlus
    Vicente, Eduardo Festozo
    Garrido, Saulo Santesso
    Pavan, Fernando Rogenio
    EUROPEAN JOURNAL OF MEDICINAL CHEMISTRY, 2022, 241
  • [49] Design, synthesis, and antimicrobial activities of novel functional peptides against Gram-positive and Gram-negative bacteria
    Lee, Ping-Chien
    Chu, Chia-Chun
    Tsai, Yi-Je
    Chuang, Ya-Chu
    Lung, Feng-Di
    CHEMICAL BIOLOGY & DRUG DESIGN, 2019, 94 (02) : 1537 - 1544
  • [50] Comparing the Performance of 17 Machine Learning Models in Predicting Human Population Growth of Countries
    Otoom, Mohammad Mahmood
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (01): : 220 - 225