BIPEP: Sequence-based Prediction of Biofilm Inhibitory Peptides Using a Combination of NMR and Physicochemical Descriptors

被引:28
|
作者
Atanaki, Fereshteh Fallah [1 ]
Behrouzi, Saman [1 ]
Ariaeenejad, Shohreh [2 ]
Boroomand, Amin [3 ]
Kavousi, Kaveh [1 ]
机构
[1] Univ Tehran, IBB, Dept Bioinformat, Lab Complex Biol Syst & Bioinformat CBB, Tehran 1417466191, Iran
[2] AREEO, ABRII, Dept Syst & Synthet Biol, Karaj 315351897, Iran
[3] Univ Calif Merced, Sch Nat Sci, Merced, CA 95343 USA
来源
ACS OMEGA | 2020年 / 5卷 / 13期
基金
美国国家科学基金会;
关键词
DATABASE; PROTEIN;
D O I
10.1021/acsomega.9b04119
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Biofilms are biological systems that are formed by a community of microorganisms in which microbial cells are connected on a surface within a self-produced matrix of an extracellular polymeric substance. On some occasions, microorganisms use biofilms to protect themselves against the harmful effects of the host body immune system and the surrounding environment, hence increasing their chances of survival against the various anti-microbial agents. Biofilms play a crucial role in medicine and industry because of the problems they cause. Designing agents that inhibit bacterial biofilm formation is very costly and takes too much time in the laboratory to be discovered and validated. Therefore, developing computational tools for the prediction of biofilm inhibitor peptides is inevitable and important. Here, we present a computational prediction tool to screen the vast number of peptide sequences and select potential candidate peptides for further lab experiments and validation. In this learning model, different feature vectors, extracted from the peptide primary structure, are exploited to learn patterns from the sequence of biofilm inhibitory peptides. Various classification algorithms including SVM, random forest, and k-nearest neighbor have been examined to evaluate their performance. Overall, our approach showed better prediction in comparison with other prediction methods. In this study, for the first time, we applied features extracted from NMR spectra of amino acids along with physicochemical features. Although each group of features showed good discrimination potential alone, we used a combination of features to enhance the performance of our method. Our prediction tool is freely available.
引用
收藏
页码:7290 / 7297
页数:8
相关论文
共 23 条
  • [21] Sequence-based prediction of SARS-CoV-2 vaccine targets using a mass spectrometry-based bioinformatics predictor identifies immunogenic T cell epitopes
    Poran, Asaf
    Harjanto, Dewi
    Malloy, Matthew
    Arieta, Christina M.
    Rothenberg, Daniel A.
    Lenkala, Divya
    van Buuren, Marit M.
    Addona, Terri A.
    Rooney, Michael S.
    Srinivasan, Lakshmi
    Gaynor, Richard B.
    GENOME MEDICINE, 2020, 12 (01)
  • [22] Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation
    Manavalan, Balachandran
    Basith, Shaherin
    Shin, Tae Hwan
    Wei, Leyi
    Lee, Gwang
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2019, 16 : 733 - 744
  • [23] Enhanced prediction of anti-tubercular peptides from sequence information using divergence measure-based intuitionistic fuzzy-rough feature selection
    Jain, Pankhuri
    Tiwari, Anoop Kumar
    Som, Tanmoy
    SOFT COMPUTING, 2021, 25 (04) : 3065 - 3086