Accelerating the Screening of Small Peptide Ligands by Combining Peptide-Protein Docking and Machine Learning

被引:4
|
作者
Codina, Josep-Ramon [1 ]
Mascini, Marcello [2 ]
Dikici, Emre [1 ,3 ]
Deo, Sapna K. [1 ,3 ]
Daunert, Sylvia [1 ,3 ,4 ]
机构
[1] Univ Miami, Miller Sch Med, Dept Biochem & Mol Biol, Miami, FL 33136 USA
[2] Univ Teramo, Dept Biosci & Technol Food Agr & Environm, I-64100 Teramo, Italy
[3] Univ Miami, Dr John T Macdonald Fdn Biomed Nanotechnol Inst Bi, Miami, FL 33136 USA
[4] Univ Miami, Clin & Translat Sci Inst CTSI, Miami, FL 33136 USA
基金
美国国家卫生研究院;
关键词
machine learning; small peptides; molecular docking; classification algorithms; AMINO-ACID; INTERACTION SITES; DESCRIPTORS; PREDICTION; COMPLEX; DESIGN; INDEX;
D O I
10.3390/ijms241512144
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This research introduces a novel pipeline that couples machine learning (ML), and molecular docking for accelerating the process of small peptide ligand screening through the prediction of peptide-protein docking. Eight ML algorithms were analyzed for their potential. Notably, Light Gradient Boosting Machine (LightGBM), despite having comparable F1-score and accuracy to its counterparts, showcased superior computational efficiency. LightGBM was used to classify peptide-protein docking performance of the entire tetrapeptide library of 160,000 peptide ligands against four viral envelope proteins. The library was classified into two groups, 'better performers' and 'worse performers'. By training the LightGBM algorithm on just 1% of the tetrapeptide library, we successfully classified the remaining 99%with an accuracy range of 0.81-0.85 and an F1-score between 0.58-0.67. Three different molecular docking software were used to prove that the process is not software dependent. With an adjustable probability threshold (from 0.5 to 0.95), the process could be accelerated by a factor of at least 10-fold and still get 90-95% concurrence with the method without ML. This study validates the efficiency of machine learning coupled to molecular docking in rapidly identifying top peptides without relying on high-performance computing power, making it an effective tool for screening potential bioactive compounds.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Leveraging machine learning models for peptide-protein interaction prediction
    Yin, Song
    Mi, Xuenan
    Shukla, Diwakar
    RSC CHEMICAL BIOLOGY, 2024, 5 (05): : 401 - 417
  • [2] Fully Blind Peptide-Protein Docking with pepATTRACT
    Schindler, Christina E. M.
    de Vries, Sjoerd J.
    Zacharias, Martin
    STRUCTURE, 2015, 23 (08) : 1507 - 1515
  • [3] Extensive benchmark of rDock as a peptide-protein docking tool
    Daniel Soler
    Yvonne Westermaier
    Robert Soliva
    Journal of Computer-Aided Molecular Design, 2019, 33 : 613 - 626
  • [4] Harnessing protein folding neural networks for peptide-protein docking
    Tsaban, Tomer
    Varga, Julia K.
    Avraham, Orly
    Ben-Aharon, Ziv
    Khramushin, Alisa
    Schueler-Furman, Ora
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [5] Extensive benchmark of rDock as a peptide-protein docking tool
    Soler, Daniel
    Westermaier, Yvonne
    Soliva, Robert
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2019, 33 (07) : 613 - 626
  • [6] Comparison of two docking methods for peptide-protein interactions
    Yu, Qiuying
    Wang, Fangyu
    Hu, Xiaofei
    Xing, Guangxu
    Deng, Ruiguang
    Guo, Junqing
    Cheng, Anchun
    Wang, Jing
    Hao, Junfang
    Zhao, Dong
    Teng, Man
    Zhang, Gaiping
    JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE, 2018, 98 (10) : 3722 - 3727
  • [7] Parallel Evolutionary Optimization Algorithms for Peptide-Protein Docking
    Poluyan, Sergey
    Ershov, Nikolay
    MATHEMATICAL MODELING AND COMPUTATIONAL PHYSICS 2017 (MMCP 2017), 2018, 173
  • [8] Protein-protein and peptide-protein docking and refinement using ATTRACT in CAPRI
    Schindler, Christina E. M.
    de Beauchene, Isaure Chauvot
    de Vries, Sjoerd J.
    Zacharias, Martin
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2017, 85 (03) : 391 - 398
  • [9] A fragment-based docking simulation for investigating peptide-protein bindings
    Liao, Jun-min
    Wang, Yeng-Tseng
    Lin, Chen-lung Steve
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2017, 19 (16) : 10436 - 10442
  • [10] TPepPro: a deep learning model for predicting peptide-protein interactions
    Jin, Xiaohong
    Chen, Zimeng
    Yu, Dan
    Jiang, Qianhui
    Chen, Zhuobin
    Yan, Bin
    Qin, Jing
    Liu, Yong
    Wang, Junwen
    BIOINFORMATICS, 2024, 41 (01)