Machine Learning Approaches for Predicting Protein Complex Similarity

被引:0
|
作者
Farhoodi, Roshanak [1 ]
Akbal-Delibas, Bahar [2 ]
Haspel, Nurit [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
[2] Kadir Has Univ, Dept Comp Engn, Istanbul, Turkey
关键词
machine learning; neural networks; protein docking and refinement; RMSD prediction; scoring functions; EVOLUTIONARY TRACE; WEB SERVER; DOCKING; ELECTROSTATICS; DESOLVATION; REFINEMENT; ALGORITHMS;
D O I
10.1089/cmb.2016.0137
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Discriminating native-like structures from false positives with high accuracy is one of the biggest challenges in protein-protein docking. While there is an agreement on the existence of a relationship between various favorable intermolecular interactions (e.g., Van der Waals, electrostatic, and desolvation forces) and the similarity of a conformation to its native structure, the precise nature of this relationship is not known. Existing protein-protein docking methods typically formulate this relationship as a weighted sum of selected terms and calibrate their weights by using a training set to evaluate and rank candidate complexes. Despite improvements in the predictive power of recent docking methods, producing a large number of false positives by even state-of-the-art methods often leads to failure in predicting the correct binding of many complexes. With the aid of machine learning methods, we tested several approaches that not only rank candidate structures relative to each other but also predict how similar each candidate is to the native conformation. We trained a two-layer neural network, a multilayer neural network, and a network of Restricted Boltzmann Machines against extensive data sets of unbound complexes generated by RosettaDock and PyDock. We validated these methods with a set of refinement candidate structures. We were able to predict the root mean squared deviations (RMSDs) of protein complexes with a very small, often less than 1.5 angstrom, error margin when trained with structures that have RMSD values of up to 7 angstrom. In our most recent experiments with the protein samples having RMSD values up to 27 angstrom, the average prediction error was still relatively small, attesting to the potential of our approach in predicting the correct binding of protein-protein complexes.
引用
收藏
页码:40 / 51
页数:12
相关论文
共 50 条
  • [21] Machine learning solutions for predicting protein-protein interactions
    Casadio, Rita
    Martelli, Pier Luigi
    Savojardo, Castrense
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2022, 12 (06)
  • [22] Progressive Machine Learning Approaches for Predicting the Soil Compaction Parameters
    Benbouras, Mohammed Amin
    Lefilef, Lina
    TRANSPORTATION INFRASTRUCTURE GEOTECHNOLOGY, 2023, 10 (02) : 211 - 238
  • [23] Predicting Methylphenidate Response in ADHD Using Machine Learning Approaches
    Kim, Jae-Won
    Sharma, Vinod
    Ryan, Neal D.
    INTERNATIONAL JOURNAL OF NEUROPSYCHOPHARMACOLOGY, 2015, 18 (11)
  • [24] Predicting EHL film thickness parameters by machine learning approaches
    Max Marian
    Jonas Mursak
    Marcel Bartz
    Francisco J. Profito
    Andreas Rosenkranz
    Sandro Wartzack
    Friction, 2023, 11 : 992 - 1013
  • [25] Predicting EHL film thickness parameters by machine learning approaches
    MARIAN, Max
    MURSAK, Jonas
    BARTZ, Marcel
    PROFITO, Francisco J.
    ROSENKRANZ, Andreas
    WARTZACK, Sandro
    FRICTION, 2023, 11 (06) : 992 - 1013
  • [26] Predicting DPP-IV inhibitors with machine learning approaches
    Cai, Jie
    Li, Chanjuan
    Liu, Zhihong
    Du, Jiewen
    Ye, Jiming
    Gu, Qiong
    Xu, Jun
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2017, 31 (04) : 393 - 402
  • [27] Predicting the evolution of scientific communities by interpretable machine learning approaches
    Tian, Yunpei
    Li, Gang
    Mao, Jin
    JOURNAL OF INFORMETRICS, 2023, 17 (02)
  • [28] Machine learning approaches for predicting biomolecule-disease associations
    Ding, Yulian
    Lei, Xiujuan
    Liao, Bo
    Wu, Fang-Xiang
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2021, 20 (04) : 273 - 287
  • [29] Predicting novel microRNA: a comprehensive comparison of machine learning approaches
    Stegmayer, Georgina
    Di Persia, Leandro E.
    Rubiolo, Mariano
    Gerard, Matias
    Pividori, Milton
    Yones, Cristian
    Bugnon, Leandro A.
    Rodriguez, Tadeo
    Raad, Jonathan
    Milone, Diego H.
    BRIEFINGS IN BIOINFORMATICS, 2019, 20 (05) : 1607 - 1620
  • [30] Machine learning approaches for predicting household transportation energy use
    Amiri, Shideh Shams
    Mostafavi, Nariman
    Lee, Earl Rusty
    Hoque, Simi
    CITY AND ENVIRONMENT INTERACTIONS, 2020, 7