Machine Learning Approaches for Predicting Protein Complex Similarity

被引:0
|
作者
Farhoodi, Roshanak [1 ]
Akbal-Delibas, Bahar [2 ]
Haspel, Nurit [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
[2] Kadir Has Univ, Dept Comp Engn, Istanbul, Turkey
关键词
machine learning; neural networks; protein docking and refinement; RMSD prediction; scoring functions; EVOLUTIONARY TRACE; WEB SERVER; DOCKING; ELECTROSTATICS; DESOLVATION; REFINEMENT; ALGORITHMS;
D O I
10.1089/cmb.2016.0137
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Discriminating native-like structures from false positives with high accuracy is one of the biggest challenges in protein-protein docking. While there is an agreement on the existence of a relationship between various favorable intermolecular interactions (e.g., Van der Waals, electrostatic, and desolvation forces) and the similarity of a conformation to its native structure, the precise nature of this relationship is not known. Existing protein-protein docking methods typically formulate this relationship as a weighted sum of selected terms and calibrate their weights by using a training set to evaluate and rank candidate complexes. Despite improvements in the predictive power of recent docking methods, producing a large number of false positives by even state-of-the-art methods often leads to failure in predicting the correct binding of many complexes. With the aid of machine learning methods, we tested several approaches that not only rank candidate structures relative to each other but also predict how similar each candidate is to the native conformation. We trained a two-layer neural network, a multilayer neural network, and a network of Restricted Boltzmann Machines against extensive data sets of unbound complexes generated by RosettaDock and PyDock. We validated these methods with a set of refinement candidate structures. We were able to predict the root mean squared deviations (RMSDs) of protein complexes with a very small, often less than 1.5 angstrom, error margin when trained with structures that have RMSD values of up to 7 angstrom. In our most recent experiments with the protein samples having RMSD values up to 27 angstrom, the average prediction error was still relatively small, attesting to the potential of our approach in predicting the correct binding of protein-protein complexes.
引用
收藏
页码:40 / 51
页数:12
相关论文
共 50 条
  • [31] Progressive Machine Learning Approaches for Predicting the Soil Compaction Parameters
    Mohammed Amin Benbouras
    Lina Lefilef
    Transportation Infrastructure Geotechnology, 2023, 10 : 211 - 238
  • [32] Predicting online shopping cart abandonment with machine learning approaches
    Rausch, Theresa Maria
    Derra, Nicholas Daniel
    Wolf, Lukas
    INTERNATIONAL JOURNAL OF MARKET RESEARCH, 2022, 64 (01) : 89 - 112
  • [33] Machine learning approaches for predicting shielding effectiveness of carbon fiber-reinforced mortars
    Husnain, Ali
    Iqbal, Munir
    Ashraf, Muhammad
    Mohammed, Deema
    Javed, Muhammad Faisal
    Alabduljabbar, Hisham
    Elminaam, Diaa Salama Abd
    CASE STUDIES IN CONSTRUCTION MATERIALS, 2024, 20
  • [34] Predicting Inhibitors of Acetylcholinesterase by Regression and Classification Machine Learning Approaches with Combinations of Molecular Descriptors
    Chekmarev, Dmitriy
    Kholodovych, Vladyslav
    Kortagere, Sandhya
    Welsh, William J.
    Ekins, Sean
    PHARMACEUTICAL RESEARCH, 2009, 26 (09) : 2216 - 2224
  • [35] Machine learning approaches for predicting link failures in production networks
    Wubete, Bruck W.
    Esfandiari, Babak
    Kunz, Thomas
    COMPUTER NETWORKS, 2025, 259
  • [36] Comparative analysis of machine learning approaches for predicting respiratory virus infection and symptom severity
    Isik, Yunus Emre
    Aydin, Zafer
    PEERJ, 2023, 11
  • [37] Predicting Inhibitors of Acetylcholinesterase by Regression and Classification Machine Learning Approaches with Combinations of Molecular Descriptors
    Dmitriy Chekmarev
    Vladyslav Kholodovych
    Sandhya Kortagere
    William J. Welsh
    Sean Ekins
    Pharmaceutical Research, 2009, 26 : 2216 - 2224
  • [38] Structure-Based Approaches for Protein-Protein Interaction Prediction Using Machine Learning and Deep Learning
    Kiouri, Despoina P.
    Batsis, Georgios C.
    Chasapis, Christos T.
    BIOMOLECULES, 2025, 15 (01)
  • [39] Predicting protein complex geometries with a neural network
    Chae, Myong-Ho
    Krull, Florian
    Lorenzen, Stephan
    Knapp, Ernst-Walter
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2010, 78 (04) : 1026 - 1039
  • [40] Predicting the concentration of sulfate using machine learning methods
    Tahraoui, Hichem
    Belhadj, Abd-Elmouneim
    Amrane, Abdeltif
    Houssein, Essam H.
    EARTH SCIENCE INFORMATICS, 2022, 15 (02) : 1023 - 1044