Machine Learning Approaches for Predicting Protein Complex Similarity

被引:0
|
作者
Farhoodi, Roshanak [1 ]
Akbal-Delibas, Bahar [2 ]
Haspel, Nurit [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
[2] Kadir Has Univ, Dept Comp Engn, Istanbul, Turkey
关键词
machine learning; neural networks; protein docking and refinement; RMSD prediction; scoring functions; EVOLUTIONARY TRACE; WEB SERVER; DOCKING; ELECTROSTATICS; DESOLVATION; REFINEMENT; ALGORITHMS;
D O I
10.1089/cmb.2016.0137
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Discriminating native-like structures from false positives with high accuracy is one of the biggest challenges in protein-protein docking. While there is an agreement on the existence of a relationship between various favorable intermolecular interactions (e.g., Van der Waals, electrostatic, and desolvation forces) and the similarity of a conformation to its native structure, the precise nature of this relationship is not known. Existing protein-protein docking methods typically formulate this relationship as a weighted sum of selected terms and calibrate their weights by using a training set to evaluate and rank candidate complexes. Despite improvements in the predictive power of recent docking methods, producing a large number of false positives by even state-of-the-art methods often leads to failure in predicting the correct binding of many complexes. With the aid of machine learning methods, we tested several approaches that not only rank candidate structures relative to each other but also predict how similar each candidate is to the native conformation. We trained a two-layer neural network, a multilayer neural network, and a network of Restricted Boltzmann Machines against extensive data sets of unbound complexes generated by RosettaDock and PyDock. We validated these methods with a set of refinement candidate structures. We were able to predict the root mean squared deviations (RMSDs) of protein complexes with a very small, often less than 1.5 angstrom, error margin when trained with structures that have RMSD values of up to 7 angstrom. In our most recent experiments with the protein samples having RMSD values up to 27 angstrom, the average prediction error was still relatively small, attesting to the potential of our approach in predicting the correct binding of protein-protein complexes.
引用
收藏
页码:40 / 51
页数:12
相关论文
共 50 条
  • [41] Machine learning for predicting protein properties: A comprehensive review
    Wang, Yizhen
    Zhang, Yanyun
    Zhan, Xuhui
    He, Yuhao
    Yang, Yongfu
    Cheng, Li
    Alghazzawi, Daniyal
    NEUROCOMPUTING, 2024, 597
  • [42] ppdx: Automated modeling of protein-protein interaction descriptors for use with machine learning
    Conti, Simone
    Ovchinnikov, Victor
    Karplus, Martin
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2022, 43 (25) : 1747 - 1757
  • [43] Machine Learning Approaches for Quality Assessment of Protein Structures
    Chen, Jiarui
    Siu, Shirley W., I
    BIOMOLECULES, 2020, 10 (04)
  • [44] Streamflow regionalisation of an ungauged catchment with machine learning approaches
    Singh, Lakhwinder
    Mishra, Prabhash Kumar
    Pingale, Santosh Murlidhar
    Khare, Deepak
    Thakur, Hitesh Prasad
    HYDROLOGICAL SCIENCES JOURNAL, 2022, 67 (06) : 886 - 897
  • [45] Similarity-Based Machine Learning Model for Predicting the Metabolic Pathways of Compounds
    Jia, Yanjuan
    Zhao, Ran
    Chen, Lei
    IEEE ACCESS, 2020, 8 : 130687 - 130696
  • [46] Machine Learning Approaches for Predicting Power Conversion Efficiency in Organic Solar Cells: A Comprehensive Review
    Jiang, Yang
    Yao, Chuang
    Yang, Yezi
    Wang, Jinshan
    SOLAR RRL, 2024, 8 (22):
  • [47] Predicting the glass formation of metallic glasses using machine learning approaches
    Li, Zhuang
    Long, Zhilin
    Lei, Shan
    Zhang, Ting
    Liu, Xiaowei
    Kuang, Dumin
    COMPUTATIONAL MATERIALS SCIENCE, 2021, 197
  • [48] Predicting Measles Outbreaks in the United States: Evaluation of Machine Learning Approaches
    Ru, Boshu
    Kujawski, Stephanie
    Afanador, Nelson Lee
    Baumgartner, Richard
    Pawaskar, Manjiri
    Das, Amar
    JMIR FORMATIVE RESEARCH, 2023, 7
  • [49] Predicting financial distress using machine learning approaches: Evidence China
    Rahman, Md Jahidur
    Zhu, Hongtao
    JOURNAL OF CONTEMPORARY ACCOUNTING & ECONOMICS, 2024, 20 (01)
  • [50] A review of machine learning approaches for predicting lettuce yield in hydroponic systems
    Sharmin, Sabrina
    Hossan, Md. Tazel
    Uddin, Mohammad Shorif
    SMART AGRICULTURAL TECHNOLOGY, 2025, 11