iSEE: Interface structure, evolution, and energy-based machine learning predictor of binding affinity changes upon mutations

被引:59
作者
Geng, Cunliang [1 ]
Vangone, Anna [1 ,2 ]
Folkers, Gert E. [1 ]
Xue, Li C. [1 ]
Bonvin, Alexandre M. J. J. [1 ]
机构
[1] Univ Utrecht, Fac Sci Chem, Bijvoet Ctr Biomol Res, Padualaan 8, NL-3584 CH Utrecht, Netherlands
[2] Roche Innovat Ctr Penzberg, Roche Pharmaceut Res & Early Dev, Large Mol Res, Penzberg, Germany
基金
欧盟地平线“2020”;
关键词
binding affinity; full mutation scanning; machine learning; protein-protein interactions; single point mutation; PROTEIN-PROTEIN BINDING; HOT-SPOTS; DATABASE; HADDOCK; CAPRI;
D O I
10.1002/prot.25630
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning-based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution, and Energy-based features for the prediction. iSEE achieves, using only 31 features, a high prediction performance with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal/mol on a diverse training dataset consisting of 1102 mutations in 57 protein-protein complexes. It competes with existing state-of-the-art methods on two blind test datasets. Predictions for a new dataset of 487 mutations in 56 protein complexes from the recently published SKEMPI 2.0 database reveals that none of the current methods perform well (PCC < 0.42), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservations for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2-p53 complex.
引用
收藏
页码:110 / 119
页数:10
相关论文
共 40 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], BIOINFORMATICS
[3]   Small-molecule inhibitors of protein-protein interactions: Progressing towards the dream [J].
Arkin, MR ;
Wells, JA .
NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (04) :301-317
[4]   Predicting free energy changes using structural ensembles [J].
Benedix, Alexander ;
Becker, Caroline M. ;
de Groot, Bert L. ;
Caflisch, Amedeo ;
Boeckmann, Rainer A. .
NATURE METHODS, 2009, 6 (01) :3-4
[5]   Combining Structural Modeling with Ensemble Machine Learning to Accurately Predict Protein Fold Stability and Binding Affinity Effects upon Mutation [J].
Berliner, Niklas ;
Teyra, Joan ;
Colak, Recep ;
Garcia Lopez, Sebastian ;
Kim, Philip M. .
PLOS ONE, 2014, 9 (09)
[6]   Anatomy of hot spots in protein interfaces [J].
Bogan, AA ;
Thorn, KS .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 280 (01) :1-9
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Predicting the Effect of Mutations on Protein-Protein Binding Interactions through Structure-Based Interface Profiles [J].
Brender, Jeffrey R. ;
Zhang, Yang .
PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (10)
[9]   BeAtMuSiC: prediction of changes in protein-protein binding affinity on mutations [J].
Dehouck, Yves ;
Kwasigroch, Jean Marc ;
Rooman, Marianne ;
Gilis, Dimitri .
NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) :W333-W339
[10]   Unraveling hot spots in binding interfaces: progress and challenges [J].
DeLano, WL .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (01) :14-20