Prediction of protein-protein binding affinity using diverse protein-protein interface features

被引:8
作者
Ma, Duo [1 ]
Guo, Yanzhi [1 ]
Luo, Jiesi [1 ]
Pu, Xuemei [1 ]
Li, Menglong [1 ]
机构
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein-protein interaction; Binding; Affinity prediction; Random forest; Feature importance evaluation; FREE-ENERGY CALCULATIONS; EVOLUTIONARY CONSERVATION; COMPUTATIONAL DESIGN; MEAN FORCE; HOT-SPOTS; LIGAND; FLEXIBILITY; TRANSIENT;
D O I
10.1016/j.chemolab.2014.07.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Protein-protein interactions play fundamental roles in almost all biological processes. Determining the protein-protein binding affinity has been recognized not only as an important step but also as a challenging task for further understanding of the molecular mechanism and the modeling of the biological systems. Unlike the traditional methods like empirical scoring algorithms and molecular dynamic which are time consuming, we developed a fast and reliable machine learning method for the prediction of protein-protein binding affinity. Based on diverse protein-protein interface features calculated using commonly used available tools, 432 features were obtained to represent hydrogen bond, Van der Waals force, hydrophobic interaction, electrostatic force, interface shape and configuration and allosteric effect. Considering the limited number of the available structures and affinity-known protein complexes, in order to avoid overfitting and remove noises in the feature set, feature importance evaluation was implemented and 154 optimal features were selected, then the prediction model based on random forest (RF) was constructed. We demonstrate that the RE model yields promising results and the predictive power of our method is better than other existing methods. Using leave-one-out cross-validation, our model gives a correlation coefficient (r) of 0.708 on the whole benchmark dataset of 133 complexes and a high r of 0.806 on the validated set of 53 samples. When performing the same two independent datasets, our method outperforms other two methods and achieves a high r of 0.793 and 0.907 respectively. All results indicate that our method can be a useful implement in determining protein-protein binding affinity. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:7 / 13
页数:7
相关论文
共 57 条
[31]   Predicting permanent and transient protein-protein interfaces [J].
La, David ;
Kong, Misun ;
Hoffman, William ;
Choi, Youn Im ;
Kihara, Daisuke .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2013, 81 (05) :805-818
[32]  
Li YC, 2008, PROTEIN PEPTIDE LETT, V15, P223
[33]   A fast empirical approach to binding free energy calculations based on protein interface information [J].
Ma, XH ;
Wang, CX ;
Li, CH ;
Chen, WZ .
PROTEIN ENGINEERING, 2002, 15 (08) :677-681
[34]   Structure, function, and evolution of transient and obligate protein-protein interactions [J].
Mintseris, J ;
Weng, ZP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (31) :10930-10935
[35]   Protein-protein binding affinity prediction on a diverse set of structures [J].
Moal, Iain H. ;
Agius, Rudi ;
Bates, Paul A. .
BIOINFORMATICS, 2011, 27 (21) :3002-3009
[36]   Hot spots-A review of the protein-protein interface determinant amino-acid residues [J].
Moreira, Irina S. ;
Fernandes, Pedro A. ;
Ramos, Maria J. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 68 (04) :803-812
[37]   Diversity of protein-protein interactions [J].
Nooren, IMA ;
Thornton, JM .
EMBO JOURNAL, 2003, 22 (14) :3486-3492
[38]   The modular architecture of protein-protein binding interfaces [J].
Reichmann, D ;
Rahat, O ;
Albeck, S ;
Meged, R ;
Dym, O ;
Schreiber, G .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (01) :57-62
[39]   Conserved patterns of protein interaction in multiple species [J].
Sharan, R ;
Suthram, S ;
Kelley, RM ;
Kuhn, T ;
McCuine, S ;
Uetz, P ;
Sittler, T ;
Karp, RM ;
Ideker, T .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (06) :1974-1979
[40]   The relationship between the flexibility of proteins and their conformational states on forming protein-protein complexes with an application to protein-protein docking [J].
Smith, GR ;
Sternberg, MJE ;
Bates, PA .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 347 (05) :1077-1101