Improving classical scoring functions using random forest: The non-additivity of free energy terms' contributions in binding

被引:23
作者
Afifi, Karim [1 ]
AI-Sadek, Ahmed Farouk [2 ]
机构
[1] Modern Sci & Arts Univ, Dept Comp Sci, Giza, Egypt
[2] Minist Agr & Land Reclamat, Cent Lab Agr Experts Syst, Giza, Egypt
关键词
AUTODOCK; AUTODOCK VINA; docking; drug design; RF-SCORE; scoring; scoring function; virtual screening; X-SCORE; PROTEIN-LIGAND-BINDING; AFFINITY PREDICTION; AUTODOCK VINA; MOLECULAR DOCKING; PDBBIND DATABASE; ACCURACY;
D O I
10.1111/cbdd.13206
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite recent efforts to improve the scoring performance of scoring functions, accurately predicting the binding affinity is still a challenging task. Therefore, different approaches were tried to improve the prediction performance of four scoring functions (X-SCORE, VINA, AUTODOCK, and RF-SCORE) by substituting the linear regression model of classical scoring function by random forest to examine the performance improvement if an additive functional form is not imposed, and by combining different scoring functions into hybrid ones. The datasets were derived from the PDBbind-CN database version 2016. When evaluating the original scoring functions on the generic dataset, RF-SCORE has outperformed classical scoring functions, which shows the superiority of descriptor-based scoring functions. Substituting linear regression as a linear model by random forest as a nonlinear model had largely improved the scoring performance of AUTODOCK and VINA while X-SCORE had only a slight performance increase. All hybrid scoring functions had only a slight improvement-if any-on both of the combined scoring functions, which is not worth the slower calculation time.
引用
收藏
页码:1429 / 1434
页数:6
相关论文
共 20 条
[1]  
[Anonymous], 2011, J CHEMINFORM
[2]   A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking [J].
Ballester, Pedro J. ;
Mitchell, John B. O. .
BIOINFORMATICS, 2010, 26 (09) :1169-1175
[3]   Non-additivity of Functional Group Contributions in Protein Ligand Binding: A Comprehensive Study by Crystallography and Isothermal Titration Calorimetry [J].
Baum, Bernhard ;
Muley, Laveena ;
Smolinski, Michael ;
Heine, Andreas ;
Hangauer, David ;
Klebe, Gerhard .
JOURNAL OF MOLECULAR BIOLOGY, 2010, 397 (04) :1042-1054
[4]   NNScore 2.0: A Neural-Network Receptor-Ligand Scoring Function [J].
Durrant, Jacob D. ;
McCammon, J. Andrew .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (11) :2897-2903
[5]   Advances and Challenges in Protein-Ligand Docking [J].
Huang, Sheng-You ;
Zou, Xiaoqin .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2010, 11 (08) :3016-3034
[6]   A semiempirical free energy force field with charge-based desolvation [J].
Huey, Ruth ;
Morris, Garrett M. ;
Olson, Arthur J. ;
Goodsell, David S. .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2007, 28 (06) :1145-1152
[7]   Lessons Learned in Empirical Scoring with smina from the CSAR 2011 Benchmarking Exercise [J].
Koes, David Ryan ;
Baumgartner, Matthew P. ;
Camacho, Carlos J. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (08) :1893-1904
[8]  
Li H., 2014, BMC BIOINFORMATICS, V15, P56
[9]   Improving AutoDock Vina Using Random Forest: The Growing Accuracy of Binding Affinity Prediction by the Effective Exploitation of Larger Data Sets [J].
Li, Hongjian ;
Leung, Kwong-Sak ;
Wong, Man-Hon ;
Ballester, Pedro J. .
MOLECULAR INFORMATICS, 2015, 34 (2-3) :115-126
[10]   Comparative Assessment of Scoring Functions on an Updated Benchmark: 2. Evaluation Methods and General Results [J].
Li, Yan ;
Han, Li ;
Liu, Zhihai ;
Wang, Renxiao .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (06) :1717-1736