Improving classical scoring functions using random forest: The non-additivity of free energy terms' contributions in binding

被引:23
作者
Afifi, Karim [1 ]
AI-Sadek, Ahmed Farouk [2 ]
机构
[1] Modern Sci & Arts Univ, Dept Comp Sci, Giza, Egypt
[2] Minist Agr & Land Reclamat, Cent Lab Agr Experts Syst, Giza, Egypt
关键词
AUTODOCK; AUTODOCK VINA; docking; drug design; RF-SCORE; scoring; scoring function; virtual screening; X-SCORE; PROTEIN-LIGAND-BINDING; AFFINITY PREDICTION; AUTODOCK VINA; MOLECULAR DOCKING; PDBBIND DATABASE; ACCURACY;
D O I
10.1111/cbdd.13206
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite recent efforts to improve the scoring performance of scoring functions, accurately predicting the binding affinity is still a challenging task. Therefore, different approaches were tried to improve the prediction performance of four scoring functions (X-SCORE, VINA, AUTODOCK, and RF-SCORE) by substituting the linear regression model of classical scoring function by random forest to examine the performance improvement if an additive functional form is not imposed, and by combining different scoring functions into hybrid ones. The datasets were derived from the PDBbind-CN database version 2016. When evaluating the original scoring functions on the generic dataset, RF-SCORE has outperformed classical scoring functions, which shows the superiority of descriptor-based scoring functions. Substituting linear regression as a linear model by random forest as a nonlinear model had largely improved the scoring performance of AUTODOCK and VINA while X-SCORE had only a slight performance increase. All hybrid scoring functions had only a slight improvement-if any-on both of the combined scoring functions, which is not worth the slower calculation time.
引用
收藏
页码:1429 / 1434
页数:6
相关论文
共 20 条
  • [1] [Anonymous], 2011, J CHEMINFORM
  • [2] A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking
    Ballester, Pedro J.
    Mitchell, John B. O.
    [J]. BIOINFORMATICS, 2010, 26 (09) : 1169 - 1175
  • [3] Non-additivity of Functional Group Contributions in Protein Ligand Binding: A Comprehensive Study by Crystallography and Isothermal Titration Calorimetry
    Baum, Bernhard
    Muley, Laveena
    Smolinski, Michael
    Heine, Andreas
    Hangauer, David
    Klebe, Gerhard
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2010, 397 (04) : 1042 - 1054
  • [4] NNScore 2.0: A Neural-Network Receptor-Ligand Scoring Function
    Durrant, Jacob D.
    McCammon, J. Andrew
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (11) : 2897 - 2903
  • [5] Advances and Challenges in Protein-Ligand Docking
    Huang, Sheng-You
    Zou, Xiaoqin
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2010, 11 (08) : 3016 - 3034
  • [6] A semiempirical free energy force field with charge-based desolvation
    Huey, Ruth
    Morris, Garrett M.
    Olson, Arthur J.
    Goodsell, David S.
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2007, 28 (06) : 1145 - 1152
  • [7] Lessons Learned in Empirical Scoring with smina from the CSAR 2011 Benchmarking Exercise
    Koes, David Ryan
    Baumgartner, Matthew P.
    Camacho, Carlos J.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (08) : 1893 - 1904
  • [8] Li H., 2014, BMC BIOINFORMATICS, V15, P56
  • [9] Improving AutoDock Vina Using Random Forest: The Growing Accuracy of Binding Affinity Prediction by the Effective Exploitation of Larger Data Sets
    Li, Hongjian
    Leung, Kwong-Sak
    Wong, Man-Hon
    Ballester, Pedro J.
    [J]. MOLECULAR INFORMATICS, 2015, 34 (2-3) : 115 - 126
  • [10] Comparative Assessment of Scoring Functions on an Updated Benchmark: 2. Evaluation Methods and General Results
    Li, Yan
    Han, Li
    Liu, Zhihai
    Wang, Renxiao
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (06) : 1717 - 1736