Comparative Assessment of Scoring Functions on an Updated Benchmark: 1. Compilation of the Test Set

被引:160
作者
Li, Yan [1 ]
Liu, Zhihai [1 ]
Li, Jie [1 ]
Han, Li [1 ]
Liu, Jie [1 ]
Zhao, Zhixiong [1 ]
Wang, Renxiao [1 ,2 ]
机构
[1] Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan & Nat Prod Chem, 345 Lingling Rd, Shanghai 200032, Peoples R China
[2] Macau Univ Sci & Technol, Macau Inst Appl Res Med & Hlth, State Key Lab Qual Res Chinese Med, Macau, Peoples R China
关键词
PROTEIN-LIGAND COMPLEXES; DRUG DISCOVERY; WATER-MOLECULES; CRYSTAL-STRUCTURES; DOCKING PROGRAMS; PDBBIND DATABASE; RECOGNITION; EFFICIENCY; EXERCISE; AFFINITY;
D O I
10.1021/ci500080q
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Scoring functions are often applied in combination with molecular docking methods to predict ligand binding poses and ligand binding affinities or to identify active compounds through virtual screening. An objective benchmark for assessing the performance of current scoring functions is expected to provide practical guidance for the users to make smart choices among available methods. It can also elucidate the common weakness in current methods for future improvements. The primary goal of our comparative assessment of scoring functions (CASF) project is to provide a high-standard, publicly accessible benchmark of this type. Our latest study, i.e., CASF-2013, evaluated 20 popular scoring functions on an updated set of protein-ligand complexes. This data set was selected out of 8302 protein-ligand complexes recorded in the PDBbind database (version 2013) through a fairly complicated process. Sample selection was made by considering the quality of complex structures as well as binding data. Finally, qualified complexes were clustered by 90% similarity in protein sequences. Three representative complexes were chosen from each cluster to control sample redundancy. The final outcome, namely, the PDBbind core set (version 2013), consists of 195 protein ligand complexes in 65 clusters with binding constants spanning nearly 10 orders of magnitude. In this data set, 82% of the ligand molecules are "druglike" and 78% of the protein molecules are validated or potential drug targets. Correlation between binding constants and several key properties of ligands are discussed. Methods and results of the scoring function evaluation will be described in a companion work in this issue (doi: 10.1021/ci500081m).
引用
收藏
页码:1700 / +
页数:18
相关论文
共 65 条
  • [1] Ligand efficiency indices as guideposts for drug discovery
    Abad-Zapatero, C
    Metz, JT
    [J]. DRUG DISCOVERY TODAY, 2005, 10 (07) : 464 - 469
  • [2] Robust classification of "Relevant" water molecules in putative protein binding sites
    Amadasi, Alessio
    Surface, J. Andrew
    Spyrakis, Francesca
    Cozzini, Pietro
    Mozzarelli, Andrea
    Kellogg, Glen E.
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2008, 51 (04) : 1063 - 1067
  • [3] Molecular recognition of protein-ligand complexes: Applications to drug design
    Babine, RE
    Bender, SL
    [J]. CHEMICAL REVIEWS, 1997, 97 (05) : 1359 - 1472
  • [4] Classification of water molecules in protein binding sites
    Barillari, Caterina
    Taylor, Justine
    Viner, Russell
    Essex, Jonathan W.
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2007, 129 (09) : 2577 - 2587
  • [5] Ligand efficiency and fragment-based drug discovery
    Bembenek, Scott D.
    Tounge, Brett A.
    Reynolds, Chades H.
    [J]. DRUG DISCOVERY TODAY, 2009, 14 (5-6) : 278 - 283
  • [6] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [7] Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations
    Bissantz, C
    Folkers, G
    Rognan, D
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (25) : 4759 - 4767
  • [8] Rearrangement of Cruickshank's formulae for the diffraction-component precision index
    Blow, DM
    [J]. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2002, 58 : 792 - 797
  • [9] The use of scoring functions in drug discovery applications
    Böhm, HJ
    Stahl, M
    [J]. REVIEWS IN COMPUTATIONAL CHEMISTRY, VOL 18, 2002, 18 : 41 - 87
  • [10] VAN DER WAALS VOLUMES + RADII
    BONDI, A
    [J]. JOURNAL OF PHYSICAL CHEMISTRY, 1964, 68 (03) : 441 - +