Benchmarking methods and data sets for ligand enrichment assessment in virtual screening

Cited by: 40

Authors:

Xia, Jie ^{[1,2]}

Tilahun, Ermias Lemma ^{[2]}

Reid, Terry-Elinor ^{[2]}

Zhang, Liangren ^{[1]}

Wang, Xiang Simon ^{[2]}

Affiliations:

[1] Peking Univ, Sch Pharmaceut Sci, State Key Lab Nat & Biomimet Drugs, Beijing 100191, Peoples R China

[2] Howard Univ, Coll Pharm, Dist Columbia Dev Ctr AIDS Res DC D CFAR, Mol Mode, Dept Pharmaceut Sci, Lab Cheminformat & Drug Desig, Washington, DC 20059 USA

Source:

METHODS | 2015 / Vol. 71

Funding:

National Institutes of Health (USA);

Keywords:

Benchmarking methodology; Decoy sets; Structure-based virtual screening; Ligand-based virtual screening; Artificial enrichment; Analogue bias; SCORING FUNCTIONS; MOLECULAR DOCKING; DRUG DISCOVERY; ACCURATE DOCKING; PERFORMANCE; OPTIMIZATION; STRATEGIES; SELECTION; AFFINITY; SHAPE;

DOI:

10.1016/j.ymeth.2014.11.015

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification code:

071010 ; 081704 ;

Abstract:

Retrospective small-scale virtual screening (VS) based on benchmarking data sets has been widely used to estimate the ligand enrichment of VS approaches in prospective (i.e., real-world) efforts. However, the intrinsic differences between benchmarking sets and real screening chemical libraries can cause biased assessment. Herein, we summarize the history of benchmarking methods and data sets and highlight three main types of bias found in benchmarking sets, i.e., "analogue bias", "artificial enrichment" and "false negatives". In addition, we introduce our recent algorithm for building maximum-unbiased benchmarking sets applicable to both ligand-based and structure-based VS approaches, and its application to three important human histone deacetylase (HDAC) isoforms, i.e., HDAC1, HDAC6 and HDAC8. Leave-one-out cross-validation (LOO CV) demonstrates that the benchmarking sets built by our algorithm are maximum-unbiased as measured by property matching, ROC curves and AUCs. © 2014 Elsevier Inc. All rights reserved.
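As an illustration of the ROC/AUC measure mentioned in the abstract, the following is a minimal sketch of how an AUC can be computed for a ranked list of actives and decoys. It is not the authors' implementation: the function name `roc_auc`, the convention that a higher score means a better rank, and the toy scores are all assumptions made for this example. An AUC near 0.5 means the scores do not separate actives from decoys; an AUC near 1.0 means actives are ranked almost entirely ahead of decoys.

```python
def roc_auc(scores, labels):
    """ROC AUC via the Mann-Whitney statistic.

    Returns the fraction of (active, decoy) pairs in which the active
    has the higher score; tied pairs count as half.
    Assumption: higher score = ranked earlier; label 1 = active, 0 = decoy.
    """
    actives = [s for s, y in zip(scores, labels) if y == 1]
    decoys = [s for s, y in zip(scores, labels) if y == 0]
    if not actives or not decoys:
        raise ValueError("need at least one active and one decoy")

    wins = 0.0
    for a in actives:
        for d in decoys:
            if a > d:
                wins += 1.0
            elif a == d:
                wins += 0.5
    return wins / (len(actives) * len(decoys))


# Toy data (hypothetical): two actives and four decoys.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4]
toy_labels = [1, 0, 1, 0, 0, 0]
toy_auc = roc_auc(toy_scores, toy_labels)  # 7 of 8 pairs favour the active
```

The pairwise loop is quadratic in the number of compounds, which is fine for a sketch; for large libraries a rank-based formulation or an established statistics library would be the more practical choice.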


Pages: 146-157

Number of pages: 12

References (113 in total):

[1] ICM - A NEW METHOD FOR PROTEIN MODELING AND DESIGN - APPLICATIONS TO DOCKING AND STRUCTURE PREDICTION FROM THE DISTORTED NATIVE CONFORMATION [J].