Decoys Selection in Benchmarking Datasets: Overview and Perspectives

被引:64
作者
Reau, Manon [1 ]
Langenfeld, Florent [1 ]
Zagury, Jean-Francois [1 ]
Lagarde, Nathalie [1 ]
Montes, Matthieu [1 ]
机构
[1] Conservatoire Natl Arts & Metiers, Lab GBA, EA4627, Paris, France
关键词
virtual screening; benchmarking databases; benchmarking; decoy; structure-based drug design; ligand-based drug design; PROTEIN-COUPLED RECEPTORS; HIGH-THROUGHPUT DOCKING; MOLECULAR DOCKING; SCORING FUNCTION; DRUG DISCOVERY; PHARMACOPHORE MODELS; LIGAND INTERACTIONS; BINDING AFFINITIES; CHEMICAL DATABASES; AUTOMATED DOCKING;
D O I
10.3389/fphar.2018.00011
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Virtual Screening (VS) is designed to prospectively help identifying potential hits, i.e., compounds capable of interacting with a given target and potentially modulate its activity, out of large compound collections. Among the variety of methodologies, it is crucial to select the protocol that is the most adapted to the query/target system under study and that yields the most reliable output. To this aim, the performance of VS methods is commonly evaluated and compared by computing their ability to retrieve active compounds in benchmarking datasets. The benchmarking datasets contain a subset of known active compounds together with a subset of decoys, i.e., assumed non-active molecules. The composition of both the active and the decoy compounds subsets is critical to limit the biases in the evaluation of the VS methods. In this review, we focus on the selection of decoy compounds that has considerably changed over the years, from randomly selected compounds to highly customized or experimentally validated negative compounds. We first outline the evolution of decoys selection in benchmarking databases as well as current benchmarking databases that tend to minimize the introduction of biases, and secondly, we propose recommendations for the selection and the design of benchmarking datasets.
引用
收藏
页数:15
相关论文
共 95 条
[1]   Ligand-Based Virtual Screening Using Bayesian Networks [J].
Abdo, Ammar ;
Chen, Beining ;
Mueller, Christoph ;
Salim, Naomie ;
Willett, Peter .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (06) :1012-1020
[2]   Identification of a Novel Class of BRD4 Inhibitors by Computational Screening and Binding Simulations [J].
Allen, Bryce K. ;
Mehta, Saurabh ;
Ember, Stuart W. J. ;
Zhu, Jin-Yi ;
Schonbrunn, Ernst ;
Ayad, Nagi G. ;
Schurer, Stephan C. .
ACS OMEGA, 2017, 2 (08) :4760-4771
[3]   Large-Scale Computational Screening Identifies First in Class Multitarget Inhibitor of EGFR Kinase and BRD4 [J].
Allen, Bryce K. ;
Mehta, Saurabh ;
Ember, Stewart W. J. ;
Schonbrunn, Ernst ;
Ayad, Nagi ;
Schuerer, Stephan C. .
SCIENTIFIC REPORTS, 2015, 5
[4]   Characterization of chemical libraries for luciferase inhibitory activity [J].
Auld, Douglas S. ;
Southall, Noel T. ;
Jadhav, Ajit ;
Johnson, Ronald L. ;
Diller, David J. ;
Simeonov, Anton ;
Austin, Christopher P. ;
Inglese, James .
JOURNAL OF MEDICINAL CHEMISTRY, 2008, 51 (08) :2372-2386
[5]  
Bauer MR, 2013, J CHEM INF MODEL, V53, P1447, DOI [10.1021/ci400115b, 10.1021/ci400115bl]
[6]   The properties of known drugs .1. Molecular frameworks [J].
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) :2887-2893
[7]   Multiple Structures for Virtual Ligand Screening: Defining Binding Site Properties-Based Criteria to Optimize the Selection of the Query [J].
Ben Nasr, Nesrine ;
Guillemain, Helene ;
Lagarde, Nathalie ;
Zagury, Jean-Francois ;
Montes, Matthieu .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (02) :293-311
[8]   Binding MOAD, a high-quality protein-ligand database [J].
Benson, Mark L. ;
Smith, Richard D. ;
Khazanov, Nickolay A. ;
Dimcheff, Brandon ;
Beaver, John ;
Dresslar, Peter ;
Nerothin, Jason ;
Carlson, Heather A. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D674-D678
[9]   The ChEMBL bioactivity database: an update [J].
Bento, A. Patricia ;
Gaulton, Anna ;
Hersey, Anne ;
Bellis, Louisa J. ;
Chambers, Jon ;
Davies, Mark ;
Krueger, Felix A. ;
Light, Yvonne ;
Mak, Lora ;
McGlinchey, Shaun ;
Nowotka, Michal ;
Papadatos, George ;
Santos, Rita ;
Overington, John P. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D1083-D1090
[10]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242