Open-source platform to benchmark fingerprints for ligand-based virtual screening

被引:241
作者
Riniker, Sereina [1 ]
Landrum, Gregory A. [1 ]
机构
[1] Novartis Inst BioMed Res, Basel, Switzerland
关键词
Virtual screening; Benchmark; Similarity; Fingerprints; MOLECULAR DESCRIPTOR; SIMILARITY; VALIDATION; METRICS; SETS;
D O I
10.1186/1758-2946-5-26
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Similarity-search methods using molecular fingerprints are an important tool for ligand-based virtual screening. A huge variety of fingerprints exist and their performance, usually assessed in retrospective benchmarking studies using data sets with known actives and known or assumed inactives, depends largely on the validation data sets used and the similarity measure used. Comparing new methods to existing ones in any systematic way is rather difficult due to the lack of standard data sets and evaluation procedures. Here, we present a standard platform for the benchmarking of 2D fingerprints. The open-source platform contains all source code, structural data for the actives and inactives used (drawn from three publicly available collections of data sets), and lists of randomly selected query molecules to be used for statistically valid comparisons of methods. This allows the exact reproduction and comparison of results for future studies. The results for 12 standard fingerprints together with two simple baseline fingerprints assessed by seven evaluation methods are shown together with the correlations between methods. High correlations were found between the 12 fingerprints and a careful statistical analysis showed that only the two baseline fingerprints were different from the others in a statistically significant way. High correlations were also found between six of the seven evaluation methods, indicating that despite their seeming differences, many of these methods are similar to each other.
引用
收藏
页数:17
相关论文
共 47 条
[1]  
[Anonymous], 2011, MACCS STRUCT KEVS
[2]  
[Anonymous], 2010, R LANG ENV STAT COMP
[3]   The properties of known drugs .1. Molecular frameworks [J].
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) :2887-2893
[4]   Molecular similarity: a key technique in molecular informatics [J].
Bender, A ;
Glen, RC .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) :3204-3218
[5]   How similar are those molecules after all? Use two descriptors and you will have three different answers [J].
Bender, Andreas .
EXPERT OPINION ON DRUG DISCOVERY, 2010, 5 (12) :1141-1151
[6]   How Similar Are Similarity Searching Methods? A Principal Component Analysis of Molecular Descriptor Space [J].
Bender, Andreas ;
Jenkins, Jeremy L. ;
Scheiber, Josef ;
Sukuru, Sai Chelan K. ;
Glick, Meir ;
Davies, John W. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (01) :108-119
[7]   On scaffolds and hopping in medicinal chemistry [J].
Brown, Nathan ;
Jacoby, Edgar .
MINI-REVIEWS IN MEDICINAL CHEMISTRY, 2006, 6 (11) :1217-1229
[8]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[9]   Multiple hypothesis testing in microarray experiments [J].
Dudoit, S ;
Shaffer, JP ;
Boldrick, JC .
STATISTICAL SCIENCE, 2003, 18 (01) :71-103