Towards a gold standard: regarding quality in public domain chemistry databases and approaches to improving the situation

被引:84
作者
Williams, Antony J. [1 ]
Ekins, Sean [2 ]
Tkachenko, Valery [1 ]
机构
[1] Royal Soc Chem, US Off, Wake Forest, NC 27587 USA
[2] Collaborat Chem, Fuquay Varina, NC 27526 USA
关键词
VALIDATION PRACTICES; ERROR-DETECTION; DRUG TARGETS; INFORMATION; ANNOTATION; TOXICOLOGY; LIBRARIES; ONTOLOGY; CURATION; DESIGN;
D O I
10.1016/j.drudis.2012.02.013
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
In recent years there has been a dramatic increase in the number of freely accessible online databases serving the chemistry community. The internet provides chemistry data that can be used for data-mining, for computer models, and integration into systems to aid drug discovery. There is however a responsibility to ensure that the data are high quality to ensure that time is not wasted in erroneous searches, that models are underpinned by accurate data and that improved discoverability of online resources is not marred by incorrect data. In this article we provide an overview of some of the experiences of the authors using online chemical compound databases, critique the approaches taken to assemble data and we suggest approaches to deliver definitive reference data sources.
引用
收藏
页码:685 / 701
页数:17
相关论文
共 39 条
[1]  
[Anonymous], COLLABORATIVE COMPUT
[2]   Open-access chemistry databases evolving slowly but not surely [J].
Baker, Monya .
NATURE REVIEWS DRUG DISCOVERY, 2006, 5 (09) :707-708
[3]  
Bell AW, 2009, NAT METHODS, V6, P423, DOI [10.1038/NMETH.1333, 10.1038/nmeth.1333]
[4]   Minimum information about a microarray experiment (MIAME) - toward standards for microarray data [J].
Brazma, A ;
Hingamp, P ;
Quackenbush, J ;
Sherlock, G ;
Spellman, P ;
Stoeckert, C ;
Aach, J ;
Ansorge, W ;
Ball, CA ;
Causton, HC ;
Gaasterland, T ;
Glenisson, P ;
Holstege, FCP ;
Kim, IF ;
Markowitz, V ;
Matese, JC ;
Parkinson, H ;
Robinson, A ;
Sarkans, U ;
Schulze-Kremer, S ;
Stewart, J ;
Taylor, R ;
Vilo, J ;
Vingron, M .
NATURE GENETICS, 2001, 29 (04) :365-371
[5]  
BRZUSTOWICZ LM, 1993, AM J HUM GENET, V53, P1137
[6]   An empirical assessment of validation practices for molecular classifiers [J].
Castaldi, Peter J. ;
Dahabreh, Issa J. ;
Ioannidis, John P. A. .
BRIEFINGS IN BIOINFORMATICS, 2011, 12 (03) :189-202
[7]   Toward automated biochemotype annotation for large compound libraries [J].
Chen, Xian ;
Liang, Yizeng ;
Xu, Jun .
MOLECULAR DIVERSITY, 2006, 10 (03) :495-509
[8]   Applying modern error theory to the problem of missed injuries in trauma [J].
Clarke, D. L. ;
Gouveia, J. ;
Thomson, S. R. ;
Muckart, D. J. J. .
WORLD JOURNAL OF SURGERY, 2008, 32 (06) :1176-1182
[9]   Limitations and lessons in the use of X-ray structural information in drug design [J].
Davis, Andrew M. ;
St-Gallay, Stephen A. ;
Kleywegt, Gerard J. .
DRUG DISCOVERY TODAY, 2008, 13 (19-20) :831-841
[10]   The ToxCast program for prioritizing toxicity testing of environmental chemicals [J].
Dix, David J. ;
Houck, Keith A. ;
Martin, Matthew T. ;
Richard, Ann M. ;
Setzer, R. Woodrow ;
Kavlock, Robert J. .
TOXICOLOGICAL SCIENCES, 2007, 95 (01) :5-12