Making every SAR point count: the development of Chemistry Connect for the large-scale integration of structure and bioactivity data

被引:60
作者
Muresan, Sorel [1 ]
Petrov, Plamen [1 ]
Southan, Christopher [2 ]
Kjellberg, Magnus J. [3 ]
Koger, Thierry [1 ]
Tyrchan, Christian [3 ]
Varkonyi, Peter [1 ]
Xie, Paul Hongxing [1 ]
机构
[1] AstraZeneca R&D, DECS Computat Sci, S-43183 Molndal, Sweden
[2] AstraZeneca R&D, R&D Informat, S-43183 Molndal, Sweden
[3] CVGI AstraZeneca R&D Molndal, S-43183 Molndal, Sweden
关键词
COMMERCIAL DATABASES; INFORMATION; COMPLEMENTARITY; TAUTOMERISM; MANAGEMENT; KNOWLEDGE; DRUGS;
D O I
10.1016/j.drudis.2011.10.005
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
The increase in drug research output from patent applications, together with the expansion of public data collections, such as ChEMBL and PubChem BioAssay, has made it essential for pharmaceutical companies to integrate both internal and external 'SAR estate'. The AstraZeneca response has been the development of an enterprise application, Chemistry Connect, containing 45 million unique chemical structures from 18 internal and external data sources. It includes merged compound-to-assay-to-result-to-target relationships extracted from patents, papers and internal data. Users can explore connections between these by searching using drug names or synonyms, chemical structures, patent numbers and target protein identifiers at a scale not previously available.
引用
收藏
页码:1019 / 1030
页数:12
相关论文
共 28 条
[1]   Advanced biological and chemical discovery (ABCD): Centralizing discovery knowledge in an inherently decentralized world [J].
Agrafiotis, Dimitris K. ;
Alex, Simson ;
Dai, Heng ;
Derkinderen, An ;
Farnum, Michael ;
Gates, Peter ;
Izrailev, Sergei ;
Jaeger, Edward P. ;
Konstant, Paul ;
Leung, Albert ;
Lobanov, Victor S. ;
Marichal, Patrick ;
Martin, Douglas ;
Rassokhin, Dmitrii N. ;
Shemanarev, Maxim ;
Skalkin, Andrew ;
Stong, John ;
Tabruyn, Tom ;
Vermeiren, Marleen ;
Wan, Jackson ;
Xu, Xiang Yang ;
Yao, Xiang .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (06) :1999-2014
[2]   Exploiting personalized information for reagent selection in drug design [J].
Bostrom, Jonas ;
Falk, Niklas ;
Tyrchan, Christian .
DRUG DISCOVERY TODAY, 2011, 16 (5-6) :181-187
[3]   Thousands of chemical starting points for antimalarial lead identification [J].
Gamo, Francisco-Javier ;
Sanz, Laura M. ;
Vidal, Jaume ;
de Cozar, Cristina ;
Alvarez, Emilio ;
Lavandera, Jose-Luis ;
Vanderwall, Dana E. ;
Green, Darren V. S. ;
Kumar, Vinod ;
Hasan, Samiul ;
Brown, James R. ;
Peishoff, Catherine E. ;
Cardon, Lon R. ;
Garcia-Bustos, Jose F. .
NATURE, 2010, 465 (7296) :305-U56
[4]   Combinatorial compound libraries for drug discovery: An ongoing challenge [J].
Geysen, HM ;
Schoenen, F ;
Wagner, D ;
Wagner, R .
NATURE REVIEWS DRUG DISCOVERY, 2003, 2 (03) :222-230
[5]   Follow-on drugs: How far should chemists look? [J].
Giordanetto, Fabrizio ;
Bostrom, Jonas ;
Tyrchan, Christian .
DRUG DISCOVERY TODAY, 2011, 16 (15-16) :722-732
[6]   Lingos, finite state machines, and fast similarity searching [J].
Grant, J. Andrew ;
Haigh, James A. ;
Pickup, Barry T. ;
Nicholls, Anthony ;
Sayle, Roger A. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (05) :1912-1918
[7]   A dictionary to identify small molecules and drugs in free text [J].
Hettne, Kristina M. ;
Stierum, Rob H. ;
Schuemie, Martijn J. ;
Hendriksen, Peter J. M. ;
Schijvenaars, Bob J. A. ;
van Mulligen, Erik M. ;
Kleinjans, Jos ;
Kors, Jan A. .
BIOINFORMATICS, 2009, 25 (22) :2983-2991
[8]  
HUANG R, 2011, SCI TRANSL MED, V3, pS16
[9]   HASH CODES FOR THE IDENTIFICATION AND CLASSIFICATION OF MOLECULAR-STRUCTURE ELEMENTS [J].
IHLENFELDT, WD ;
GASTEIGER, J .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1994, 15 (08) :793-813
[10]  
Jagarlapudi SARP, 2009, METHODS MOL BIOL, V575, P159, DOI 10.1007/978-1-60761-274-2_6