A Searchable Map of PubChem

被引:51
作者
van Deursen, Ruud [1 ]
Blum, Lorenz C. [1 ]
Reymond, Jean-Louis [1 ]
机构
[1] Univ Bern, Dept Chem & Biochem, CH-3012 Bern, Switzerland
基金
瑞士国家科学基金会;
关键词
CHEMICAL UNIVERSE; VIRTUAL EXPLORATION; DATABASE; SPACE; CLASSIFICATION; INHIBITORS; MOLECULES; CHEMISTRY; DISCOVERY; SYSTEM;
D O I
10.1021/ci100237q
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The database PubChem was classified using 42 integer value descriptors of molecular structure, here called molecular quantum numbers (MQNs), which count atoms and bond types, polar groups, and topological features. Principal component analysis of the MQN data set shows that Pub Chem compounds occupy a partially filled elliptical cone in the (PC1,PC2,PC3) space whose axis is the first principal component PC1 (65% variability) representing molecular size, and the ellipse axes are PC2 (18% variability, representing structural flexibility) and PC3 (7% variability, representing polarity). A visual overview of Pub Chem is provided by color-coded representations of the (PC2,PC3) plane. The MQNs form a scalar fingerprint which can be used to measure the similarity between pairs of molecules and enable ligand-based virtual screening, as illustrated for the enrichment of bioactives from the DUD data set from Pub Chem. An MQN-annotated version of Pub Chem with an MQN-similarity search tool is available at www.gdb.unibe.ch.
引用
收藏
页码:1924 / 1934
页数:11
相关论文
共 35 条
[1]   Honeycomb Carbon: A Review of Graphene [J].
Allen, Matthew J. ;
Tung, Vincent C. ;
Kaner, Richard B. .
CHEMICAL REVIEWS, 2010, 110 (01) :132-145
[2]   Locating biologically active compounds in medium-sized heterogeneous datasets by topological autocorrelation vectors: Dopamine and benzodiazepine agonists [J].
Bauknecht, H ;
Zell, A ;
Bayer, H ;
Levi, P ;
Wagener, M ;
Sadowski, J ;
Gasteiger, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (06) :1205-1213
[3]   970 Million Druglike Small Molecules for Virtual Screening in the Chemical Universe Database GDB-13 [J].
Blum, Lorenz C. ;
Reymond, Jean-Louis .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2009, 131 (25) :8732-+
[4]   A rule of three for fragment-based lead discovery? [J].
Congreve, M ;
Carr, R ;
Murray, C ;
Jhoti, H .
DRUG DISCOVERY TODAY, 2003, 8 (19) :876-877
[5]   Isolation and structure of higher diamondoids, nanometer-sized diamond molecules [J].
Dahl, JE ;
Liu, SG ;
Carlson, RMK .
SCIENCE, 2003, 299 (5603) :96-99
[6]   Virtual exploration of the small-molecule chemical universe below 160 daltons [J].
Fink, T ;
Bruggesser, H ;
Reymond, JL .
ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2005, 44 (10) :1504-1508
[7]   Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: Assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug discovery [J].
Fink, Tobias ;
Reymond, Jean-Louis .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (02) :342-353
[8]  
GARCIADELGADO N, 2010, ACS MED CHEM LETT
[9]   Current Trends in Ligand-Based Virtual Screening: Molecular Representations, Data Mining Methods, New Application Areas, and Performance Evaluation [J].
Geppert, Hanna ;
Vogt, Martin ;
Bajorath, Juergen .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (02) :205-216
[10]   Comparison of shape-matching and docking as virtual screening tools [J].
Hawkins, Paul C. D. ;
Skillman, A. Geoffrey ;
Nicholls, Anthony .
JOURNAL OF MEDICINAL CHEMISTRY, 2007, 50 (01) :74-82