Progress in visual representations of chemical space

被引:56
作者
Osolodkin, Dmitry I. [1 ,2 ]
Radchenko, Eugene V. [1 ,3 ]
Orlov, Alexey A. [1 ,2 ]
Voronkov, Andrey E. [1 ,4 ,5 ]
Palyulin, Vladimir A. [1 ,3 ]
Zefirov, Nikolay S. [1 ,3 ]
机构
[1] Moscow MV Lomonosov State Univ, Dept Chem, Moscow 119991, Russia
[2] Chumakov Inst Poliomyelitis & Viral Encephalitid, Moscow 142782, Russia
[3] RAS, Inst Physiol Act Cpds, Chernogolovka 142432, Moscow Region, Russia
[4] Digital BioPharm Ltd, London EC1V 4PW, England
[5] Moscow Inst Phys & Technol, Dolgoprudnyi 141700, Moscow Region, Russia
关键词
big data; chemical space; chemoinformatics; data mining; visualization; NONLINEAR DIMENSIONALITY REDUCTION; SELF-ORGANIZING MAPS; DATA VISUALIZATION; DRUG DISCOVERY; MEDICINAL CHEMISTRY; COMPOUND LIBRARIES; CHES-MAPPER; BIG DATA; CLASSIFICATION; CHEMOGRAPHY;
D O I
10.1517/17460441.2015.1060216
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Introduction: The concept of 'chemical space' reveals itself in two forms: the discrete set of all possible molecules, and multi-dimensional descriptor space encompassing all the possible molecules. Approaches based on this concept are widely used for the analysis and enumeration of compound databases, library design, and structure-activity relationships (SAR) and landscape studies. Visual representations of chemical space differ in their applicability domains and features and require expert knowledge for choosing the right tool for a particular problem. Areas covered: In this review, the authors present recent advances in visualization of the chemical space in the framework of current general understanding of this topic. Attention is given to such methods as van Krevelen diagrams, descriptor plots, principal components analysis (PCA), self-organizing maps (SOM), generative topographic mapping (GTM), graph and network-based approaches. Notable application examples are provided. Expert opinion: With the growth of computational power, representations of large datasets are becoming more and more common instruments in the toolboxes of chemoinformaticians. Every scientist in the field can find the method of choice for a particular task. However, there is no universal reference representation of the chemical space currently available and expert knowledge is required.
引用
收藏
页码:959 / 973
页数:15
相关论文
共 111 条
[31]   Using the Golden Triangle to optimize clearance and oral absorption [J].
Johnson, Ted W. ;
Dress, Klaus R. ;
Edwards, Martin .
BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2009, 19 (19) :5560-5564
[32]   Predicting new molecular targets for known drugs [J].
Keiser, Michael J. ;
Setola, Vincent ;
Irwin, John J. ;
Laggner, Christian ;
Abbas, Atheir I. ;
Hufeisen, Sandra J. ;
Jensen, Niels H. ;
Kuijer, Michael B. ;
Matos, Roberto C. ;
Tran, Thuy B. ;
Whaley, Ryan ;
Glennon, Richard A. ;
Hert, Jerome ;
Thomas, Kelan L. H. ;
Edwards, Douglas D. ;
Shoichet, Brian K. ;
Roth, Bryan L. .
NATURE, 2009, 462 (7270) :175-U48
[33]   Graphical method for analysis of ultrahigh-resolution broadband mass spectra of natural organic matter, the van Krevelen diagram [J].
Kim, S ;
Kramer, RW ;
Hatcher, PG .
ANALYTICAL CHEMISTRY, 2003, 75 (20) :5336-5344
[34]   Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure-Activity Modeling and Dataset Comparison [J].
Kireeva, N. ;
Baskin, I. I. ;
Gaspar, H. A. ;
Horvath, D. ;
Marcou, G. ;
Varnek, A. .
MOLECULAR INFORMATICS, 2012, 31 (3-4) :301-312
[35]   Toward Navigating Chemical Space of Ionic Liquids: Prediction of Melting Points Using Generative Topographic Maps [J].
Kireeva, Natalia ;
Kuznetsov, Sergey L. ;
Tsivadze, Aslan Yu .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2012, 51 (44) :14337-14343
[36]   Nonlinear Dimensionality Reduction for Visualizing Toxicity Data: Distance- Based Versus Topology- Based Approaches [J].
Kireeva, Natalia V. ;
Ovchinnikova, Svetlana I. ;
Tetko, Igor V. ;
Asiri, Abdullah M. ;
Balakin, Konstantin V. ;
Tsivadze, Aslan Yu. .
CHEMMEDCHEM, 2014, 9 (05) :1047-1059
[37]   Visual Analysis of Biological Activity Data with Scaffold Hunter [J].
Klein, Karsten ;
Koch, Oliver ;
Kriege, Nils ;
Mutzel, Petra ;
Schaefer, Till .
MOLECULAR INFORMATICS, 2013, 32 (11-12) :964-975
[38]   SELF-ORGANIZED FORMATION OF TOPOLOGICALLY CORRECT FEATURE MAPS [J].
KOHONEN, T .
BIOLOGICAL CYBERNETICS, 1982, 43 (01) :59-69
[39]   Scaffold Diversity of Exemplified Medicinal Chemistry Space [J].
Langdon, Sarah R. ;
Brown, Nathan ;
Blagg, Julian .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (09) :2174-2185
[40]   Expanding the ChemGPS chemical space with natural products [J].
Larsson, J ;
Gottfries, J ;
Bohlin, L ;
Backlund, A .
JOURNAL OF NATURAL PRODUCTS, 2005, 68 (07) :985-991