Mining Chemical Reactions Using Neighborhood Behavior and Condensed Graphs of Reactions Approaches

被引:21
作者
de Luca, Aurelie [1 ]
Horvath, Dragos [1 ]
Marcou, Gilles [1 ]
Solov'ev, Vitaly [1 ,2 ]
Varnek, Alexandre [1 ]
机构
[1] Univ Strasbourg, CNRS, Lab Infochim, UMR7177, F-67008 Strasbourg, France
[2] Russian Acad Sci, Inst Phys Chem & Electrochem, Moscow 119991, Russia
关键词
TRICENTRIC PHARMACOPHORE FINGERPRINTS; GENOME-SCALE CLASSIFICATION; SILICO STRUCTURAL SPACES; VITRO ACTIVITY SPACES; ORGANIC-REACTIONS; METABOLIC REACTIONS; SIMILARITY; FRAGMENT; DESCRIPTORS; PRESERVATION;
D O I
10.1021/ci300149n
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
This work addresses the problem of similarity search and classification of chemical reactions using Neighborhood Behavior (NB) and Condensed Graphs of Reaction (CGR) approaches. The CGR formalism represents chemical reactions as a classical molecular graph with dynamic bonds, enabling descriptor calculations on this graph. Different types of the ISIDA fragment descriptors generated for CGRs in combination with two metrics - Tanimoto and Euclidean - were considered as chemical spaces, to serve for reaction dissimilarity scoring. The NB method has been used to select an optimal combination of descriptors which distinguish different types of chemical reactions in a database containing 8544 reactions of 9 classes. Relevance of NB - analysis has been validated in generic (multiclass) similarity search and in clustering with Self-Organizing Maps (SOM). NB compliant sets of descriptors were shown to display enhanced mapping propensities, allowing the construction of better Self-Organizing Maps and similarity searches (NB and classical similarity search criteria - AUC ROC - correlate at a level of 0.7). The analysis of the SOM clusters proved chemically meaningful CGR substructures representing specific reaction signatures.
引用
收藏
页码:2325 / 2338
页数:14
相关论文
共 49 条
  • [21] InfoClunie, ISIDA QSPR PROGR
  • [22] KASKI S, 1996, P INT C ART NEUR NET, P809
  • [23] Quantitative structure-property relationship modeling of β-cyclodextrin complexation free energies
    Katritzky, AR
    Fara, DC
    Yang, HF
    Karelson, M
    Suzuki, T
    Solov'ev, VP
    Varnek, A
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (02): : 529 - 541
  • [24] Kirt T., 2007, P 11 E EUR C ADV DAT, P107
  • [25] Kohonen T., SOM PAK SELF ORG MAP
  • [26] Genome-scale classification of metabolic reactions: A chemoinformatics approach
    Latino, DARS
    Aires-de-Sousa, J
    [J]. ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2006, 45 (13) : 2066 - 2069
  • [27] Genome-scale classification of metabolic reactions and assignment of EC numbers with self-organizing maps
    Latino, Diogo A. R. S.
    Zhang, Qing-You
    Aires-de-Sousa, Joao
    [J]. BIOINFORMATICS, 2008, 24 (19) : 2236 - 2244
  • [28] Analysis of Neighborhood Behavior in Lead Optimization and Array Design
    Papadatos, George
    Cooper, Anthony W. J.
    Kadirkamanathan, Visakan
    Macdonald, Simon J. F.
    McLay, Iain M.
    Pickett, Stephen D.
    Pritchard, John M.
    Willett, Peter
    Gillet, Valerie J.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (02) : 195 - 208
  • [29] Neighborhood behavior: A useful concept for validation of ''molecular diversity'' descriptors
    Patterson, DE
    Cramer, RD
    Ferguson, AM
    Clark, RD
    Weinberger, LE
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (16) : 3049 - 3059
  • [30] POLANI D, 2001, SPRINGER STUDIES FUZ, P13