Mining Chemical Reactions Using Neighborhood Behavior and Condensed Graphs of Reactions Approaches

被引:21
作者
de Luca, Aurelie [1 ]
Horvath, Dragos [1 ]
Marcou, Gilles [1 ]
Solov'ev, Vitaly [1 ,2 ]
Varnek, Alexandre [1 ]
机构
[1] Univ Strasbourg, CNRS, Lab Infochim, UMR7177, F-67008 Strasbourg, France
[2] Russian Acad Sci, Inst Phys Chem & Electrochem, Moscow 119991, Russia
关键词
TRICENTRIC PHARMACOPHORE FINGERPRINTS; GENOME-SCALE CLASSIFICATION; SILICO STRUCTURAL SPACES; VITRO ACTIVITY SPACES; ORGANIC-REACTIONS; METABOLIC REACTIONS; SIMILARITY; FRAGMENT; DESCRIPTORS; PRESERVATION;
D O I
10.1021/ci300149n
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
This work addresses the problem of similarity search and classification of chemical reactions using Neighborhood Behavior (NB) and Condensed Graphs of Reaction (CGR) approaches. The CGR formalism represents chemical reactions as a classical molecular graph with dynamic bonds, enabling descriptor calculations on this graph. Different types of the ISIDA fragment descriptors generated for CGRs in combination with two metrics - Tanimoto and Euclidean - were considered as chemical spaces, to serve for reaction dissimilarity scoring. The NB method has been used to select an optimal combination of descriptors which distinguish different types of chemical reactions in a database containing 8544 reactions of 9 classes. Relevance of NB - analysis has been validated in generic (multiclass) similarity search and in clustering with Self-Organizing Maps (SOM). NB compliant sets of descriptors were shown to display enhanced mapping propensities, allowing the construction of better Self-Organizing Maps and similarity searches (NB and classical similarity search criteria - AUC ROC - correlate at a level of 0.7). The analysis of the SOM clusters proved chemically meaningful CGR substructures representing specific reaction signatures.
引用
收藏
页码:2325 / 2338
页数:14
相关论文
共 49 条
  • [1] [Anonymous], CONDENSED GRAPH REAC
  • [2] BENZOCYCLO-OCTENES .5. THERMAL REARRANGEMENTS OF BENZOCYCLO-OCTENES TO BENZO[A]CYCLOPROPA[C,D]PENTALENES
    BARTON, JW
    SHEPHERD, MK
    [J]. JOURNAL OF THE CHEMICAL SOCIETY-PERKIN TRANSACTIONS 1, 1986, (06): : 961 - 966
  • [3] QUANTIFYING THE NEIGHBORHOOD PRESERVATION OF SELF-ORGANIZING FEATURE MAPS
    BAUER, HU
    PAWELZIK, KR
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (04): : 570 - 579
  • [4] Neural maps and topographic vector quantization
    Bauer, HU
    Herrmann, M
    Villmann, T
    [J]. NEURAL NETWORKS, 1999, 12 (4-5) : 659 - 676
  • [5] Fuzzy tricentric pharmacophore fingerprints.: 2.: application of topological fuzzy pharmacophore triplets in quantitative structure-activity relationships
    Bonachera, Fanny
    Horvath, Dragos
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (02) : 409 - 425
  • [6] Fuzzy tricentric pharmacophore fingerprints.: 1.: Topological fuzzy pharmacophore triplets and adapted molecular similarity scoring schemes
    Bonachera, Fanny
    Parent, Benjamin
    Barbosa, Frederique
    Froloff, Nicolas
    Horvath, Dragos
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (06) : 2457 - 2477
  • [7] A new statistical approach to predicting aromatic hydroxylation sites. Comparison with model-based approaches
    Borodina, Y
    Rudik, A
    Filimonov, D
    Kharchevnikova, N
    Dmitriev, A
    Blinova, V
    Porolkov, V
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06): : 1998 - 2009
  • [8] Knowledge discovery in reaction databases: Landscaping organic reactions by a self-organizing neural network
    Chen, LR
    Gasteiger, J
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1997, 119 (17) : 4033 - 4042
  • [9] Daylight Chemical Information Systems I, FING SCREEN SIM
  • [10] One-dimensional molecular representations and similarity calculations: Methodology and validation
    Dixon, SL
    Merz, KM
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (23) : 3795 - 3809