Molecular similarity and diversity in chemoinformatics: From theory to applications

被引:180
作者
Maldonado, AG [1 ]
Doucet, JP [1 ]
Petitjean, M [1 ]
Fan, BT [1 ]
机构
[1] Univ Paris 07, ITODYS, CNRS, UMR 7086, F-75005 Paris, France
关键词
chemoinformatics; classification methods; clustering methods; combinatorial chemistry; compound selection; descriptors; drug design; high throughput screening; library design; molecular diversity; molecular similarity; partitioning; selection methods; similar property principle; validation methods;
D O I
10.1007/s11030-006-8697-1
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This review is dedicated to a survey on molecular similarity and diversity. Key findings reported in recent investigations are selectively highlighted and Summarized. Even if this overview is mainly centered inchemoinformatics, applications in other areas (pharmaceutical and medical chemistry, combinatorial chemistry, chemical databases management, etc.) are also introduced. The approaches used to define and descript the concepts of molecular similarity and diversity in the context of chemoinformatics are discussed in the first part of this review. We introduce, in the second and third parts, the descriptions and analyses of different methods and techniques. Finally, current applications and problems are enumerated and discussed in the last part.
引用
收藏
页码:39 / 79
页数:41
相关论文
共 326 条
  • [1] Combinatorial informatics in the post-genomics era
    Agrafiotis, DK
    Lobanov, VS
    Salemme, FR
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (05) : 337 - 346
  • [2] Feature selection for structure-activity correlation using binary particle swarms
    Agrafiotis, DK
    Cedeño, W
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2002, 45 (05) : 1098 - 1107
  • [3] On the use of neural network ensembles in QSAR and QSPR
    Agrafiotis, DK
    Cedeño, W
    Lobanov, VS
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (04): : 903 - 911
  • [4] On the use of information theory for assessing molecular diversity
    Agrafiotis, DK
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (03): : 576 - 580
  • [5] Chirality codes and molecular structure
    Aires-De-Sousa, J
    Gasteiger, J
    Gutman, I
    Vidovic, DI
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03): : 831 - 836
  • [6] A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening
    Amendolia, SR
    Cossu, G
    Ganadu, ML
    Golosio, B
    Masala, GL
    Mura, GM
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2003, 69 (1-2) : 13 - 20
  • [7] Applications of maximum likelihood principal component analysis: incomplete data sets and calibration transfer
    Andrews, DT
    Wentzell, PD
    [J]. ANALYTICA CHIMICA ACTA, 1997, 350 (03) : 341 - 352
  • [8] [Anonymous], TOPOLOGICAL INDICES
  • [9] SitePrint: Three-dimensional pharmacophore descriptors derived from protein binding sites for family based active site analysis, classification, and drug design
    Arnold, JR
    Burdick, KW
    Pegg, SCH
    Toba, S
    Lamb, ML
    Kuntz, ID
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06): : 2190 - 2198
  • [10] SYBYL line notation (SLN): A versatile language for chemical structure representation
    Ash, S
    Cline, MA
    Homer, RW
    Hurst, T
    Smith, GB
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01): : 71 - 79