Molecular similarity and diversity in chemoinformatics: From theory to applications

被引:182
作者
Maldonado, AG [1 ]
Doucet, JP [1 ]
Petitjean, M [1 ]
Fan, BT [1 ]
机构
[1] Univ Paris 07, ITODYS, CNRS, UMR 7086, F-75005 Paris, France
关键词
chemoinformatics; classification methods; clustering methods; combinatorial chemistry; compound selection; descriptors; drug design; high throughput screening; library design; molecular diversity; molecular similarity; partitioning; selection methods; similar property principle; validation methods;
D O I
10.1007/s11030-006-8697-1
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This review is dedicated to a survey on molecular similarity and diversity. Key findings reported in recent investigations are selectively highlighted and Summarized. Even if this overview is mainly centered inchemoinformatics, applications in other areas (pharmaceutical and medical chemistry, combinatorial chemistry, chemical databases management, etc.) are also introduced. The approaches used to define and descript the concepts of molecular similarity and diversity in the context of chemoinformatics are discussed in the first part of this review. We introduce, in the second and third parts, the descriptions and analyses of different methods and techniques. Finally, current applications and problems are enumerated and discussed in the last part.
引用
收藏
页码:39 / 79
页数:41
相关论文
共 326 条
[1]   Combinatorial informatics in the post-genomics era [J].
Agrafiotis, DK ;
Lobanov, VS ;
Salemme, FR .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (05) :337-346
[2]   Feature selection for structure-activity correlation using binary particle swarms [J].
Agrafiotis, DK ;
Cedeño, W .
JOURNAL OF MEDICINAL CHEMISTRY, 2002, 45 (05) :1098-1107
[3]   On the use of neural network ensembles in QSAR and QSPR [J].
Agrafiotis, DK ;
Cedeño, W ;
Lobanov, VS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (04) :903-911
[4]   On the use of information theory for assessing molecular diversity [J].
Agrafiotis, DK .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (03) :576-580
[5]   Chirality codes and molecular structure [J].
Aires-De-Sousa, J ;
Gasteiger, J ;
Gutman, I ;
Vidovic, DI .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03) :831-836
[6]   A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening [J].
Amendolia, SR ;
Cossu, G ;
Ganadu, ML ;
Golosio, B ;
Masala, GL ;
Mura, GM .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2003, 69 (1-2) :13-20
[7]   Applications of maximum likelihood principal component analysis: incomplete data sets and calibration transfer [J].
Andrews, DT ;
Wentzell, PD .
ANALYTICA CHIMICA ACTA, 1997, 350 (03) :341-352
[8]  
[Anonymous], TOPOLOGICAL INDICES
[9]   SitePrint: Three-dimensional pharmacophore descriptors derived from protein binding sites for family based active site analysis, classification, and drug design [J].
Arnold, JR ;
Burdick, KW ;
Pegg, SCH ;
Toba, S ;
Lamb, ML ;
Kuntz, ID .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06) :2190-2198
[10]   SYBYL line notation (SLN): A versatile language for chemical structure representation [J].
Ash, S ;
Cline, MA ;
Homer, RW ;
Hurst, T ;
Smith, GB .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01) :71-79