Similarity metrics and descriptor spaces - Which combinations to choose?

Cited by: 42
Authors
Glen, Robert C. [1]
Adams, Samuel E. [1]
Affiliations
[1] Univ Cambridge, Dept Chem, Unilever Ctr Mol Sci Informat, Cambridge CB2 1EW, England
Source
QSAR & COMBINATORIAL SCIENCE | 2006 / Vol. 25 / Issue 12
Keywords
descriptors; diversity; high-throughput; in-silico; machine learning; metrics; QSAR; screening; similarity; structure activity; virtual library;
DOI
10.1002/qsar.200610097
Chinese Library Classification number
R914 [Medicinal chemistry]
Discipline classification code
100701
Abstract
Molecular similarity is widely used in virtual screening. A very large number of combinations of molecular descriptors and analysis methods can be used to pre-select compounds. The objectives strongly influence the methods chosen, in particular whether the desired outcome is to design a diverse library for initial screening, to follow up with additional similar hits (perhaps to help in establishing SAR), or to discover novel scaffolds (lead hopping) with the objective of obtaining novel patentable series (perhaps with different pharmacokinetics). Some of the factors that influence these decisions are discussed, along with applications that compare and contrast methods and their performance in different situations.
Pages: 1133 - 1142
Number of pages: 10