Large-Scale Systematic Analysis of 2D Fingerprint Methods and Parameters to Improve Virtual Screening Enrichments

被引:276
作者
Sastry, Madhavi [2 ]
Lowrie, Jeffrey F. [1 ]
Dixon, Steven L. [1 ]
Sherman, Woody [1 ]
机构
[1] Schrodinger, New York, NY 10036 USA
[2] Schrodinger, Hyderabad 500034, Andhra Pradesh, India
关键词
STRUCTURAL DESCRIPTORS; CHEMICAL STRUCTURES; COMPOUND SELECTION; SIMILARITY; SHAPE; DATABASES; DOCKING; PERFORMANCE; VALIDATION; GENERATION;
D O I
10.1021/ci100062n
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A systematic virtual screening study on 11 pharmaceutically relevant targets has been conducted to investigate the interrelation between 8 two-dimensional (2D) fingerprinting methods, 13 atom-typing schemes, 13 bit scaling rules, and 12 similarity metrics using the new cheminformatics package Canvas. In total, 157 872 virtual screens were performed to assess the ability of each combination of parameters to identify actives in a database screen. In general, fingerprint methods, such as MOLPRINT2D, Radial, and Dendritic that encode information about local environment beyond simple linear paths outperformed other fingerprint methods. Atom-typing schemes with more specific information, such as Daylight, Mol2, and Carhart were generally superior to more generic atom-typing schemes. Enrichment factors across all targets were improved considerably with the best settings, although no single set of parameters performed optimally on all targets. The size of the addressable bit space or the fingerprints was also explored, and it was found to have a substantial impact on enrichments. Small bit spaces, such as 1024, resulted in many collisions and in a significant degradation in enrichments compared to larger bit spaces that avoid collisions.
引用
收藏
页码:771 / 784
页数:14
相关论文
共 53 条
  • [1] Can we learn to distinguish between "drug-like" and "nondrug-like" molecules?
    Ajay
    Walters, WP
    Murcko, MA
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (18) : 3314 - 3324
  • [2] [Anonymous], 1984, MACCS 2
  • [3] [Anonymous], 2009, CANV VERS 1 2
  • [4] [Anonymous], 2009, ROCS VERS 3 0
  • [5] ATO SJ, 2000, PHARMACOPHORE PERCEP, P110
  • [6] Lossless compression of chemical fingerprints using integer entropy codes improves storage and retrieval
    Baldi, Pierre
    Benz, Ryan W.
    Hirschberg, Daniel S.
    Swamidass, S. Joshua
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (06) : 2098 - 2109
  • [7] Ballester PJ, 2007, J COMPUT CHEM, V28, P1711, DOI [10.1002/jcc.20681, 10.1002/JCC.20681]
  • [8] Similarity searching of chemical databases using atom environment descriptors (MOLPRINT 2D): Evaluation of performance
    Bender, A
    Mussa, HY
    Glen, RC
    Reiling, S
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (05): : 1708 - 1718
  • [9] Molecular similarity searching using atom environments, information-based feature selection, and a naive Bayesian classifier
    Bender, A
    Mussa, HY
    Glen, RC
    Reiling, S
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01): : 170 - 178
  • [10] Bohm H.J., 2000, VIRTUAL SCREENING BI