Hole filling and library optimization: Application to commercially available fragment libraries

被引:9
作者
An, Yuling [1 ]
Sherman, Woody [1 ]
Dixon, Steven L. [1 ]
机构
[1] Schrodinger Inc, New York, NY 10036 USA
关键词
Cheminformatics; Chemical fingerprint; Compound library optimization; Hole filling; 2D FINGERPRINT METHODS; DRUG; DIVERSITY; SELECTION; DATABASE;
D O I
10.1016/j.bmc.2012.03.037
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Compound libraries comprise an integral component of drug discovery in the pharmaceutical and biotechnology industries. While in-house libraries often contain millions of molecules, this number pales in comparison to the accessible space of drug-like molecules. Therefore, care must be taken when adding new compounds to an existing library in order to ensure that unexplored regions in the chemical space are filled efficiently while not needlessly increasing the library size. In this work, we present an automated method to fill holes in an existing library using compounds from an external source and apply it to commercially available fragment libraries. The method, called Canvas HF, uses distances computed from 2D chemical fingerprints and selects compounds that fill vacuous regions while not suffering from the problem of selecting only compounds at the edge of the chemical space. We show that the method is robust with respect to different databases and the number of requested compounds to retrieve. We also present an extension of the method where chemical properties can be considered simultaneously with the selection process to bias the compounds toward a desired property space without imposing hard property cutoffs. We compare the results of Canvas HF to those obtained with a standard sphere exclusion method and with random compound selection and find that Canvas HF performs favorably. Overall, the method presented here offers an efficient and effective hole-filling strategy to augment compound libraries with compounds from external sources. The method does not have any fit parameters and therefore it should be applicable in most hole-filling applications. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5379 / 5387
页数:9
相关论文
共 29 条
  • [1] Multiobjective optimization of combinatorial libraries
    Agrafiotis, DK
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2001, 45 (3-4) : 545 - 566
  • [2] [Anonymous], 1985, COMPSTAT LECT
  • [3] [Anonymous], 1997, SPRINGER SERIES INFO
  • [4] Integration of virtual and high-throughput screening
    Bajorath, F
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (11) : 882 - 894
  • [5] The properties of known drugs .1. Molecular frameworks
    Bemis, GW
    Murcko, MA
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) : 2887 - 2893
  • [6] How Similar Are Similarity Searching Methods? A Principal Component Analysis of Molecular Descriptor Space
    Bender, Andreas
    Jenkins, Jeremy L.
    Scheiber, Josef
    Sukuru, Sai Chelan K.
    Glick, Meir
    Davies, John W.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (01) : 108 - 119
  • [7] Breinbauer R, 2002, ANGEW CHEM INT EDIT, V41, P2879
  • [8] Broach JR, 1996, NATURE, V384, P14
  • [9] OptiSim: An extended dissimilarity selection method for finding diverse representative subsets
    Clark, RD
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (06): : 1181 - 1188
  • [10] Bioactive diversity and screening library selection via affinity fingerprinting
    Dixon, SL
    Villar, HO
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06): : 1192 - 1203