OptDesign:: Extending optimizable k-dissimilarity selection to combinatorial library design

被引:11
作者
Clark, RD [1 ]
Kar, J [1 ]
Akella, L [1 ]
Soltanshahi, F [1 ]
机构
[1] Tripos Inc, St Louis, MO 63144 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2003年 / 43卷 / 03期
关键词
D O I
10.1021/ci025662h
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Optimizable k-dissimilarity (OptiSim) selection entails drawing a series of subsamples of size k from a population and choosing the "best" candidate from each such subsample for inclusion in the selection set. By varying the size of the subsample, one can control the balance between representativeness and diversity in the selection set obtained. In the original formulation, a uniform random sampling from among valid candidates was used to draw the subsamples from a single target population. Here we describe in detail two key modifications that serve to extend the OptiSim methodology to vector selection for interdependent variables, specifically as applied to the design of combinatorial sublibraries. The first modification involves pivoting between variables: subsamples are drawn from each reagent pool in turn, with the viability of each candidate being evaluated in isolation as well as in terms of the products it will produce from complementary reagents already selected. The filters applied may be static or dynamic in nature, with molecular weight and hydrophobicity being examples of the former and structural diversity with respect to reagents already selected being an example of the latter. The second key modification is adding the ability to bias the selection of candidate reagents for inclusion in the subsamples. Taken together, these modifications support the efficient generation of multiblock and other sparse matrix designs that are both representative and diverse, and for which "backfilling" of designs edited to remove undesirable reagents or products is straightforward. The method is intrinsically fast and efficient, since enumeration of the full combinatorial is not required- only those candidates actually considered for inclusion need be evaluated. Moreover, because the subsample selection step is separate from the diversity-based selection of the "best" candidate, incorporating such bias in favor of a competing criterion such as low price provides a "natural," nonparametric mechanism for generating designs that are likely to be "good" in a double-objective, Pareto sense.
引用
收藏
页码:829 / 836
页数:8
相关论文
共 18 条
  • [1] Multiobjective optimization of combinatorial libraries
    Agrafiotis, DK
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2002, 16 (5-6) : 335 - 356
  • [2] Design and synthesis of a maximally diverse and druglike screening library using REM resin methodology
    Barn, D
    Caulfield, W
    Cowley, P
    Dickins, R
    Bakker, WI
    McGuire, R
    Morphy, JR
    Rankovic, Z
    Thorn, M
    [J]. JOURNAL OF COMBINATORIAL CHEMISTRY, 2001, 3 (06): : 534 - 541
  • [3] Combinatorial library design for diversity, cost efficiency, and drug-like character
    Brown, RD
    Hassan, M
    Waldman, M
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5) : 427 - +
  • [4] Brown Robert D., 2001, P301
  • [5] Four association coefficients for relating molecular similarity measures
    Cheng, C
    Maggiora, G
    Lajiness, M
    Johnson, M
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (04): : 909 - 915
  • [6] OptiSim: An extended dissimilarity selection method for finding diverse representative subsets
    Clark, RD
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (06): : 1181 - 1188
  • [7] Visualizing substructural fingerprints
    Clark, RD
    Patterson, DE
    Soltanshahi, F
    Blake, JF
    Matthew, JB
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5) : 404 - +
  • [8] Balancing representativeness against diversity using optimizable K-dissimilarity and hierarchical clustering
    Clark, RD
    Langton, WJ
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06): : 1079 - 1086
  • [9] Getting past diversity in assessing virtual library designs
    Clark, RD
    [J]. JOURNAL OF THE BRAZILIAN CHEMICAL SOCIETY, 2002, 13 (06) : 788 - 794
  • [10] Clark Robert D., 2001, P337