Thompson Sampling-An Efficient Method for Searching Ultralarge Synthesis on Demand Databases

被引:20
作者
Klarich, Kathryn [1 ]
Goldman, Brian [2 ]
Kramer, Trevor [2 ]
Riley, Patrick [2 ]
Walters, W. Patrick [2 ]
机构
[1] ReNAgade Therapeut, Cambridge, MA 02139 USA
[2] Relay Therapeut, Cambridge, MA 02141 USA
关键词
SIMILARITY;
D O I
10.1021/acs.jcim.3c01790
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Over the last five years, virtual screening of ultralarge synthesis on-demand libraries has emerged as a powerful tool for hit identification in drug discovery programs. As these libraries have grown to tens of billions of molecules, we have reached a point where it is no longer cost-effective to screen every molecule virtually. To address these challenges, several groups have developed heuristic search methods to rapidly identify the best molecules on a virtual screen. This article describes the application of Thompson sampling (TS), an active learning approach that streamlines the virtual screening of large combinatorial libraries by performing a probabilistic search in the reagent space, thereby never requiring the full enumeration of the library. TS is a general technique that can be applied to various virtual screening modalities, including 2D and 3D similarity search, docking, and application of machine-learning models. In an illustrative example, we show that TS can identify more than half of the top 100 molecules from a docking-based virtual screen of 335 million molecules by evaluating 1% of the data set.
引用
收藏
页码:1158 / 1171
页数:14
相关论文
共 31 条
[21]   POLING - PROMOTING CONFORMATIONAL VARIATION [J].
SMELLIE, A ;
TEIG, SL ;
TOWBIN, P .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1995, 16 (02) :171-187
[22]   Similarity searching [J].
Stumpfe, Dagmar ;
Bajorath, Juergen .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2011, 1 (02) :260-282
[23]   Optimizing active learning for free energy calculations [J].
Thompson, James ;
Walters, Patrick ;
Feng, Jianwen ;
Pabon, Nicolas ;
Xu, Hongcheng ;
Maser, Michael ;
Goldman, Brian B. ;
Moustakas, Demetri ;
Schmidt, Molly ;
York, Forrest .
ARTIFICIAL INTELLIGENCE IN THE LIFE SCIENCES, 2022, 2
[24]   On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. [J].
Thompson, WR .
BIOMETRIKA, 1933, 25 :285-294
[25]   Applications of machine learning in drug discovery and development [J].
Vamathevan, Jessica ;
Clark, Dominic ;
Czodrowski, Paul ;
Dunham, Ian ;
Ferran, Edgardo ;
Lee, George ;
Li, Bin ;
Madabhushi, Anant ;
Shah, Parantu ;
Spitzer, Michaela ;
Zhao, Shanrong .
NATURE REVIEWS DRUG DISCOVERY, 2019, 18 (06) :463-477
[26]   Virtual Compound Libraries in Computer-Assisted Drug Discovery [J].
van Hilten, Niek ;
Chevillard, Florent ;
Kolb, Peter .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (02) :644-651
[27]   Virtual Chemical Libraries Miniperspective [J].
Walters, W. Patrick .
JOURNAL OF MEDICINAL CHEMISTRY, 2019, 62 (03) :1116-1124
[28]   Virtual screening - an overview [J].
Walters, WP ;
Stahl, MT ;
Murcko, MA .
DRUG DISCOVERY TODAY, 1998, 3 (04) :160-178
[29]   Exploration of Ultralarge Compound Collections for Drug Discovery [J].
Warr, Wendy A. ;
Nicklaus, Marc C. ;
Nicolaou, Christos A. ;
Rarey, Matthias .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (09) :2021-2034
[30]   Efficient Exploration of Chemical Space with Docking and Deep Learning [J].
Yang, Ying ;
Yao, Kun ;
Repasky, Matthew P. ;
Leswing, Karl ;
Abel, Robert ;
Shoichet, Brian K. ;
Jerome, Steven, V .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2021, 17 (11) :7106-7119