A graph-based approach to construct target-focused libraries for virtual screening

被引:28
作者
Naderi, Misagh [1 ]
Alvin, Chris [2 ]
Ding, Yun [3 ]
Mukhopadhyay, Supratik [4 ]
Brylinski, Michal [1 ,5 ]
机构
[1] Louisiana State Univ, Dept Biol Sci, Baton Rouge, LA 70803 USA
[2] Bradley Univ, Dept Comp Sci & Informat Syst, Peoria, IL 61625 USA
[3] Louisiana State Univ, Dept Phys & Astron, Baton Rouge, LA 70803 USA
[4] Louisiana State Univ, Dept Comp Sci & Engn, Baton Rouge, LA 70803 USA
[5] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA
关键词
Molecular synthesis; Virtual screening; Target-focused libraries; eSynth; Chemical space; Targeted drug discovery; DRUG-LIKE MOLECULES; DE-NOVO DESIGN; CHEMICAL SPACE; COMPUTER-PROGRAM; LIGAND DOCKING; DISCOVERY; DATABASE; VALIDATION; STRATEGIES; POTENT;
D O I
10.1186/s13321-016-0126-6
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: Due to exorbitant costs of high-throughput screening, many drug discovery projects commonly employ inexpensive virtual screening to support experimental efforts. However, the vast majority of compounds in widely used screening libraries, such as the ZINC database, will have a very low probability to exhibit the desired bioactivity for a given protein. Although combinatorial chemistry methods can be used to augment existing compound libraries with novel drug-like compounds, the broad chemical space is often too large to be explored. Consequently, the trend in library design has shifted to produce screening collections specifically tailored to modulate the function of a particular target or a protein family. Methods: Assuming that organic compounds are composed of sets of rigid fragments connected by flexible linkers, a molecule can be decomposed into its building blocks tracking their atomic connectivity. On this account, we developed eSynth, an exhaustive graph-based search algorithm to computationally synthesize new compounds by reconnecting these building blocks following their connectivity patterns. Results: We conducted a series of benchmarking calculations against the Directory of Useful Decoys, Enhanced database. First, in a self-benchmarking test, the correctness of the algorithm is validated with the objective to recover a molecule from its building blocks. Encouragingly, eSynth can efficiently rebuild more than 80 % of active molecules from their fragment components. Next, the capability to discover novel scaffolds is assessed in a cross-benchmarking test, where eSynth successfully reconstructed 40 % of the target molecules using fragments extracted from chemically distinct compounds. Despite an enormous chemical space to be explored, eSynth is computationally efficient; half of the molecules are rebuilt in less than a second, whereas 90 % take only about a minute to be generated. Conclusions: eSynth can successfully reconstruct chemically feasible molecules from molecular fragments. Furthermore, in a procedure mimicking the real application, where one expects to discover novel compounds based on a small set of already developed bioactives, eSynth is capable of generating diverse collections of molecules with the desired activity profiles. Thus, we are very optimistic that our effort will contribute to targeted drug discovery. eSynth is freely available to the academic community at www.brylinski.org/content/molecular-synthesis.
引用
收藏
页数:16
相关论文
共 59 条
[1]   THE SYNTHESIS OF COMMUNICATION PROTOCOLS [J].
AFRATI, F ;
PAPADIMITRIOU, CH ;
PAPAGEORGIOU, G .
ALGORITHMICA, 1988, 3 (03) :451-472
[2]   Kinase-targeted libraries: The design and synthesis of novel, potent, and selective kinase inhibitors [J].
Akritopoulou-Zanze, Irini ;
Hajduk, Philip J. .
DRUG DISCOVERY TODAY, 2009, 14 (5-6) :291-297
[3]   Similarity based virtual screening: A tool for targeted library design [J].
Alvesalo, JKO ;
Siiskonen, A ;
Vainio, MJ ;
Tammela, PSM ;
Vuorela, PM .
JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (07) :2353-2356
[4]  
Alvin C, 2014, AAAI CONF ARTIF INTE, P245
[5]  
[Anonymous], 1987, SMILES LINE NOTATION
[6]   Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? [J].
Bajusz, David ;
Racz, Anita ;
Heberger, Kroly .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[7]   The properties of known drugs .1. Molecular frameworks [J].
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) :2887-2893
[8]   SPACE/TIME TRADE/OFFS IN HASH CODING WITH ALLOWABLE ERRORS [J].
BLOOM, BH .
COMMUNICATIONS OF THE ACM, 1970, 13 (07) :422-&
[9]  
Bohacek R, 1999, IMA V MATH, V108, P103
[10]   THE COMPUTER-PROGRAM LUDI - A NEW METHOD FOR THE DENOVO DESIGN OF ENZYME-INHIBITORS [J].
BOHM, HJ .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1992, 6 (01) :61-78