BOKEI: Bayesian optimization using knowledge of correlated torsions and expected improvement for conformer generation

被引:15
作者
Chan, Lucian [1 ]
Hutchison, Geoffrey R. [2 ]
Morris, Garrett M. [1 ]
机构
[1] Univ Oxford, Dept Stat, 24-29 St Giles, Oxford OX1 3LB, England
[2] Univ Pittsburgh, Dept Chem & Chem Engn, 219 Parkman Ave, Pittsburgh, PA 15260 USA
基金
英国工程与自然科学研究理事会; 美国国家科学基金会;
关键词
CONFORMATIONS; DATABASE; ACCESS;
D O I
10.1039/c9cp06688h
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
A key challenge in conformer sampling is finding low-energy conformations with a small number of energy evaluations. We recently demonstrated the Bayesian Optimization Algorithm (BOA) is an effective method for finding the lowest energy conformation of a small molecule. Our approach balances between exploitation and exploration, and is more efficient than exhaustive or random search methods. Here, we extend strategies used on proteins and oligopeptides (e.g. Ramachandran plots of secondary structure) and study correlated torsions in small molecules. We use bivariate von Mises distributions to capture correlations, and use them to constrain the search space. We validate the performance of our new method, Bayesian Optimization with Knowledge-based Expected Improvement (BOKEI), on a dataset consisting of 533 diverse small molecules, using (i) a force field (MMFF94); and (ii) a semi-empirical method (GFN2), as the objective function. We compare the search performance of BOKEI, BOA with Expected Improvement (BOA-EI), and a genetic algorithm (GA), using a fixed number of energy evaluations. In more than 60% of the cases examined, BOKEI finds lower energy conformations than global optimization with BOA-EI or GA. More importantly, we find correlated torsions in up to 15% of small molecules in larger data sets, up to 8 times more often than previously reported. The BOKEI patterns not only describe steric clashes, but also reflect favorable intramolecular interactions such as hydrogen bonds and pi-pi stacking. Increasing our understanding of the conformational preferences of molecules will help improve our ability to find low energy conformers efficiently, which will have impact in a wide range of computational modeling applications.
引用
收藏
页码:5211 / 5219
页数:9
相关论文
共 43 条
[1]  
[Anonymous], 2008, Che. Cent. J., DOI DOI 10.1186/1752-153X-2-5
[2]  
[Anonymous], GPYOPT BAYESIAN OPTI
[3]  
[Anonymous], ABS10122599
[4]  
[Anonymous], 2019, ARXIV190602236
[5]  
[Anonymous], 2012, STAT BIOL HLTH
[6]   GFN2-xTB-An Accurate and Broadly Parametrized Self-Consistent Tight-Binding Quantum Chemical Method with Multipole Electrostatics and Density-Dependent Dispersion Contributions [J].
Bannwarth, Christoph ;
Ehlert, Sebastian ;
Grimme, Stefan .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (03) :1652-1671
[7]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[8]   A generative, probabilistic model of local protein structure [J].
Boomsma, Wouter ;
Mardia, Kanti V. ;
Taylor, Charles C. ;
Ferkinghoff-Borg, Jesper ;
Krogh, Anders ;
Hamelryck, Thomas .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (26) :8932-8937
[9]   Bayesian optimization for conformer generation [J].
Chan, Lucian ;
Hutchison, Geoffrey R. ;
Morris, Garrett M. .
JOURNAL OF CHEMINFORMATICS, 2019, 11
[10]   Knowledge-Based Conformer Generation Using the Cambridge Structural Database [J].
Cole, Jason C. ;
Korb, Oliver ;
McCabe, Patrick ;
Read, Murray G. ;
Taylor, Robin .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (03) :615-629