Freely Available Conformer Generation Methods: How Good Are They?

被引:155
作者
Ebejer, Jean-Paul [1 ,2 ]
Morris, Garrett M. [2 ]
Deane, Charlotte M. [1 ]
机构
[1] Univ Oxford, Dept Stat, Oxford Prot Informat Grp, Oxford OX1 3TG, England
[2] InhibOx Ltd, Oxford Ctr Innovat, Oxford OX1 1BY, England
关键词
CONFORMATIONAL-ANALYSIS; MOLECULAR-MECHANICS; FORCE-FIELD; DATABASE; PERFORMANCE; VALIDATION; ALGORITHM; CATALYST; DIVERSE; OMEGA;
D O I
10.1021/ci2004658
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Conformer generation has important implications in cheminformatics, particularly in computational drug discovery where the quality of conformer generation software may affect the outcome of a virtual screening exercise. We examine the performance of four freely available small molecule conformer generation tools (BALLOON, CONFAB, FROG2, and RDKIT) alongside a commercial tool (MOE). The aim of this study is 3-fold: (i) to identify which tools most accurately reproduce experimentally determined structures; (ii) to examine the diversity of the generated conformational set; and (iii) to benchmark the computational time expended. These aspects were tested using a set of 708 drug-like molecules assembled from the OMEGA validation set and the Astex Diverse Set. These molecules have varying physicochemical properties and at least one known X-ray crystal structure. We found that RDKIT and CONFAB are statistically better than other methods at generating low rmsd conformers to the known structure. RDKIT is particularly suited for less flexible molecules while CONFAB, with its systematic approach, is able to generate conformers which are geometrically closer to the experimentally determined structure for molecules with a large number of rotatable bonds (>= 10). In our tests RDKIT also resulted as the second fastest method after FROG2. In order to enhance the performance of RDKIT, we developed a postprocessing algorithm to build a diverse and representative set of conformers which also contains a close conformer to the known structure. Our analysis indicates that, with postprocessing, RDKIT is a valid free alternative to commercial, proprietary software.
引用
收藏
页码:1146 / 1158
页数:13
相关论文
共 54 条
  • [11] Property distributions: Differences between drugs, natural products, and molecules from combinatorial chemistry
    Feher, M
    Schmidt, JM
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (01): : 218 - 227
  • [12] Comparison of knowledge-based and distance geometry approaches for generation of molecular conformations
    Feuston, BP
    Miller, MD
    Culberson, JC
    Nachbar, RB
    Kearsley, SK
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (03): : 754 - 763
  • [13] The CoFactor database: organic cofactors in enzyme catalysis
    Fischer, Julia D.
    Holliday, Gemma L.
    Thornton, Janet M.
    [J]. BIOINFORMATICS, 2010, 26 (19) : 2496 - 2497
  • [14] Gu J., 2009, STRUCTURAL BIOINFORM, P639
  • [15] Blue Obelisk - Interoperability in chemical informatics
    Guha, Rajarshi
    Howard, Michael T.
    Hutchison, Geoffrey R.
    Murray-Rust, Peter
    Rzepa, Henry
    Steinbeck, Christoph
    Wegner, Jorg
    Willighagen, Egon L.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (03) : 991 - 998
  • [16] Three-dimensional shape-based searching of conformationally flexible compounds
    Hahn, M
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01): : 80 - 86
  • [17] Halgren TA, 1996, J COMPUT CHEM, V17, P490, DOI [10.1002/(SICI)1096-987X(199604)17:5/6<490::AID-JCC1>3.0.CO
  • [18] 2-P, 10.1002/(SICI)1096-987X(199604)17:5/6<616::AID-JCC5>3.0.CO
  • [19] 2-X]
  • [20] Diverse, high-quality test set for the validation of protein-ligand docking performance
    Hartshorn, Michael J.
    Verdonk, Marcel L.
    Chessari, Gianni
    Brewerton, Suzanne C.
    Mooij, Wijnand T. M.
    Mortenson, Paul N.
    Murray, Christopher W.
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2007, 50 (04) : 726 - 741