Virtual Screening with Generative Topographic Maps: How Many Maps Are Required?

被引:23
作者
Casciuc, Iuri [1 ]
Zabolotna, Yuliana [1 ]
Horvath, Dragos [1 ]
Marcou, Gilles [1 ]
Bajorath, Juergen [2 ]
Varnek, Alexandre [1 ]
机构
[1] CNRS, Inst LeBel, Lab Chemoinformat, UMR 7140, 4 Rue B Pascal, F-67081 Strasbourg, France
[2] Univ Bonn, Unit Chem Biol & Med Chem, Limes, B IT, D-53115 Bonn, Germany
关键词
SPACE; FRAGMENT; ISIDA;
D O I
10.1021/acs.jcim.8b00650
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Universal generative topographic maps (GTMs) provide two-dimensional representations of chemical space selected for their "polypharmacological competence", that is, the ability to simultaneously represent meaningful activity and property landscapes, associated with many distinct targets and properties. Several such GTMs can be generated, each based on a different initial descriptor vector, encoding distinct structural features. While their average polypharmacological competence may indeed be equivalent, they nevertheless significantly diverge with respect to the quality of each property-specific landscape. In this work, we show that distinct universal maps represent complementary and strongly synergistic views of biologically relevant chemical space. Eight universal GTMs were employed as support for predictive classification landscapes, using more than 600 active/inactive ligand series associated with as many targets from the ChEMBL database (v.23). For nine of these targets, it was possible to extract, from the Directory of Useful Decoys (DUD), truly external sets featuring sufficient "actives" and "decoys" not present in the landscape-defining ChEMBL ligand sets. For each such molecule, projected on every class landscape of a particular universal map, a probability of activity was estimated, in analogy to a virtual screening (VS) experiment. Cross-validated (CV) balanced accuracy on landscape-defining ChEMBL data was unable to predict the success of that landscape in VS. Thus, the universal map with best CV results for a given property should not be prioritized as the implicitly best predictor. For a given map, predictions for many DUD compounds are not trustworthy, according to applicability domain considerations. By contrast, simultaneous application of all universal maps, and rating of the likelihood of activity as the mean returned by all applicable maps, significantly improved prediction results. Performance measures in consensus VS using multiple maps were always superior or similar to those of the best individual map.
引用
收藏
页码:564 / 572
页数:9
相关论文
共 22 条
  • [1] GTM: The generative topographic mapping
    Bishop, CM
    Svensen, M
    Williams, CKI
    [J]. NEURAL COMPUTATION, 1998, 10 (01) : 215 - 234
  • [2] ChemAxon Ltd:, 2012, CHEMAXON STAND C VER
  • [3] GTM-Based QSAR Models and Their Applicability Domains
    Gaspar, H. A.
    Baskin, I. I.
    Marcou, G.
    Horvath, D.
    Varnek, A.
    [J]. MOLECULAR INFORMATICS, 2015, 34 (6-7) : 348 - 356
  • [4] Generative Topographic Mapping-Based Classification Models and Their Applicability Domain: Application to the Biopharmaceutics Drug Disposition Classification System (BDDCS)
    Gaspar, Helena A.
    Marcou, Gilles
    Horvath, Dragos
    Arault, Alban
    Lozano, Sylvain
    Vayer, Philippe
    Varnek, Alexandre
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (12) : 3318 - 3325
  • [5] ChEMBL: a large-scale bioactivity database for drug discovery
    Gaulton, Anna
    Bellis, Louisa J.
    Bento, A. Patricia
    Chambers, Jon
    Davies, Mark
    Hersey, Anne
    Light, Yvonne
    McGlinchey, Shaun
    Michalovich, David
    Al-Lazikani, Bissan
    Overington, John P.
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D1100 - D1107
  • [6] Visualization and Analysis of Complex Reaction Data: The Case of Tautomeric Equilibria
    Glavatskikh, Marta
    Madzhidov, Timur
    Baskin, Igor I.
    Horvath, Dragos
    Nugmanov, Ramil
    Gimadiev, Timur
    Marcou, Gilles
    Varnek, Alexandre
    [J]. MOLECULAR INFORMATICS, 2018, 37 (9-10)
  • [7] Beware of q2!
    Golbraikh, A
    Tropsha, A
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2002, 20 (04) : 269 - 276
  • [8] Benchmarking sets for molecular docking
    Huang, Niu
    Shoichet, Brian K.
    Irwin, John J.
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (23) : 6789 - 6801
  • [9] From bird's eye views to molecular communities: two-layered visualization of structure-activity relationships in large compound data sets
    Kayastha, Shilva
    Kunimoto, Ryo
    Horvath, Dragos
    Varnek, Alexandre
    Bajorath, Juergen
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2017, 31 (11) : 961 - 977
  • [10] Privileged Structural Motif Detection and Analysis Using Generative Topographic Maps
    Kayastha, Shilva
    Horvath, Dragos
    Gilberg, Erik
    Guetschow, Michael
    Bajorath, Juergen
    Varnek, Alexandre
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (05) : 1218 - 1232