QSAR-derived affinity fingerprints (part 1): fingerprint construction and modeling performance for similarity searching, bioactivity classification and scaffold hopping

Cited by: 28
Authors
Skuta, C. [1]
Cortes-Ciriano, I. [2]
Dehaen, W. [1,3]
Kriz, P. [4]
van Westen, G. J. P. [5]
Tetko, I. V. [6,7]
Bender, A. [2]
Svozil, D. [1,3]
Affiliations
[1] ASCR, Inst Mol Genet, CZ OPENSCREEN Natl Infrastruct Chem Biol, Vvi, Videnska 1083, Prague 14220 4, Czech Republic
[2] Univ Cambridge, Ctr Mol Informat, Dept Chem, Lensfield Rd, Cambridge CB2 1EW, England
[3] Univ Chem & Technol Prague, Fac Chem Technol, Dept Informat & Chem, CZ OPENSCREEN Natl Infrastruct Chem Biol, Tech 5, Prague 16628, Czech Republic
[4] Univ Chem & Technol Prague, Fac Chem Technol, Dept Math, Tech 5, Prague 16628, Czech Republic
[5] Leiden Univ, Drug Discovery & Safety, Computat Drug Discovery, LACDR, Einsteinweg 55, NL-2333 CC Leiden, Netherlands
[6] Helmholtz Zentrum Muenchen, German Res Ctr Environm Hlth GmbH, Ingolstaedter Landstr 1, D-85764 Neuherberg, Germany
[7] BIGCHEM GmbH, Ingolstaedter Landstr 1, D-85764 Neuherberg, Germany
Keywords
Affinity fingerprint; Biological fingerprint; QSAR; Similarity searching; Bioactivity modeling; Scaffold hopping; MOLECULAR SIMILARITY; DRUG DISCOVERY; PREDICTION; VALIDATION; PROFILES; INHIBITORS; ACCURACY; DESIGN; SETS; APPLICABILITY
DOI
10.1186/s13321-020-00443-6
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
An affinity fingerprint is a vector consisting of a compound's affinities or potencies against a reference panel of protein targets. Here, we present the QAFFP fingerprint, a 440-element in silico QSAR-based affinity fingerprint whose components are predicted by Random Forest regression models trained on bioactivity data from the ChEMBL database. Both real-valued (rv-QAFFP) and binary (b-QAFFP) versions of the QAFFP fingerprint were implemented, and their performance in similarity searching, biological activity classification and scaffold hopping was assessed and compared to that of the 1024-bit Morgan2 fingerprint (the RDKit implementation of the ECFP4 fingerprint). In both similarity searching and biological activity classification, the QAFFP fingerprint yields retrieval rates, measured by AUC (~0.65 and ~0.70 for similarity searching, depending on the data set, and ~0.85 for classification) and EF5 (~4.67 and ~5.82 for similarity searching, depending on the data set, and ~2.10 for classification), comparable to those of the Morgan2 fingerprint (similarity searching AUC of ~0.57 and ~0.66 and EF5 of ~4.09 and ~6.41, depending on the data set; classification AUC of ~0.87 and EF5 of ~2.16). However, the QAFFP fingerprint outperforms the Morgan2 fingerprint in scaffold hopping: it retrieves 1146 of the 1749 existing scaffolds, whereas the Morgan2 fingerprint retrieves only 864.
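The two retrieval metrics quoted in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that EF5 denotes the enrichment factor in the top 5% of a ranked list, that binary fingerprints are compared with the Tanimoto coefficient, and that AUC is the area under the ROC curve. The function names `tanimoto`, `roc_auc` and `enrichment_factor` are hypothetical and chosen for illustration only.

```python
from typing import Sequence


def tanimoto(a: Sequence[int], b: Sequence[int]) -> float:
    """Tanimoto coefficient of two equal-length binary vectors."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


def roc_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Probability that a random active is scored above a random inactive.

    Ties between an active and an inactive count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one active and one inactive")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def enrichment_factor(scores: Sequence[float], labels: Sequence[int],
                      fraction: float = 0.05) -> float:
    """Hit rate in the top `fraction` of the ranking over the overall hit rate."""
    total_hits = sum(1 for y in labels if y)
    if total_hits == 0:
        raise ValueError("need at least one active")
    n_top = max(1, round(len(scores) * fraction))
    ranked = sorted(zip(scores, labels), key=lambda t: t[0], reverse=True)
    hits_top = sum(1 for _, y in ranked[:n_top] if y)
    return (hits_top / n_top) / (total_hits / len(labels))


if __name__ == "__main__":
    # Toy similarity search: rank a tiny library against one query fingerprint.
    query = [1, 0, 1, 1, 0, 0]
    library = [[1, 0, 1, 0, 0, 0], [0, 1, 0, 0, 1, 1], [1, 0, 1, 1, 0, 1]]
    labels = [1, 0, 1]  # made-up activity labels, for illustration only
    scores = [tanimoto(query, fp) for fp in library]
    print(roc_auc(scores, labels), enrichment_factor(scores, labels))
```

Under this definition, an enrichment factor of 1 corresponds to random ranking, so an EF5 of about 5 would mean that actives are roughly five times more frequent in the top 5% of the list than in the data set as a whole.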
Pages: 16