Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery

被引:104
作者
Bosc, Nicolas [1 ]
Atkinson, Francis [1 ]
Felix, Eloy [1 ]
Gaulton, Anna [1 ]
Hersey, Anne [1 ]
Leach, Andrew R. [1 ]
机构
[1] European Bioinformat Inst EMBL EBI, Chemogen Team, Wellcome Genome Campus, Cambridge CB10 1SD, England
基金
英国惠康基金; 欧盟地平线“2020”;
关键词
QSAR; Mondrian conformal prediction; ChEMBL; Classification models; Cheminformatics; APPLICABILITY DOMAIN; CLASSIFICATION; DATABASE; CHEMICALS; DESIGN;
D O I
10.1186/s13321-018-0325-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Structure-activity relationship modelling is frequently used in the early stage of drug discovery to assess the activity of a compound on one or several targets, and can also be used to assess the interaction of compounds with liability targets. QSAR models have been used for these and related applications over many years, with good success. Conformal prediction is a relatively new QSAR approach that provides information on the certainty of a prediction, and so helps in decision-making. However, it is not always clear how best to make use of this additional information. In this article, we describe a case study that directly compares conformal prediction with traditional QSAR methods for large-scale predictions of target-ligand binding. The ChEMBL database was used to extract a data set comprising data from 550 human protein targets with different bioactivity profiles. For each target, a QSAR model and a conformal predictor were trained and their results compared. The models were then evaluated on new data published since the original models were built to simulate a real world application. The comparative study highlights the similarities between the two techniques but also some differences that it is important to bear in mind when the methods are used in practical drug discovery applications.
引用
收藏
页数:16
相关论文
共 49 条
  • [21] Matched Molecular Pair Analysis in Drug Discovery: Methods and Recent Applications
    Yang, Ziyi
    Shi, Shaohua
    Fu, Li
    Lu, Aiping
    Hou, Tingjun
    Cao, Dongsheng
    JOURNAL OF MEDICINAL CHEMISTRY, 2023, 66 (07) : 4361 - 4377
  • [22] Chemogenomics in drug discovery: computational methods based on the comparison of binding sites
    Vulpetti, Anna
    Kalliokoski, Tuomo
    Milletti, Francesca
    FUTURE MEDICINAL CHEMISTRY, 2012, 4 (15) : 1971 - 1979
  • [23] Large-Scale Prediction of Beneficial Drug Combinations Using Drug Efficacy and Target Profiles
    Iwata, Hiroaki
    Sawada, Ryusuke
    Mizutani, Sayaka
    Kotera, Masaaki
    Yamanishi, Yoshihiro
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (12) : 2705 - 2716
  • [24] Network analysis and in silico prediction of protein-protein interactions with applications in drug discovery
    Murakami, Yoichi
    Tripathi, Lokesh P.
    Prathipati, Philip
    Mizuguchi, Kenji
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2017, 44 : 134 - 142
  • [25] Virtual Screening Methods as Tools for Drug Lead Discovery from Large Chemical Libraries
    Ma, X. H.
    Zhu, F.
    Liu, X.
    Shi, Z.
    Zhang, J. X.
    Yang, S. Y.
    Wei, Y. Q.
    Chen, Y. Z.
    CURRENT MEDICINAL CHEMISTRY, 2012, 19 (32) : 5562 - 5571
  • [26] The QSAR Paradigm in Fragment-Based Drug Discovery: From the Virtual Generation of Target Inhibitors to Multi-Scale Modeling
    Kleandrova, Valeria V.
    Speck-Planche, Alejandro
    MINI-REVIEWS IN MEDICINAL CHEMISTRY, 2020, 20 (14) : 1357 - 1374
  • [27] Deep Learning With Conformal Prediction for Hierarchical Analysis of Large-Scale Whole-Slide Tissue Images
    Wieslander, Hakan
    Harrison, Philip J.
    Skogberg, Gabriel
    Jackson, Sonya
    Friden, Markus
    Karlsson, Johan
    Spjuth, Ola
    Wahlby, Carolina
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (02) : 371 - 380
  • [28] Large-scale prediction and testing of drug activity on side-effect targets
    Lounkine, Eugen
    Keiser, Michael J.
    Whitebread, Steven
    Mikhailov, Dmitri
    Hamon, Jacques
    Jenkins, Jeremy L.
    Lavan, Paul
    Weber, Eckhard
    Doak, Allison K.
    Cote, Serge
    Shoichet, Brian K.
    Urban, Laszlo
    NATURE, 2012, 486 (7403) : 361 - +
  • [29] Predicting Drug-Drug Interactions Through Large-Scale Similarity-Based Link Prediction
    Fokoue, Achille
    Sadoghi, Mohammad
    Hassanzadeh, Oktie
    Zhang, Ping
    SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, 2016, 9678 : 774 - 789
  • [30] Large-scale prediction of drug-target interactions using protein sequences and drug topological structures
    Cao, Dong-Sheng
    Liu, Shao
    Xu, Qing-Song
    Lu, Hong-Mei
    Huang, Jian-Hua
    Hu, Qian-Nan
    Liang, Yi-Zeng
    ANALYTICA CHIMICA ACTA, 2012, 752 : 1 - 10