Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery

被引:104
作者
Bosc, Nicolas [1 ]
Atkinson, Francis [1 ]
Felix, Eloy [1 ]
Gaulton, Anna [1 ]
Hersey, Anne [1 ]
Leach, Andrew R. [1 ]
机构
[1] European Bioinformat Inst EMBL EBI, Chemogen Team, Wellcome Genome Campus, Cambridge CB10 1SD, England
基金
英国惠康基金; 欧盟地平线“2020”;
关键词
QSAR; Mondrian conformal prediction; ChEMBL; Classification models; Cheminformatics; APPLICABILITY DOMAIN; CLASSIFICATION; DATABASE; CHEMICALS; DESIGN;
D O I
10.1186/s13321-018-0325-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Structure-activity relationship modelling is frequently used in the early stage of drug discovery to assess the activity of a compound on one or several targets, and can also be used to assess the interaction of compounds with liability targets. QSAR models have been used for these and related applications over many years, with good success. Conformal prediction is a relatively new QSAR approach that provides information on the certainty of a prediction, and so helps in decision-making. However, it is not always clear how best to make use of this additional information. In this article, we describe a case study that directly compares conformal prediction with traditional QSAR methods for large-scale predictions of target-ligand binding. The ChEMBL database was used to extract a data set comprising data from 550 human protein targets with different bioactivity profiles. For each target, a QSAR model and a conformal predictor were trained and their results compared. The models were then evaluated on new data published since the original models were built to simulate a real world application. The comparative study highlights the similarities between the two techniques but also some differences that it is important to bear in mind when the methods are used in practical drug discovery applications.
引用
收藏
页数:16
相关论文
共 49 条
  • [31] Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data
    Won, Sungho
    Choi, Hosik
    Park, Suyeon
    Lee, Juyoung
    Park, Changyi
    Kwon, Sunghoon
    BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [32] The Expanding Landscape of Graph Theoretic Molecular Descriptors: Development, Gradual Diversification of Descriptor Space, and Applications in QSAR/QMSA and New Drug Discovery
    Basak, Subhash C.
    CURRENT COMPUTER-AIDED DRUG DESIGN, 2017, 13 (03) : 172 - 176
  • [33] Large-scale integrated super-computing platform for next generation virtual drug discovery
    Mitchell, Wayne
    Matsumoto, Shunji
    CURRENT OPINION IN CHEMICAL BIOLOGY, 2011, 15 (04) : 553 - 559
  • [34] Adverse drug reaction or innocent bystander? A systematic comparison of statistical discovery methods for spontaneous reporting systems
    Dijkstra, Louis
    Garling, Marco
    Foraita, Ronja
    Pigeot, Iris
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 (04) : 396 - 403
  • [35] Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles
    Liu, Chengyou
    Hogan, Andrew M.
    Sturm, Hunter
    Khan, Mohd Wasif
    Islam, Md. Mohaiminul
    Rahman, A. S. M. Zisanur
    Davis, Rebecca
    Cardona, Silvia T.
    Hu, Pingzhao
    JOURNAL OF CHEMINFORMATICS, 2022, 14 (01)
  • [36] DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
    Zheng, Yi
    Peng, Hui
    Zhang, Xiaocai
    Zhao, Zhixun
    Gao, Xiaoying
    Li, Jinyan
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [37] Network Distance Prediction for Enabling Service-Oriented Applications over Large-Scale Networks
    Huang, Haojun
    Yin, Hao
    Min, Geyong
    Wu, Dapeng Oliver
    Wu, Yulei
    Zuo, Tao
    Li, Ke
    IEEE COMMUNICATIONS MAGAZINE, 2015, 53 (08) : 166 - 174
  • [38] Towards agile large-scale predictive modelling in drug discovery with flow-based programming design principles
    Lampa, Samuel
    Alvarsson, Jonathan
    Spjuth, Ola
    JOURNAL OF CHEMINFORMATICS, 2016, 8
  • [39] Large-Scale Prediction of Drug Targets Based on Local and Global Consistency of Chemical-Chemical Networks
    Huang, Guohua
    Feng, Kaiyan
    Li, Xiaomei
    Peng, Yan
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2016, 19 (02) : 121 - 128
  • [40] The Comparison of Machine Learning Methods for Prediction Study of Type 2 Diabetes Mellitus's Drug Design
    Husna, Nadya Asanul
    Bustamam, Alhadi
    Yanuar, Arry
    Sarwinda, Devvi
    Hermansyah, Oky
    SYMPOSIUM ON BIOMATHEMATICS 2019 (SYMOMATH 2019), 2020, 2264