Testing the predictive power of reverse screening to infer drug targets, with the help of machine learning

被引:11
作者
Daina, Antoine [1 ]
Zoete, Vincent [1 ,2 ]
机构
[1] SIB Swiss Inst Bioinformat, Mol Modeling Grp, CH-1015 Lausanne, Switzerland
[2] Univ Lausanne, Ludwig Inst Canc Res, Dept Oncol UNIL CHUV, Comp Aided Mol Engn,Lausanne Branch, Lausanne, Switzerland
关键词
SIMILARITY;
D O I
10.1038/s42004-024-01179-2
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Estimating protein targets of compounds based on the similarity principle-similar molecules are likely to show comparable bioactivity-is a long-standing strategy in drug research. Having previously quantified this principle, we present here a large-scale evaluation of its predictive power for inferring macromolecular targets by reverse screening an unprecedented vast external test set of more than 300,000 active small molecules against another bioactivity set of more than 500,000 compounds. We show that machine-learning can predict the correct targets, with the highest probability among 2069 proteins, for more than 51% of the external molecules. The strong enrichment thus obtained demonstrates its usefulness in supporting phenotypic screens, polypharmacology, or repurposing. Moreover, we quantified the impact of the bioactivity knowledge available for proteins in terms of number and diversity of actives. Finally, we advise that developers of such approaches follow an application-oriented benchmarking strategy and use large, high-quality, non-overlapping datasets as provided here. Ligand-based reverse screening plays an important role in predicting molecular targets of bioactive compounds, however, its predictive ability might be overlooked due to the limitations of existing external test sets. Here, the authors assess the predictive power of reverse screening with a large diverse external bioactivity dataset.
引用
收藏
页数:8
相关论文
共 44 条
  • [41] Prediction of physicochemical parameters by atomic contributions
    Wildman, SA
    Crippen, GM
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (05): : 868 - 873
  • [42] Current advances in ligand-based target prediction
    Yang, Su-Qing
    Ye, Qing
    Ding, Jun-Jie
    Ming-Zhu Yin
    Lu, Ai-Ping
    Chen, Xiang
    Hou, Ting-Jun
    Cao, Dong-Sheng
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2021, 11 (03)
  • [43] Ye Q., 2021, Lecture Notes in Computer Science, V12838, P87
  • [44] A simple statistical parameter for use in evaluation and validation of high throughput screening assays
    Zhang, JH
    Chung, TDY
    Oldenburg, KR
    [J]. JOURNAL OF BIOMOLECULAR SCREENING, 1999, 4 (02) : 67 - 73