The Development of Target-Specific Machine Learning Models as Scoring Functions for Docking-Based Target Prediction

被引:39
|
作者
Nogueira, Mauro S. [1 ]
Koch, Oliver [1 ,2 ]
机构
[1] TU Dortmund Univ, Fac Chem & Chem Biol, Otto Hahn Str 6, D-44227 Dortmund, Germany
[2] Westfalische Wilhelms Univ Munster, Inst Pharmaceut & Med Chem, Corrensstr 48, D-48149 Munster, Germany
关键词
WEB SERVER; MACROMOLECULAR TARGETS; INTERACTION FINGERPRINT; DRUG DISCOVERY; CLASSIFICATION; IDENTIFICATION; PHARMACOLOGY; MECHANISMS; MOLECULES; UPDATE;
D O I
10.1021/acs.jcim.8b00773
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The identification of possible targets for a known bioactive compound is of the utmost importance for drug design and development. Molecular docking is one possible approach for in-silico protein target prediction, whereas a molecule is docked into several different protein structures to identify potential targets. This reverse docking approach is hampered by the limitation of current scoring functions to correctly discriminate between targets and nontargets. In this work, a development of target-specific scoring functions is described that showed improved prediction performances for the correct target prediction of both actives and decoys on three validation data sets. In contrast to pure ligand-based approaches, that are in general faster and include a greater target space, docking-based approaches can cover also unknown chemical space that lies outside the known bioactivity data. These target-specific scoring functions are based on known bioactivity data retrieved from ChEMBL and supervised machine learning approaches. Neural Networks and Support Vector Machines (SVMs) models were trained for 20 different protein targets. Our protein-ligand interaction fingerprint PADIF (Protein Atom Score Contributions Derived Interaction Fingerprint) represents the input for training, whereas the PADIFs are calculated based on docking poses of active and inactive compounds. Different data sets of previously unseen molecules were used for the final evaluation and analysis of the prediction performance of the created models. For a single-target selectivity data set, the correct target model returns in most of the cases the highest probabilities scores for their active molecules and with statistically significant differences from the other targets. These probability scores were also predicted and successfully used to rank the targets for molecules of a multitarget data set with activity data described simultaneously for two, three, and four to seven protein targets.
引用
收藏
页码:1238 / 1252
页数:15
相关论文
共 50 条
  • [31] ReMODE: a deep learning-based web server for target-specific drug design
    Wang, Mingyang
    Wang, Jike
    Weng, Gaoqi
    Kang, Yu
    Pan, Peichen
    Li, Dan
    Deng, Yafeng
    Li, Honglin
    Hsieh, Chang-Yu
    Hou, Tingjun
    JOURNAL OF CHEMINFORMATICS, 2022, 14 (01)
  • [32] ReMODE: a deep learning-based web server for target-specific drug design
    Mingyang Wang
    Jike Wang
    Gaoqi Weng
    Yu Kang
    Peichen Pan
    Dan Li
    Yafeng Deng
    Honglin Li
    Chang-Yu Hsieh
    Tingjun Hou
    Journal of Cheminformatics, 14
  • [33] Knowledge-Based Scoring Functions in Drug Design. 1. Developing a Target-Specific Method for Kinase-Ligand Interactions
    Xue, Mengzhu
    Zheng, Mingyue
    Xiong, Bing
    Li, Yanlian
    Jiang, Hualiang
    Shen, Jingkang
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (08) : 1378 - 1386
  • [34] Misuse of large language models: Exploiting weaknesses for target-specific outputs
    Klinkhammer, Dennis
    ZEITSCHRIFT FUER TECHNIKFOLGENABSCHAETZUNG IN THEORIE UND PRAXIS - TATUP, 2024, 33 (02):
  • [35] Influence of Data Similarity on the Scoring Power of Machine-learning Scoring Functions for Docking
    Sze, Kam-Heung
    Xiong, Zhiqiang
    Ma, Jinlong
    Lu, Gang
    Chan, Wai-Yee
    Li, Hongjian
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2020, : 85 - 92
  • [36] A machine learning approach for miRNA target prediction
    Liu, Hui
    Yue, Dong
    Zhang, Lin
    Gao, Shou-Jiang
    Huang, Yufei
    2008 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS, 2008, : 6 - +
  • [37] Meta-VOS: Learning to Adapt Online Target-Specific Segmentation
    Xu, Chunyan
    Wei, Li
    Cui, Zhen
    Zhang, Tong
    Yang, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4760 - 4772
  • [38] Machine Learning Models for Mycobacterium tuberculosis In Vitro Activity: Prediction and Target Visualization
    Lane, Thomas R.
    Urbina, Fabio
    Rank, Laura
    Gerlach, Jacob
    Riabova, Olga
    Lepioshkin, Alexander
    Kazakova, Elena
    Vocat, Anthony
    Tkachenko, Valery
    Cole, Stewart
    Makarov, Vadim
    Ekins, Sean
    MOLECULAR PHARMACEUTICS, 2022, 19 (02) : 674 - 689
  • [39] ESTRADIOL RECEPTOR FUNCTIONS OF SOLUBLE-PROTEINS FROM TARGET-SPECIFIC LYSOSOMES
    HIRSCH, PC
    SZEGO, CM
    JOURNAL OF STEROID BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1974, 5 (06): : 533 - 542
  • [40] Machine Learning-Based Scoring Functions, Development and Applications with SAnDReS
    Bitencourt-Ferreira, Gabriela
    Rizzotto, Camila
    de Azevedo Junior, Walter Filgueira
    CURRENT MEDICINAL CHEMISTRY, 2021, 28 (09) : 1746 - 1756