Application of Deep Neural Network Models in Drug Discovery Programs

被引:27
作者
Grebner, Christoph [1 ]
Matter, Hans [1 ]
Kofink, Daniel [2 ]
Wenzel, Jan [3 ]
Schmidt, Friedemann [3 ]
Hessler, Gerhard [1 ]
机构
[1] Sanofi Aventis Deutschland GmbH, Integrated Drug Discovery, R&D, Ind Pk Hochst, D-65926 Frankfurt, Germany
[2] Sanofi Aventis France SA, AI & Deep Analyt, Digital & Data Sci, R&D, 1 Ave Pierre Bross Tette, F-91380 Chilly Mazarin, France
[3] Sanofi Aventis Deutschland GmbH, Preclin Safety, R&D, Ind Pk Hochst, D-65926 Frankfurt, Germany
关键词
deep neural networks; property predictions; drug design; structure-activity relationships; graph convolutional networks; FACTOR-XA INHIBITORS; MACHINE LEARNING-METHODS; 2-CARBOXYINDOLE SCAFFOLD; IN-VITRO; POTENT; OPTIMIZATION; PREDICTION; UTILITY; DESIGN; CYTOCHROME-P450;
D O I
10.1002/cmdc.202100418
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In silico driven optimization of compound properties related to pharmacokinetics, pharmacodynamics, and safety is a key requirement in modern drug discovery. Nowadays, large and harmonized datasets allow to implement deep neural networks (DNNs) as a framework for leveraging predictive models. Nevertheless, various available model architectures differ in their global applicability and performance in lead optimization projects, such as stability over time and interpretability of the results. Here, we describe and compare the value of established DNN-based methods for the prediction of key ADME property trends and biological activity in an industrial drug discovery environment, represented by microsomal lability, CYP3A4 inhibition and factor Xa inhibition. Three architectures are exemplified, our earlier described multilayer perceptron approach (MLP), graph convolutional network-based models (GCN) and a vector representation approach, Mol2Vec. From a statistical perspective, MLP and GCN were found to perform superior over Mol2Vec, when applied to external validation sets. Interestingly, GCN-based predictions are most stable over a longer period in a time series validation study. Apart from those statistical observations, DNN prove of value to guide local SAR. To illustrate this important aspect in pharmaceutical research projects, we discuss challenging applications in medicinal chemistry towards a more realistic picture of artificial intelligence in drug discovery.
引用
收藏
页码:3772 / 3786
页数:15
相关论文
共 91 条
  • [1] [Anonymous], 2016, ARXIV161103199
  • [2] [Anonymous], IND TOOLK
  • [3] [Anonymous], 2020, SCHRODINGER RELEASE
  • [4] Identification of diverse database subsets using property-based and fragment-based molecular descriptions
    Ashton, M
    Barnard, J
    Casset, F
    Charlton, M
    Downs, G
    Gorse, D
    Holliday, J
    Lahana, R
    Willett, P
    [J]. QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 2002, 21 (06): : 598 - 604
  • [5] Baringhaus K.-H., 2014, CHEMOINFORMATICS DRU, P245
  • [6] A neural device for searching direct correlations between structures and properties of chemical compounds
    Baskin, II
    Palyulin, VA
    Zefirov, NS
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (04): : 715 - 721
  • [7] Bauer A., 2005, Eur. Pat. Appl., Patent No. 1568698
  • [8] KNIME:: The Konstanz Information Miner
    Berthold, Michael R.
    Cebron, Nicolas
    Dill, Fabian
    Gabriel, Thomas R.
    Koetter, Tobias
    Meinl, Thorsten
    Ohl, Peter
    Sieb, Christoph
    Thiel, Kilian
    Wiswedel, Bernd
    [J]. DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, : 319 - 326
  • [9] ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS
    CARHART, RE
    SMITH, DH
    VENKATARAGHAVAN, R
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02): : 64 - 73
  • [10] The rise of deep learning in drug discovery
    Chen, Hongming
    Engkvist, Ola
    Wang, Yinhai
    Olivecrona, Marcus
    Blaschke, Thomas
    [J]. DRUG DISCOVERY TODAY, 2018, 23 (06) : 1241 - 1250