Prediction of activity cliffs on the basis of images using convolutional neural networks

被引:10
作者
Iqbal, Javed [1 ]
Vogt, Martin [1 ]
Bajorath, Juergen [1 ]
机构
[1] Rheinische Friedrich Wilhelms Univ, Dept Life Sci Informat, Program Unit Chem Biol & Med Chem, B IT,LIMES, Friedrich Hirzebruch Allee 6, D-53115 Bonn, Germany
关键词
Activity cliffs; Matched molecular pairs; Image analysis; Convolutional neural networks; Convolutional feature visualization;
D O I
10.1007/s10822-021-00380-y
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An activity cliff (AC) is formed by a pair of structurally similar compounds with a large difference in potency. Accordingly, ACs reveal structure-activity relationship (SAR) discontinuity and provide SAR information for compound optimization. Herein, we have investigated the question if ACs could be predicted from image data. Therefore, pairs of structural analogs were extracted from different compound activity classes that formed or did not form ACs. From these compound pairs, consistently formatted images were generated. Image sets were used to train and test convolutional neural network (CNN) models to systematically distinguish between ACs and non-ACs. The CNN models were found to predict ACs with overall high accuracy, as assessed using alternative performance measures, hence establishing proof-of-principle. Moreover, gradient weights from convolutional layers were mapped to test compounds and identified characteristic structural features that contributed to successful predictions. Weight-based feature visualization revealed the ability of CNN models to learn chemistry from images at a high level of resolution and aided in the interpretation of model decisions with intrinsic black box character.
引用
收藏
页码:1157 / 1164
页数:8
相关论文
共 26 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Bradski G, 2000, DR DOBBS J, V25, P120
  • [3] CHINCHOR N, 1992, FOURTH MESSAGE UNDERSTANDING CONFERENCE (MUC-4), P22
  • [4] Chollet F., 2015, KERAS 20 COMPUTER SO
  • [5] KekuleScope: prediction of cancer cell line sensitivity and compound potency using convolutional neural networks trained on compound images
    Cortes-Ciriano, Isidro
    Bender, Andreas
    [J]. JOURNAL OF CHEMINFORMATICS, 2019, 11 (1)
  • [6] Culjak I., 2012, 2012 35th International Convention on Information and Communication Technology, Electronics and Microelectronics, P1725
  • [7] Toxic Colors: The Use of Deep Learning for Predicting Toxicity of Compounds Merely from Their Graphic Images
    Fernandez, Michael
    Ban, Fuqiang
    Woo, Godwin
    Hsing, Michael
    Yamazaki, Takeshi
    LeBlanc, Eric
    Rennie, Paul S.
    Welch, William J.
    Cherkasov, Artem
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (08) : 1533 - 1543
  • [8] The ChEMBL database in 2017
    Gaulton, Anna
    Hersey, Anne
    Nowotka, Michal
    Bento, A. Patricia
    Chambers, Jon
    Mendez, David
    Mutowo, Prudence
    Atkinson, Francis
    Bellis, Louisa J.
    Cibrian-Uhalte, Elena
    Davies, Mark
    Dedman, Nathan
    Karlsson, Anneli
    Magarinos, Maria Paula
    Overington, John P.
    Papadatos, George
    Smit, Ines
    Leach, Andrew R.
    [J]. NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D945 - D954
  • [9] Using Rule-Based Labels for Weak Supervised Learning A ChemNet for Transferable Chemical Property Prediction
    Goh, Garrett B.
    Siegel, Charles
    Vishnu, Abhinav
    Hodas, Nathan
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 302 - 310
  • [10] Goh GB., 2017, ARXIV170606689