Prediction of activity cliffs on the basis of images using convolutional neural networks

被引:0
作者
Javed Iqbal
Martin Vogt
Jürgen Bajorath
机构
[1] Rheinische Friedrich-Wilhelms-Universität,Department of Life Science Informatics, B
来源
Journal of Computer-Aided Molecular Design | 2021年 / 35卷
关键词
Activity cliffs; Matched molecular pairs; Image analysis; Convolutional neural networks; Convolutional feature visualization;
D O I
暂无
中图分类号
学科分类号
摘要
An activity cliff (AC) is formed by a pair of structurally similar compounds with a large difference in potency. Accordingly, ACs reveal structure–activity relationship (SAR) discontinuity and provide SAR information for compound optimization. Herein, we have investigated the question if ACs could be predicted from image data. Therefore, pairs of structural analogs were extracted from different compound activity classes that formed or did not form ACs. From these compound pairs, consistently formatted images were generated. Image sets were used to train and test convolutional neural network (CNN) models to systematically distinguish between ACs and non-ACs. The CNN models were found to predict ACs with overall high accuracy, as assessed using alternative performance measures, hence establishing proof-of-principle. Moreover, gradient weights from convolutional layers were mapped to test compounds and identified characteristic structural features that contributed to successful predictions. Weight-based feature visualization revealed the ability of CNN models to learn chemistry from images at a high level of resolution and aided in the interpretation of model decisions with intrinsic black box character.
引用
收藏
页码:1157 / 1164
页数:7
相关论文
共 50 条
  • [1] Fernandez M(2018)Toxic Colors: the use of deep learning for predicting toxicity of compounds merely from their graphic images J Chem Inf Model 58 1533-1543
  • [2] Ban F(2019)KekuleScope: prediction of cancer cell line sensitivity and compound potency using convolutional neural networks trained on compound images J Cheminform 11 e41-1535
  • [3] Woo G(2020)Activity landscape image analysis using convolutional neural networks J Cheminform 12 e34-2942
  • [4] Hsing M(2006)On outliers and activity cliffs—why QSAR often disappoints J Chem Inf Model 46 1535-348
  • [5] Yamazaki T(2012)Exploring activity cliffs in medicinal chemistry J Med Chem 55 2932-1145
  • [6] LeBlanc E(2010)Computationally efficient algorithm to identify matched molecular pairs (MMPs) in large data sets J Chem Inf Model 50 339-2365
  • [7] Rennie PS(2012)MMP-Cliffs: systematic identification of activity cliffs on the basis of matched molecular pairs J Chem Inf Model 52 1138-1640
  • [8] Welch WJ(2012)Prediction of activity cliffs using support vector machines J Chem Inf Model 52 2354-2663
  • [9] Cherkasov A(2016)Prediction of activity cliffs using condensed graphs of reaction representations, descriptor recombination, support vector machine classification, and support vector regression J Chem Inf Model 56 1631-255
  • [10] Cortés-Ciriano I(2014)Prediction of compound potency changes in matched molecular pairs using support vector regression J Chem Inf Model 54 2654-D954