Toward Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-Based Convolutional Encoders

被引:91
作者
Manica, Matteo [1 ]
Oskooei, Ali [1 ]
Born, Jannis [1 ,2 ,3 ]
Subramanian, Vigneshwari [4 ]
Saez-Rodriguez, Julio [5 ]
Martinez, Maria Rodriguez [1 ]
机构
[1] IBM Res, CH-8803 Zurich, Switzerland
[2] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[3] Univ Zurich, CH-8006 Zurich, Switzerland
[4] Rhein Westfal TH Aachen, D-52056 Aachen, Germany
[5] Heidelberg Univ, D-69047 Heidelberg, Germany
基金
欧盟地平线“2020”;
关键词
drug sensitivity prediction; computational systems biology; deep learning; machine learning; drug discovery; multiscale; multimodal; attention; CNN; RNN; explainability; interpretability; molecular networks; molecular fingerprints; GDSC; SMILES; gene expression; drug sensitivity; anticancer compounds; IC50; EC50; lead discovery; personalized medicine; precision medicine; DRUG-SENSITIVITY; CANCER; GENE; DISCOVERY; RESOURCE;
D O I
10.1021/acs.molpharmaceut.9b00520
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors, and prior knowledge on intracellular interactions from protein-protein interaction networks. We demonstrate that our multiscale convolutional attention-based encoder significantly outperforms a baseline model trained on Morgan fingerprints and a selection of encoders based on SMILES, as well as the previously reported state-of-the-art for multimodal drug sensitivity prediction (R-2 = 0.86 and RMSE = 0.89). Moreover, the explainability of our approach is demonstrated by a thorough analysis of the attention weights. We show that the attended genes significantly enrich apoptotic processes and that the drug attention is strongly correlated with a standard chemical structure similarity index. Finally, we report a case study of two receptor tyrosine kinase (RTK) inhibitors acting on a leukemia cell line, showcasing the ability of the model to focus on informative genes and submolecular regions of the two compounds. The demonstrated generalizability and the interpretability of our model testify to its potential for in silico prediction of anticancer compound efficacy on unseen cancer cells, positioning it as a valid solution for the development of personalized therapies as well as for the evaluation of candidate compounds in de novo drug design.
引用
收藏
页码:4797 / 4806
页数:10
相关论文
共 73 条
[1]   Machine learning and feature selection for drug response prediction in precision oncology applications [J].
Ali M. ;
Aittokallio T. .
Biophysical Reviews, 2019, 11 (1) :31-39
[2]   Integrative and Personalized QSAR Analysis in Cancer by Kernelized Bayesian Matrix Factorization [J].
Amnnad-ud-din, Muhammad ;
Georgii, Elisabeth ;
Gonen, Mehmet ;
Laitinen, Tuomo ;
Kallioniemi, Olli ;
Wennerberg, Krister ;
Poso, Antti ;
Kaski, Samuel .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (08) :2347-2359
[3]  
[Anonymous], PHARMA R D ANN REV 2
[4]  
[Anonymous], ARXIV180301271CSLG
[5]  
[Anonymous], ARXIV160206289CSCL
[6]  
[Anonymous], SCI REP UK
[7]  
[Anonymous], ARXIV171107839CSLG
[8]  
[Anonymous], ARXIV14090473CSCL
[9]  
[Anonymous], P DEEP LEARN WORKSH
[10]  
[Anonymous], ARXIV181204439CSLG