Predicting protein network topology clusters from chemical structure using deep learning

被引:0
作者
Akshai P. Sreenivasan
Philip J Harrison
Wesley Schaal
Damian J. Matuszewski
Kim Kultima
Ola Spjuth
机构
[1] Department of Pharmaceutical Biosciences,
[2] Uppsala University,undefined
[3] Centre for Image Analysis,undefined
[4] Department of Information Technology,undefined
[5] Uppsala University,undefined
[6] Department of Medical Sciences,undefined
[7] Uppsala University,undefined
来源
Journal of Cheminformatics | / 14卷
关键词
Deep learning; Neural networks; Drug discovery; Network topology; Machine learning;
D O I
暂无
中图分类号
学科分类号
摘要
Comparing chemical structures to infer protein targets and functions is a common approach, but basing comparisons on chemical similarity alone can be misleading. Here we present a methodology for predicting target protein clusters using deep neural networks. The model is trained on clusters of compounds based on similarities calculated from combined compound-protein and protein-protein interaction data using a network topology approach. We compare several deep learning architectures including both convolutional and recurrent neural networks. The best performing method, the recurrent neural network architecture MolPMoFiT, achieved an F1 score approaching 0.9 on a held-out test set of 8907 compounds. In addition, in-depth analysis on a set of eleven well-studied chemical compounds with known functions showed that predictions were justifiable for all but one of the chemicals. Four of the compounds, similar in their molecular structure but with dissimilarities in their function, revealed advantages of our method compared to using chemical similarity.
引用
收藏
相关论文
共 154 条
[1]  
Maggiora GM(2006)On outliers and activity cliffs-why qsar often disappoints J Chem Inf Model 46 1535-58
[2]  
Guha R(2008)Structure–activity landscape index: identifying and quantifying activity cliffs J Chem Inf Model 48 646-424
[3]  
Van Drie JH(2016)Targetnet: a web service for predicting potential drug–target interaction profiling via multi-target sar models J Comput Aided Mol Des 30 413-137
[4]  
Yao Z-J(2018)Predicting off-target binding profiles with confidence using conformal prediction Front Pharmacol 9 1256-9
[5]  
Dong J(2011)Predicting biological functions of compounds based on chemical–chemical interactions PLoS ONE 6 29491-33
[6]  
Che Y-J(2008)Prediction of compounds’ biological function (metabolic pathways) based on functional group composition Mol Divers 12 131-1409
[7]  
Zhu M-F(2012)Predicting metabolic pathways of small molecules and enzymes based on interaction information of chemicals and proteins PLoS ONE 7 1-45
[8]  
Wen M(2003)Prediction of biological activity spectra for substances: evaluation on the diverse sets of drug-like structures Curr Med Chem 10 225-49
[9]  
Wang N-N(2012)Rethinking molecular similarity: comparing compounds on the basis of biological activity ACS Chem Biol 7 1399-2370
[10]  
Wang S(2013)Exploiting pharmacological similarity to identify safety concerns-listen to what the data tells you Mol Inf 32 37-1307