Learning Drug Functions from Chemical Structures with Convolutional Neural Networks and Random Forests

被引:60
作者
Meyer, Jesse G. [1 ,2 ,3 ]
Liu, Shengchao [4 ,5 ]
Miller, Ian J. [3 ]
Coon, Joshua J. [1 ,2 ,3 ,5 ,6 ]
Gitter, Anthony [4 ,5 ,7 ]
机构
[1] Univ Wisconsin, Dept Chem, 1101 Univ Ave, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Biomol Chem, Madison, WI 53706 USA
[3] Univ Wisconsin, Natl Ctr Quantitat Biol Complex Syst, Madison, WI 53706 USA
[4] Univ Wisconsin, Dept Comp Sci, 1210 W Dayton St, Madison, WI 53706 USA
[5] Univ Wisconsin, Morgridge Inst Res, Madison, WI 53706 USA
[6] Univ Wisconsin, DOE Great Lakes Bioenergy Res Ctr, Madison, WI 53706 USA
[7] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53706 USA
关键词
GROWTH-FACTOR RECEPTOR; APPLICABILITY DOMAIN; CLASSIFICATION; DISCOVERY; TOXICITY; MODELS; SMILES; SPACE; GENERATION; REGRESSION;
D O I
10.1021/acs.jcim.9b00236
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Empirical testing of chemicals for drug efficacy costs many billions of dollars every year. The ability to predict the action of molecules in silico would greatly increase the speed and decrease the cost of prioritizing drug leads. Here, we asked whether drug function, defined as MeSH "therapeutic use" classes, can be predicted from only a chemical structure. We evaluated two chemical-structure-derived drug classification methods, chemical images with convolutional neural networks and molecular fingerprints with random forests, both of which outperformed previous predictions that used drug-induced transcriptomic changes as chemical representations. This suggests that the structure of a chemical contains at least as much information about its therapeutic use as the transcriptional cellular response to that chemical. Furthermore, because training data based on chemical structure is not limited to a small set of molecules for which transcriptomic measurements are available, our strategy can leverage more training data to significantly improve predictive accuracy to 83-88%. Finally, we explore use of these models for prediction of side effects and drug-repurposing opportunities and demonstrate the effectiveness of this modeling strategy for multilabel classification.
引用
收藏
页码:4438 / 4449
页数:12
相关论文
共 72 条
[1]   Epidermal growth factor receptor is a common mediator of quinone-induced signaling leading to phosphorylation of connexin-43 - Role of glutathione and tyrosine phosphatases [J].
Abdelmohsen, K ;
Gerber, PA ;
von Montfort, C ;
Sies, H ;
Klotz, LO .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2003, 278 (40) :38360-38367
[2]   Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data [J].
Aliper, Alexander ;
Plis, Sergey ;
Artemov, Artem ;
Ulloa, Alvaro ;
Mamoshina, Polina ;
Zhavoronkov, Alex .
MOLECULAR PHARMACEUTICS, 2016, 13 (07) :2524-2530
[3]   Low Data Drug Discovery with One-Shot Learning [J].
Altae-Tran, Han ;
Ramsundar, Bharath ;
Pappu, Aneesh S. ;
Pande, Vijay .
ACS CENTRAL SCIENCE, 2017, 3 (04) :283-293
[4]   Toxicity Testing in the 21st Century: Bringing the Vision to Life [J].
Andersen, Melvin E. ;
Krewski, Daniel .
TOXICOLOGICAL SCIENCES, 2009, 107 (02) :324-330
[5]  
[Anonymous], ARXIV180306236
[6]  
[Anonymous], 2018, CHEMRXIV
[7]   EVALUATION OF THE POTASSIUM CHANNEL ACTIVATOR CROMAKALIM (BRL-34915) AS A BRONCHODILATOR IN THE GUINEA-PIG - COMPARISON WITH NIFEDIPINE [J].
ARCH, JRS ;
BUCKLE, DR ;
BUMSTEAD, J ;
CLARKE, GD ;
TAYLOR, JF ;
TAYLOR, SG .
BRITISH JOURNAL OF PHARMACOLOGY, 1988, 95 (03) :763-770
[8]   Trospium Chloride Treatment of Overactive Bladder [J].
Biastre, Kelly ;
Burnakis, Thomas .
ANNALS OF PHARMACOTHERAPY, 2009, 43 (02) :283-295
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   PRESCHOOL-CHILDREN CAN LEARN TO TRANSFER - LEARNING TO LEARN AND LEARNING FROM EXAMPLE [J].
BROWN, AL ;
KANE, MJ .
COGNITIVE PSYCHOLOGY, 1988, 20 (04) :493-523