Co-model for chemical toxicity prediction based on multi-task deep learning

被引:4
作者
Yuan Li, Yuan [1 ]
Chen, Lingfeng [1 ]
Pu, Chengtao [1 ]
Zang, Chengdong [1 ]
Yan, YingChao [1 ]
Chen, Yadong [1 ,2 ]
Zhang, Yanmin [1 ,2 ]
Liu, Haichun [1 ,2 ]
机构
[1] China Pharmaceut Univ, Sch Sci, Lab Mol Design & Drug Discovery, Nanjing, Peoples R China
[2] China Pharmaceut Univ, Sch Sci, Lab Mol Design & Drug Discovery, 639 Longmian Ave, Nanjing 211198, Peoples R China
基金
中国国家自然科学基金;
关键词
Toxicity prediction; Multi-task learning; Deep learning; Graph convolution; Integrating model;
D O I
10.1002/minf.202200257
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The toxicity of compounds is closely related to the effectiveness and safety of drug development, and accurately predicting the toxicity of compounds is one of the most challenging tasks in medicinal chemistry and pharmacology. In this paper, we construct three types of models for single and multi-tasking based on 2D and 3D descriptors, fingerprints and molecular graphs, and then validate the models with benchmark tests on the Tox21 data challenge. We found that due to the information sharing mechanism of multi-task learning, it could address the imbalance problem of the Tox21 data sets to some extent, and the prediction performance of the multi-task was significantly improved compared with the single task in general. Given the complement of the different molecular representations and modeling algorithms, we attempted to integrate them into a robust Co-Model. Our Co-Model performs well in various evaluation metrics on the test set and also achieves significant performance improvement compared to other models in the literature, which clearly demonstrates its superior predictive power and robustness.
引用
收藏
页数:19
相关论文
共 75 条
[41]   hERG-Att: Self-attention-based deep neural network for predicting hERG blockers [J].
Kim, Hyunho ;
Nam, Hojung .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 87
[42]   Can the pharmaceutical industry reduce attrition rates? [J].
Kola, I ;
Landis, J .
NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (08) :711-715
[43]   Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets [J].
Korotcov, Alexandru ;
Tkachenko, Valery ;
Russo, Daniel P. ;
Ekins, Sean .
MOLECULAR PHARMACEUTICS, 2017, 14 (12) :4462-4475
[44]   Predictive Toxicology: Modeling Chemical Induced Toxicological Response Combining Circular Fingerprints with Random Forest and Support Vector Machine [J].
Koutsoukas, Alexios ;
St Amend, Joseph ;
Mishra, Meenakshi ;
Huan, Jun .
FRONTIERS IN ENVIRONMENTAL SCIENCE, 2016, 4
[45]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444
[46]  
Li, 2019, ARXIV
[47]   COMPARISON OF PREDICTED AND OBSERVED SECONDARY STRUCTURE OF T4 PHAGE LYSOZYME [J].
MATTHEWS, BW .
BIOCHIMICA ET BIOPHYSICA ACTA, 1975, 405 (02) :442-451
[48]   Large-scale comparison of machine learning methods for drug target prediction on ChEMBL [J].
Mayr, Andreas ;
Klambauer, Guenter ;
Unterthiner, Thomas ;
Steijaert, Marvin ;
Wegner, Jorg K. ;
Ceulemans, Hugo ;
Clevert, Djork-Arne ;
Hochreiter, Sepp .
CHEMICAL SCIENCE, 2018, 9 (24) :5441-5451
[49]   DeepTox: Toxicity Prediction using Deep Learning [J].
Mayr, Andreas ;
Klambauer, Gunter ;
Unterthiner, Thomas ;
Hochreiter, Sepp .
FRONTIERS IN ENVIRONMENTAL SCIENCE, 2016, 3
[50]   Mordred: a molecular descriptor calculator [J].
Moriwaki, Hirotomo ;
Tian, Yu-Shi ;
Kawashita, Norihito ;
Takagi, Tatsuya .
JOURNAL OF CHEMINFORMATICS, 2018, 10