Co-model for chemical toxicity prediction based on multi-task deep learning

被引:4
作者
Yuan Li, Yuan [1 ]
Chen, Lingfeng [1 ]
Pu, Chengtao [1 ]
Zang, Chengdong [1 ]
Yan, YingChao [1 ]
Chen, Yadong [1 ,2 ]
Zhang, Yanmin [1 ,2 ]
Liu, Haichun [1 ,2 ]
机构
[1] China Pharmaceut Univ, Sch Sci, Lab Mol Design & Drug Discovery, Nanjing, Peoples R China
[2] China Pharmaceut Univ, Sch Sci, Lab Mol Design & Drug Discovery, 639 Longmian Ave, Nanjing 211198, Peoples R China
基金
中国国家自然科学基金;
关键词
Toxicity prediction; Multi-task learning; Deep learning; Graph convolution; Integrating model;
D O I
10.1002/minf.202200257
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The toxicity of compounds is closely related to the effectiveness and safety of drug development, and accurately predicting the toxicity of compounds is one of the most challenging tasks in medicinal chemistry and pharmacology. In this paper, we construct three types of models for single and multi-tasking based on 2D and 3D descriptors, fingerprints and molecular graphs, and then validate the models with benchmark tests on the Tox21 data challenge. We found that due to the information sharing mechanism of multi-task learning, it could address the imbalance problem of the Tox21 data sets to some extent, and the prediction performance of the multi-task was significantly improved compared with the single task in general. Given the complement of the different molecular representations and modeling algorithms, we attempted to integrate them into a robust Co-Model. Our Co-Model performs well in various evaluation metrics on the test set and also achieves significant performance improvement compared to other models in the literature, which clearly demonstrates its superior predictive power and robustness.
引用
收藏
页数:19
相关论文
共 75 条
[1]   Consensus Modeling for HTS Assays Using In silico Descriptors Calculates the Best Balanced Accuracy in Tox21 Challenge [J].
Abdelaziz, Ahmed ;
Spahn-Langguth, Hilde ;
Schramm, Karl-Werner ;
Tetko, Igor, V .
FRONTIERS IN ENVIRONMENTAL SCIENCE, 2016, 4
[2]   Profiling of the Tox21 Chemical Collection for Mitochondrial Function to Identify Compounds that Acutely Decrease Mitochondrial Membrane Potential [J].
Attene-Ramos, Matias S. ;
Huang, Ruili ;
Michael, Sam ;
Witt, Kristine L. ;
Richard, Ann ;
Tice, Raymond R. ;
Simeonov, Anton ;
Austin, Christopher P. ;
Xia, Menghang .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2015, 123 (01) :49-56
[3]   Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? [J].
Bajusz, David ;
Racz, Anita ;
Heberger, Kroly .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[4]   ProTox-II: a webserver for the prediction of toxicity of chemicals [J].
Banerjee, Priyanka ;
Eckert, Andreas O. ;
Schrey, Anna K. ;
Preissner, Robert .
NUCLEIC ACIDS RESEARCH, 2018, 46 (W1) :W257-W263
[5]   PROBABILITY PLOTTING METHODS AND ORDER STATISTICS [J].
BARNETT, V .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 1975, 24 (01) :95-108
[6]   An open source chemical structure curation pipeline using RDKit [J].
Bento, A. Patricia ;
Hersey, Anne ;
Felix, Eloy ;
Landrum, Greg ;
Gaulton, Anna ;
Atkinson, Francis ;
Bellis, Louisa J. ;
De Veij, Marleen ;
Leach, Andrew R. .
JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
[7]   ROBUST TESTS FOR EQUALITY OF VARIANCES [J].
BROWN, MB ;
FORSYTHE, AB .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1974, 69 (346) :364-367
[8]   Deep Learning-Based Prediction of Drug-Induced Cardiotoxicity [J].
Cai, Chuipu ;
Guo, Pengfei ;
Zhou, Yadi ;
Zhou, Jingwei ;
Wang, Qi ;
Zhang, Fengxue ;
Fang, Jiansong ;
Cheng, Feixiong .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (03) :1073-1084
[9]  
Calders T., 2007, LECT NOTES ARTIF INT, V11, P42
[10]   Kernel k-nearest neighbor algorithm as a flexible SAR modeling tool [J].
Cao, Dong-Sheng ;
Huang, Jian-Hua ;
Yan, Jun ;
Zhang, Liang-Xiao ;
Hu, Qian-Nan ;
Xu, Qing-Song ;
Liang, Yi-Zeng .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 114 :19-23