Machine Learning Toxicity Prediction: Latest Advances by Toxicity End Point

被引:91
作者
Cavasotto, Claudio N. [1 ,3 ,4 ]
Scardino, Valeria [1 ,2 ]
机构
[1] Univ Austral, Austral Inst Appl Artificial Intelligence, B1629AHJ, Pilar, Buenos Aires, Argentina
[2] Meton Inc, Wilmington, DE 19801 USA
[3] Univ Austral, Computat Drug Design & Biomed Informat Lab, Inst Invest Med Traslac IIMT, CONICET, B1629AHJ, Pilar, Buenos Aires, Argentina
[4] Univ Austral, Fac Ciencias Biomed, Fac Ingn, B1630FHB, Pilar, Buenos Aires, Argentina
关键词
IN-SILICO PREDICTION; DRUG DISCOVERY; CLASSIFICATION MODELS; ADMET EVALUATION; WEB SERVER; BLACK-BOX; HERG; DATABASE; TOXICOLOGY; INTERPRETABILITY;
D O I
10.1021/acsomega.2c05693
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Machine learning (ML) models to predict the toxicity of small molecules have garnered great attention and have become widely used in recent years. Computational toxicity prediction is particularly advantageous in the early stages of drug discovery in order to filter out molecules with high probability of failing in clinical trials. This has been helped by the increase in the number of large toxicology databases available. However, being an area of recent application, a greater understanding of the scope and applicability of ML methods is still necessary. There are various kinds of toxic end points that have been predicted in silico. Acute oral toxicity, hepatotoxicity, cardiotoxicity, mutagenicity, and the 12 Tox21 data end points are among the most commonly investigated. Machine learning methods exhibit different performances on different data sets due to dissimilar complexity, class distributions, or chemical space covered, which makes it hard to compare the performance of algorithms over different toxic end points. The general pipeline to predict toxicity using ML has already been analyzed in various reviews. In this contribution, we focus on the recent progress in the area and the outstanding challenges, making a detailed description of the state-of-the-art models implemented for each toxic end point. The type of molecular representation, the algorithm, and the evaluation metric used in each research work are explained and analyzed. A detailed description of end points that are usually predicted, their clinical relevance, the available databases, and the challenges they bring to the field are also highlighted.
引用
收藏
页码:47536 / 47546
页数:11
相关论文
共 94 条
[21]   Machine learning-Predicting Ames mutagenicity of small molecules [J].
Chu, Charmaine S. M. ;
Simpson, Jack D. ;
O'Neill, Paul M. ;
Berry, Neil G. .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2021, 109
[22]   Synergistic target combination prediction from curated signaling networks: Machine learning meets systems biology and pharmacology [J].
Chua, Huey Eng ;
Bhowmick, Sourav S. ;
Tucker-Kellogg, Lisa .
METHODS, 2017, 129 :60-80
[23]   Advancing Computational Toxicology in the Big Data Era by Artificial Intelligence: Data-Driven and Mechanism-Driven Modeling for Chemical Toxicity [J].
Ciallella, Heather L. ;
Zhu, Hao .
CHEMICAL RESEARCH IN TOXICOLOGY, 2019, 32 (04) :536-547
[24]   Structure-Based Prediction of hERG-Related Cardiotoxicity: A Benchmark Study [J].
Creanza, Teresa Maria ;
Delre, Pietro ;
Ancona, Nicola ;
Lentini, Giovanni ;
Saviano, Michele ;
Mangiatordi, Giuseppe Felice .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (09) :4758-4770
[25]   The Role of Genetic Toxicology in Drug Discovery and Optimization [J].
Custer, L. L. ;
Sweder, K. S. .
CURRENT DRUG METABOLISM, 2008, 9 (09) :978-985
[26]   Descriptors and their selection methods in QSAR analysis: paradigm for drug design [J].
Danishuddin ;
Khan, Asad U. .
DRUG DISCOVERY TODAY, 2016, 21 (08) :1291-1302
[27]   Effect of missing data on multitask prediction methods [J].
de Leon, Antonio de la Vega ;
Chen, Beining ;
Gillet, Valerie J. .
JOURNAL OF CHEMINFORMATICS, 2018, 10
[28]   A Machine Learning Model to Predict Drug Transfer Across the Human Placenta Barrier [J].
Di Filippo, Juan, I ;
Bollini, Mariela ;
Cavasotto, Claudio N. .
FRONTIERS IN CHEMISTRY, 2021, 9
[29]   Compilation and physicochemical classification analysis of a diverse hERG inhibition database [J].
Didziapetris, Remigijus ;
Lanevskij, Kiril .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2016, 30 (12) :1175-1188
[30]   Trends in clinical success rates and therapeutic focus [J].
Dowden, Helen ;
Munro, Jamie .
NATURE REVIEWS DRUG DISCOVERY, 2019, 18 (07) :494-495