Machine learning models for classification tasks related to drug safety

被引:35
作者
Racz, Anita [1 ]
Bajusz, David [2 ]
Miranda-Quintana, Ramon Alain [3 ,4 ]
Heberger, Karoly [1 ]
机构
[1] Res Ctr Nat Sci, Plasma Chem Res Grp, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
[2] Res Ctr Nat Sci, Med Chem Res Grp, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
[3] Univ Florida, Dept Chem, Gainesville, FL 32603 USA
[4] Univ Florida, Quantum Theory Project, Gainesville, FL 32603 USA
关键词
ADMET; Toxicity; Big data; QSAR; In silico modeling; Machine learning; BLOOD-BRAIN-BARRIER; IN-SILICO PREDICTION; POTASSIUM CHANNEL BLOCKAGE; P-GLYCOPROTEIN; ADMET EVALUATION; NEURAL-NETWORKS; EYE IRRITATION; CARCINOGENICITY; PERMEABILITY; INHIBITORS;
D O I
10.1007/s11030-021-10239-x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this review, we outline the current trends in the field of machine learning-driven classification studies related to ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints from the past six years (2015-2021). The study focuses only on classification models with large datasets (i.e. more than a thousand compounds). A comprehensive literature search and meta-analysis was carried out for nine different targets: hERG-mediated cardiotoxicity, blood-brain barrier penetration, permeability glycoprotein (P-gp) substrate/inhibitor, cytochrome P450 enzyme family, acute oral toxicity, mutagenicity, carcinogenicity, respiratory toxicity and irritation/corrosion. The comparison of the best classification models was targeted to reveal the differences between machine learning algorithms and modeling types, endpoint-specific performances, dataset sizes and the different validation protocols. Based on the evaluation of the data, we can say that tree-based algorithms are (still) dominating the field, with consensus modeling being an increasing trend in drug safety predictions. Although one can already find classification models with great performances to hERG-mediated cardiotoxicity and the isoenzymes of the cytochrome P450 enzyme family, these targets are still central to ADMET-related research efforts.
引用
收藏
页码:1409 / 1424
页数:16
相关论文
共 119 条
[41]   New experimental models of the blood-brain barrier for CNS drug discovery [J].
Kaisar, Mohammad A. ;
Sajja, Ravi K. ;
Prasad, Shikha ;
Abhyankar, Vinay V. ;
Liles, Taylor ;
Cucullo, Luca .
EXPERT OPINION ON DRUG DISCOVERY, 2017, 12 (01) :89-103
[42]  
KATO H, 2019, DRUG METAB PHARMACOK
[43]   hERG-Att: Self-attention-based deep neural network for predicting hERG blockers [J].
Kim, Hyunho ;
Nam, Hojung .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 87
[44]   hERG liability classification models using machine learning techniques [J].
Konda L.S.K. ;
Keerthi Praba S. ;
Kristam R. .
Computational Toxicology, 2019, 12
[45]   K-NEAREST NEIGHBOR CLASSIFICATION RULE (PATTERN-RECOGNITION) APPLIED TO NUCLEAR MAGNETIC-RESONANCE SPECTRAL INTERPRETATION [J].
KOWALSKI, BR ;
BENDER, CF .
ANALYTICAL CHEMISTRY, 1972, 44 (08) :1405-&
[46]  
Kramer O., 2013, Dimensionality Reduction with Unsupervised Nearest Neighbors, DOI [10.1007/978-3-642-38652-7_2, DOI 10.1007/978-3-642-38652-72]
[47]   Consensus statement for the diagnosis and treatment of drug-induced lung injuries [J].
Kubo, Keishi ;
Azuma, Arata ;
Kanazawa, Minoru ;
Kameda, Hideto ;
Kusumoto, Masahiko ;
Genma, Akihiko ;
Saijo, Yasuo ;
Sakai, Fumikazu ;
Sugiyama, Yukihiko ;
Tatsumi, Koichiro ;
Dohi, Makoto ;
Tokuda, Hitoshi ;
Hashimoto, Shu ;
Hattori, Noboru ;
Hanaoka, Masayuki ;
Fukuda, Yuh .
RESPIRATORY INVESTIGATION, 2013, 51 (04) :260-277
[48]  
Langley, 1995, Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, P338
[49]   Computational determination of hERG-related cardiotoxicity of drug candidates [J].
Lee, Hyang-Mi ;
Yu, Myeong-Sang ;
Kazmi, Sayada Reemsha ;
Oh, Seong Yun ;
Rhee, Ki-Hyeong ;
Bae, Myung-Ae ;
Lee, Byung Ho ;
Shin, Dae-Seop ;
Oho, Kwang-Seok ;
Ceong, Hyithaek ;
Lee, Donghyun ;
Na, Dokyun .
BMC BIOINFORMATICS, 2019, 20 (Suppl 10)
[50]   In silico prediction of multiple-category classification model for cytochrome P450 inhibitors and non-inhibitors using machine-learning method [J].
Lee, J. H. ;
Basith, S. ;
Cui, M. ;
Kim, B. ;
Choi, S. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2017, 28 (10) :863-874