Random forest algorithm-based accurate prediction of rat acute oral toxicity

被引:3
作者
Xiao, Linrong [1 ]
Deng, Jiyong [1 ]
Yang, Liping [2 ]
Huang, Xianwei [1 ]
Yu, Xinliang [1 ]
机构
[1] Hunan Inst Engn, Coll Mat & Chem Engn, Hunan Prov Key Lab Environm Catalysis & Waste Reg, Xiangtan 411104, Hunan, Peoples R China
[2] Shenzhen Expressway Environm Co Ltd, Shenzhen 518048, Guangdong, Peoples R China
关键词
Toxicity; LD50; QSAR; random forest; APPLICABILITY DOMAIN; MAMMALIAN TOXICITY; VALIDATION; MODELS; ERROR;
D O I
10.1080/00268976.2022.2140083
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Predicting acute oral toxicity LD50 of chemicals in rats is a challenge since many factors affect toxicity data. In this paper, 40 descriptors were successfully used to develop a quantitative structure-activity relationship model for 8448 rat acute oral toxicity logLD50 by applying the random forest (RF) algorithm. To develop the optimal RF model, a training set (5914 chemicals) was used to establish models, a validation set (1267 chemicals) used to tune RF parameters and a test set (1267 chemicals) used to assess the performance of RF models. It yielded correlation coefficients R of 0.9695 and rms errors (log unit) of 0.3171 for the training set, R = 0.8322 and rms = 0.2889 for the validation set and R = 0.8335 and rms = 0.3060 for the test set. More than 99% of rat acute oral toxicity logLD50 in the dataset can be accurately predicted, although the dataset is large.
引用
收藏
页数:6
相关论文
共 41 条
  • [1] [Anonymous], 2016, DRAGON SOFTWARE MOL
  • [2] First report of q-RASAR modeling toward an approach of easy interpretability and efficient transferability
    Banerjee, Arkaprava
    Roy, Kunal
    [J]. MOLECULAR DIVERSITY, 2022, 26 (05) : 2847 - 2862
  • [3] Prediction of mammalian toxicity of organophosphorus pesticides from QSTR modeling
    Devillers, J
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2004, 15 (5-6) : 501 - 510
  • [4] Prediction of acute mammalian toxicity from QSARs and interspecies correlations
    Devillers, J.
    Devillers, H.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2009, 20 (5-6) : 467 - 500
  • [5] Comparison of in silico tools for evaluating rat oral acute toxicity
    Diaza, R. G.
    Manganelli, S.
    Esposito, A.
    Roncaglioni, A.
    Manganaro, A.
    Benfenati, E.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2015, 26 (01) : 1 - 27
  • [6] QSAR and Classification Study on Prediction of Acute Oral Toxicity of N-Nitroso Compounds
    Fan, Tengjiao
    Sun, Guohui
    Zhao, Lijiao
    Cui, Xin
    Zhong, Rugang
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (10)
  • [7] Random forest algorithm-based accurate prediction of chemical toxicity to Tetrahymena pyriformis
    Fang, Zhengjun
    Yu, Xinliang
    Zeng, Qun
    [J]. TOXICOLOGY, 2022, 480
  • [8] SAR and QSAR modeling of a large collection of LD50 rat acute oral toxicity data
    Gadaleta, Domenico
    Vukovic, Kristijan
    Toma, Cosimo
    Lavado, Giovanna J.
    Karmaus, Agnes L.
    Mansouri, Kamel
    Kleinstreuer, Nicole C.
    Benfenati, Emilio
    Roncaglioni, Alessandra
    [J]. JOURNAL OF CHEMINFORMATICS, 2019, 11 (01)
  • [9] A new semi-automated workflow for chemical data retrieval and quality checking for modeling applications
    Gadaleta, Domenico
    Lombardo, Anna
    Toma, Cosimo
    Benfenati, Emilio
    [J]. JOURNAL OF CHEMINFORMATICS, 2018, 10
  • [10] Gissi A, 2017, COMPREHENSIVE MEDICINAL CHEMISTRY III, VOL 1: GENERAL PERSPECTIVE - THE FUTURE OF DRUG DISCOVERY, P472, DOI 10.1016/B978-0-12-409547-2.12300-5