In silico prediction of toxicity of non-congeneric industrial chemicals using ensemble learning based modeling approaches

被引:24
作者
Singh, Kunwar P. [1 ]
Gupta, Shikha
机构
[1] Acad Sci & Innovat Res, New Delhi 110001, India
关键词
Toxicity; Diverse chemicals; Ensemble learning models; Interspecies model; Molecular descriptors; Regulatory toxicology; SUPPORT VECTOR MACHINES; ACUTE AQUATIC TOXICITY; TETRAHYMENA-PYRIFORMIS; NEURAL-NETWORK; QSAR MODELS; MOLECULAR DESCRIPTORS; APPLICABILITY DOMAINS; AROMATIC-COMPOUNDS; PHENOLS; VALIDATION;
D O I
10.1016/j.taap.2014.01.006
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Ensemble learning approach based decision treeboost (DTB) and decision tree forest (DTF) models are introduced in order to establish quantitative structure-toxicity relationship (QSTR) for the prediction of toxicity of 1450 diverse chemicals. Eight non-quantum mechanical molecular descriptors were derived. Structural diversity of the chemicals was evaluated using Tanimoto similarity index. Stochastic gradient boosting and bagging algorithms supplemented DTB and DTF models were constructed for classification and function optimization problems using the toxicity end-point in T. pyriformis. Special attention was drawn to prediction ability and robustness of the models, investigated both in external and 10-fold cross validation processes. In complete data, optimal DTB and DTF models rendered accuracies of 98.90%, 98.83% in two-category and 98.14%, 98.14% in four-category toxicity classifications. Both the models further yielded classification accuracies of 100% in external toxicity data of T. pyriformis. The constructed regression models (DTB and DTF) using five descriptors yielded correlation coefficients (R-2) of 0.945, 0.944 between the measured and predicted toxicities with mean squared errors (MSEs) of 0.059, and 0.064 in complete T. pyriformis data. The T. pyriformis regression models (DTB and DTF) applied to the external toxicity data sets yielded R-2 and MSE values of 0.637, 0.655; 0.534, 0.507 (marine bacteria) and 0.741, 0.691; 0.155, 0.173 (algae). The results suggest for wide applicability of the inter-species models in predicting toxicity of new chemicals for regulatory purposes. These approaches provide useful strategy and robust tools in the screening of ecotoxicological risk or environmental hazard potential of chemicals. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:198 / 212
页数:15
相关论文
共 96 条
  • [1] Abdelaal M.M.A., 2010, ONLINE J MATH STAT, V1, P1
  • [2] [Anonymous], 2010, PHARM LETT
  • [3] Chemistry-toxicity relationships for the effects of Di-and trihydroxybenzenes to Tetrahymena pyriformis
    Aptula, AO
    Roberts, DW
    Cronin, MTD
    Schultz, TW
    [J]. CHEMICAL RESEARCH IN TOXICOLOGY, 2005, 18 (05) : 844 - 854
  • [4] QSAR analysis of the toxicity of nitroaromatics in Tetrahymena pyriformis: structural factors and possible modes of action
    Artemenko, A. G.
    Muratov, E. N.
    Kuz'min, V. E.
    Muratov, N. N.
    Varlamova, E. V.
    Kuz'mina, A. V.
    Gorb, L. G.
    Golius, A.
    Hill, F. C.
    Leszczynski, J.
    Tropsha, A.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2011, 22 (5-6) : 575 - 601
  • [5] Benigni R, 2008, ANN I SUPER SANITA, V44, P48
  • [6] The expanding role of predictive toxicology: An update on the (Q) SAR models for mutagens and carcinogens
    Benigni, Romualdo
    Netzeva, Tatiana I.
    Benfenati, Emilio
    Bossa, Cecilia
    Franke, Rainer
    Helma, Christoph
    Hulzebos, Etje
    Marchant, Carol
    Richard, Ann
    Woo, Yin-Tak
    Yang, Chihae
    [J]. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH PART C-ENVIRONMENTAL CARCINOGENESIS & ECOTOXICOLOGY REVIEWS, 2007, 25 (01) : 53 - 97
  • [7] Breiman L, 1996, ANN STAT, V24, P2350
  • [8] An automated group contribution method in predicting aquatic toxicity: The diatomic fragment approach
    Casalegno, M
    Benfenati, E
    Sello, G
    [J]. CHEMICAL RESEARCH IN TOXICOLOGY, 2005, 18 (04) : 740 - 746
  • [9] Castillo-Garit J.A., 2009, 13 INT EL C SYNTH OR, P1
  • [10] In silico prediction of Tetrahymena pyriformis toxicity for diverse industrial chemicals with substructure pattern recognition and machine learning methods
    Cheng, Feixiong
    Shen, Jie
    Yu, Yue
    Li, Weihua
    Liu, Guixia
    Lee, Philip W.
    Tang, Yun
    [J]. CHEMOSPHERE, 2011, 82 (11) : 1636 - 1643