In Silico Assessment of Chemical Biodegradability

被引:86
作者
Cheng, Feixiong [1 ]
Ikenaga, Yutaka [3 ]
Zhou, Yadi [1 ]
Yu, Yue [1 ]
Li, Weihua [1 ]
Shen, Jie [1 ]
Du, Zheng [1 ]
Chen, Lei [1 ]
Xu, Congying [1 ]
Liu, Guixia [1 ]
Lee, Philip W. [1 ,2 ]
Tang, Yun [1 ]
机构
[1] E China Univ Sci & Technol, Shanghai Key Lab New Drug Design, Sch Pharm, Shanghai 200237, Peoples R China
[2] Kyoto Univ, Grad Sch Agr, Sakyo Ku, Kyoto 6068502, Japan
[3] Natl Inst Technol & Evaluat NITE, Safety Assessment Div, Chem Management Ctr, Shibuya Ku, Tokyo 1510066, Japan
基金
中国国家自然科学基金;
关键词
READY BIODEGRADABILITY; NEAREST-NEIGHBOR; PREDICTION; CLASSIFICATION; MODELS; QSAR; APPLICABILITY; NONINHIBITORS; SUBSTRUCTURES; INHIBITORS;
D O I
10.1021/ci200622d
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Biodegradation is the principal environmental dissipation process. Due to a lack of comprehensive experimental data, high study cost and time-consuming, in silico approaches for assessing the biodegradable profiles of chemicals are encouraged and is an active current research topic. Here we developed in silico methods to estimate chemical biodegradability in the environment. At first 1440 diverse compounds tested under the Japanese Ministry of International Trade and Industry (MITI) protocol were used. Four different methods, namely support vector machine, k-nearest neighbor, naive Bayes, and C4.5 decision tree, were used to build the combinatorial classification probability models of ready versus not ready biodegradability using physicochemical descriptors and fingerprints separately. The overall predictive accuracies of the best models were more than 80% for the external test set of 164 diverse compounds. Some privileged substructures were further identified for ready or not ready biodegradable chemicals by combining information gain and substructure fragment analysis. Moreover, 27 new predicted chemicals were selected for experimental assay through the Japanese MITI test protocols, which validated that all 27 compounds were predicted correctly. The predictive accuracies of our models outperform the commonly used software of the EPI Suite. Our study provided critical tools for early assessment of biodegradability of new organic chemicals in environmental hazard assessment.
引用
收藏
页码:655 / 669
页数:15
相关论文
共 46 条
  • [1] A Simple Protocol for the Comparative Analysis of the Structure and Occurrence of Biochemical Pathways Across Superkingdoms
    Andreini, Claudia
    Bertini, Ivano
    Cavallaro, Gabriele
    Decaria, Leonardo
    Rosato, Antonio
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (03) : 730 - 738
  • [2] [Anonymous], DETECTION INTERACTIO
  • [3] [Anonymous], MACH LEARN
  • [4] [Anonymous], 2014, C4. 5: programs for machine learning
  • [5] [Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
  • [6] [Anonymous], 1992, OECD Guidelines for the Testing of Chemicals
  • [7] Assessing the accuracy of prediction algorithms for classification: an overview
    Baldi, P
    Brunak, S
    Chauvin, Y
    Andersen, CAF
    Nielsen, H
    [J]. BIOINFORMATICS, 2000, 16 (05) : 412 - 424
  • [8] Using Biowin™, Bayes, and batteries to predict ready biodegradability
    Boethling, RS
    Lynch, DG
    Jaworska, JS
    Tunkel, JL
    Thom, GC
    Webb, S
    [J]. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY, 2004, 23 (04) : 911 - 920
  • [9] Chang C.-C., LIBSVM: a Library for Support Vector Machines
  • [10] Insights into Molecular Basis of Cytochrome P450 Inhibitory Promiscuity of Compounds
    Cheng, Feixiong
    Yu, Yue
    Zhou, Yadi
    Shen, Zhonghua
    Xiao, Wen
    Liu, Guixia
    Li, Weihua
    Lee, Philip W.
    Tang, Yun
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (10) : 2482 - 2495