Predicting the acute ecotoxicity of chemical substances by machine learning using graph theory

被引:30
作者
Takata, Michiyoshi [1 ]
Lin, Bin-Le [2 ]
Xue, Mianqiang [2 ]
Zushi, Yasuyuki [2 ]
Terada, Akihiko [1 ]
Hosomi, Masaaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Chem Engn, Tokyo, Japan
[2] Natl Inst Adv Ind Sci & Technol, Res Inst Sci Safety & Sustainabil, Tokyo, Japan
关键词
Ecotoxicity prediction; Chemical substance clustering; Machine learning; Graph theory; ECOSAR; AIST-MeRAM; CLASSIFYING ENVIRONMENTAL-POLLUTANTS; SUPPORT VECTOR REGRESSION; ACUTE TOXICITY; AQUATIC TOXICITY; FISH; MODELS;
D O I
10.1016/j.chemosphere.2019.124604
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate in silico predictions of chemical substance ecotoxicity has become an important issue in recent years. Most conventional methods, such as the Ecological Structure-Activity Relationship (ECOSAR) model, cluster chemical substances empirically based on structural information and then predict toxicity by employing a log P linear regression model. Due to empirical classification, the prediction accuracy does not improve even if new ecotoxicity test data are added. In addition, most of the conventional methods are not appropriate for predicting the ecotoxicity on inorganic and/or ionized compounds. Furthermore, a user faces difficulty in handling multiple Quantitative Structure-Activity Relationship (QSAR) formulas with one chemical substance. To overcome the flaws of the conventional methods, in this study a new method was developed that applied unsupervised machine learning and graph theory to predict acute ecotoxicity. The proposed machine learning technique is based on the large AIST-MeRAM ecotoxicity test dataset, a software program developed by the National Institute of Advanced Industry Science and Technology for Multi-purpose Ecological Risk Assessment and Management, and the Molecular ACCess System (MACCS) keys that vectorize a chemical structure to 166-bit binary information. The acute toxicity of fish, daphnids, and algae can be predicted with good accuracy, without requiring log P and linear regression models in existing methods. Results from the new method were cross-validated and compared with ECOSAR predictions and show that the new method provides better accuracy for a wider range of chemical substances, including inorganic and ionized compounds. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 35 条
[1]  
[Anonymous], 2015, Ecological Structure Activity Relationships (ECOSAR) Predictive Model
[2]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[3]   A similarity-based QSAR model for predicting acute toxicity towards the fathead minnow (Pimephales promelas) [J].
Cassotti, M. ;
Ballabio, D. ;
Todeschini, R. ;
Consonni, V. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2015, 26 (03) :217-243
[4]   In silico prediction of Tetrahymena pyriformis toxicity for diverse industrial chemicals with substructure pattern recognition and machine learning methods [J].
Cheng, Feixiong ;
Shen, Jie ;
Yu, Yue ;
Li, Weihua ;
Liu, Guixia ;
Lee, Philip W. ;
Tang, Yun .
CHEMOSPHERE, 2011, 82 (11) :1636-1643
[5]   Reoptimization of MDL keys for use in drug discovery [J].
Durant, JL ;
Leland, BA ;
Henry, DR ;
Nourse, JG .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06) :1273-1280
[6]   Visualized Networking of Co-Regulated Lipids in Human Blood Based on High-Throughput Screening Data: Implications for Exposure Assessment [J].
Gao, Shixiong ;
Wan, Yi ;
Li, Wenjuan ;
Huang, Chong .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2019, 53 (05) :2862-2872
[7]   InChI - the worldwide chemical structure identifier standard [J].
Heller, Stephen ;
McNaught, Alan ;
Stein, Stephen ;
Tchekhovskoi, Dmitrii ;
Pletnev, Igor .
JOURNAL OF CHEMINFORMATICS, 2013, 5
[8]  
Ideaconsult Ltd, 2018, TOXTR
[9]   PubChem Substance and Compound databases [J].
Kim, Sunghwan ;
Thiessen, Paul A. ;
Bolton, Evan E. ;
Chen, Jie ;
Fu, Gang ;
Gindulyte, Asta ;
Han, Lianyi ;
He, Jane ;
He, Siqian ;
Shoemaker, Benjamin A. ;
Wang, Jiyao ;
Yu, Bo ;
Zhang, Jian ;
Bryant, Stephen H. .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D1202-D1213
[10]   Prediction of acute toxicity in fish by using QSAR methods and chemical modes of action [J].
Lozano, Sylvain ;
Lescot, Elodie ;
Halm, Marie-Pierre ;
Lepailleur, Alban ;
Bureau, Ronan ;
Rault, Sylvain .
JOURNAL OF ENZYME INHIBITION AND MEDICINAL CHEMISTRY, 2010, 25 (02) :195-203