Predicting the aquatic toxicity mode of action using logistic regression and linear discriminant analysis

被引:26
作者
Ren, Y. Y. [1 ]
Zhou, L. C. [2 ]
Yang, L. [1 ]
Liu, P. Y. [1 ]
Zhao, B. W. [1 ]
Liu, H. X. [3 ]
机构
[1] Lanzhou Jiaotong Univ, Sch Environm & Municipal Engn, Lanzhou, Peoples R China
[2] Lanzhou Univ, Coll Chem & Chem Engn, Lanzhou, Peoples R China
[3] Lanzhou Univ, Sch Pharm, Lanzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Logistic regression; linear discriminant analysis; modes of aquatic toxic action; CODESSA; DRAGON; influence plot; ORGANIC-CHEMICALS; TETRAHYMENA-PYRIFORMIS; CLASSIFICATION MODELS; POLAR NARCOSIS; QSAR; MECHANISMS; PHENOLS; DESCRIPTORS; VALIDATION; POLLUTANTS;
D O I
10.1080/1062936X.2016.1229691
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The paper highlights the use of the logistic regression (LR) method in the construction of acceptable statistically significant, robust and predictive models for the classification of chemicals according to their aquatic toxic modes of action. Essentials accounting for a reliable model were all considered carefully. The model predictors were selected by stepwise forward discriminant analysis (LDA) from a combined pool of experimental data and chemical structure-based descriptors calculated by the CODESSA and DRAGON software packages. Model predictive ability was validated both internally and externally. The applicability domain was checked by the leverage approach to verify prediction reliability. The obtained models are simple and easy to interpret. In general, LR performs much better than LDA and seems to be more attractive for the prediction of the more toxic compounds, i.e. compounds that exhibit excess toxicity versus non-polar narcotic compounds and more reactive compounds versus less reactive compounds. In addition, model fit and regression diagnostics was done through the influence plot which reflects the hat-values, studentized residuals, and Cook's distance statistics of each sample. Overdispersion was also checked for the LR model. The relationships between the descriptors and the aquatic toxic behaviour of compounds are also discussed.
引用
收藏
页码:721 / 746
页数:26
相关论文
共 66 条
[1]  
[Anonymous], 1985, CHEMDRAW CHEM STRUCT
[2]  
[Anonymous], 2000, HYPERCHEM REL 6 0 WI
[3]  
[Anonymous], 2013, TALETE SRL DRAGON SO
[4]  
Aptula AO, 2002, QUANT STRUCT-ACT REL, V21, P12, DOI 10.1002/1521-3838(200205)21:1<12::AID-QSAR12>3.0.CO
[5]  
2-M
[6]   MODE OF ACTION AND THE ASSESSMENT OF CHEMICAL HAZARDS IN THE PRESENCE OF LIMITED DATA - USE OF STRUCTURE-ACTIVITY-RELATIONSHIPS (SAR) UNDER TSCA, SECTION-5 [J].
AUER, CM ;
NABHOLZ, JV ;
BAETCKE, KP .
ENVIRONMENTAL HEALTH PERSPECTIVES, 1990, 87 :183-197
[7]  
Basak SC, 1998, ENVIRON TOXICOL CHEM, V17, P1056
[8]  
Bhhatarai B, 2011, ENVIRON SCI TECHNOL, V45, P8120, DOI [10.1021/es101181g, 10.1021/es200106q]
[9]  
Bradbury S P, 1994, SAR QSAR Environ Res, V2, P89, DOI 10.1080/10629369408028842
[10]   Performance of Kier-hall E-state descriptors in quantitative structure activity relationship (QSAR) studies of multifunctional molecules [J].
Butina, D .
MOLECULES, 2004, 9 (12) :1004-1009