Linear QSAR regression models for the prediction of bioconcentration factors by physicochemical properties and structural theoretical molecular descriptors

被引:78
作者
Papa, E.
Dearden, J. C.
Gramatica, P.
机构
[1] Univ Insubria, Dept Struct & Funct Biol, QSAR Res Unit Environm Chem & Ecotoxicol, I-21100 Varese, Italy
[2] Liverpool John Moores Univ, Sch Pharm & Chem, QSAR & Modelling Res Grp, Liverpool L3 3AF, Merseyside, England
关键词
BCF; QSAR; applicability domain; external statistical validation; hydrophobic compounds;
D O I
10.1016/j.chemosphere.2006.09.079
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The development of QSAR models useful for the prediction of fish bioconcentration factor (BCF) for a wide range of different chemical classes is crucial for the assessment and prioritisation of potentially persistent bioaccumulative and toxic substances. In this study we present QSAR models for BCF developed on a wide range of chemical structural classes of environmental and toxicological interest (such as dyes and various chlorinated and brominated compounds). The aim is to provide valid QSAR models, statistically validated for predictivity, for the prediction of BCF in general, but also for problematical chemical classes such as highly hydrophobic chemicals. Several descriptors, calculated by different commercially available software packages, have been employed in order to take into account relevant information provided by physicochemical properties (octanol/water partition coefficient and water solubility) and molecular features (structural and quantum-chemical molecular descriptors). The best descriptor subsets for the models were selected using the Genetic Algorithm-Variable Subset Selection strategy (GA-VSS) and calculations were performed by ordinary least squares regression. Starting from a data set of 640 compounds (logK(ow) range from -2.34 to 12.66), we developed linear QSARs, firstly for a data set of 620 compounds (log K-ow range from -2.34 to 10.35) and secondly specifically for 87 highly hydrophobic chemicals (Log K-ow range from 6.00 to 10.35). All these models have been statistically validated (both internally by cross-validation and bootstrap and externally, by "a priori" splitting of available data by Kohonen Map-ANN in training and prediction sets) and their structural chemical domain has been verified by the leverage approach. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:351 / 358
页数:8
相关论文
共 34 条
[1]  
Atkinson AC., 1985, Plots, transformations and regression
[2]  
an introduction to graphical methods of diagnostic regression analysis
[3]  
Bintein S, 1993, SAR QSAR Environ Res, V1, P29, DOI 10.1080/10629369308028814
[4]   USE OF POLYNOMIAL EXPRESSIONS TO DESCRIBE THE BIOCONCENTRATION OF HYDROPHOBIC CHEMICALS BY FISH [J].
CONNELL, DW ;
HAWKER, DW .
ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 1988, 16 (03) :242-257
[5]   Structure/response correlations and similarity/diversity analysis by GETAWAY descriptors. 1. Theory of the novel 3D molecular descriptors [J].
Consonni, V ;
Todeschini, R ;
Pavan, M .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (03) :682-692
[6]   THE PREDICTION OF BIOCONCENTRATION IN FISH [J].
DAVIES, RP ;
DOBBS, AJ .
WATER RESEARCH, 1984, 18 (10) :1253-1262
[7]   Improved prediction of fish bioconcentration factor of hydrophobic chemicals [J].
Dearden, JC ;
Shinnawei, NM .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2004, 15 (5-6) :449-455
[8]   Non-linear modeling of bioconcentration using partition coefficients for narcotic chemicals [J].
Dimitrov, SD ;
Mekenyan, OG ;
Walker, JD .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2002, 13 (01) :177-184
[9]  
*EUR CTR EC TOX CH, 1996, 67 ECETOC
[10]   Validated QSAR prediction of OH tropospheric degradation of VOCs: Splitting into training-test sets and consensus modeling [J].
Gramatica, P ;
Pilutti, P ;
Papa, E .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (05) :1794-1802