Towards Global QSAR Model Building for Acute Toxicity: Munro Database Case Study

被引:30
作者
Chavan, Swapnil [1 ,2 ]
Nicholls, Ian A. [1 ,2 ,3 ]
Karlsson, Bjorn C. G. [1 ,2 ]
Rosengren, Annika M. [1 ,2 ]
Ballabio, Davide [4 ]
Consonni, Viviana [4 ]
Todeschini, Roberto [4 ]
机构
[1] Linnaeus Univ, Bioorgan & Biophys Chem Lab, Linnaeus Univ Ctr Biomaterials Chem, SE-39182 Kalmar, Sweden
[2] Linnaeus Univ, Dept Chem & Biomed Sci, SE-39182 Kalmar, Sweden
[3] Uppsala Univ, Dept Chem BMC, SE-75123 Uppsala, Sweden
[4] Univ Milano Bicocca, Dept Earth & Environm Sci, Milano Chemometr & QSAR Res Grp, IT-20126 Milan, Italy
关键词
k-nearest neighbor (k-NN); Munro database; genetic algorithm (GA); acute toxicity (LD50); GENETIC ALGORITHMS; REGRESSION;
D O I
10.3390/ijms151018162
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A series of 436 Munro database chemicals were studied with respect to their corresponding experimental LD50 values to investigate the possibility of establishing a global QSAR model for acute toxicity. Dragon molecular descriptors were used for the QSAR model development and genetic algorithms were used to select descriptors better correlated with toxicity data. Toxic values were discretized in a qualitative class on the basis of the Globally Harmonized Scheme: the 436 chemicals were divided into 3 classes based on their experimental LD50 values: highly toxic, intermediate toxic and low to non-toxic. The k-nearest neighbor (k-NN) classification method was calibrated on 25 molecular descriptors and gave a non-error rate (NER) equal to 0.66 and 0.57 for internal and external prediction sets, respectively. Even if the classification performances are not optimal, the subsequent analysis of the selected descriptors and their relationship with toxicity levels constitute a step towards the development of a global QSAR model for acute toxicity.
引用
收藏
页码:18162 / 18174
页数:13
相关论文
共 19 条
[1]   Classification tools in chemistry. Part 1: linear models. PLS-DA [J].
Ballabio, Davide ;
Consonni, Viviana .
ANALYTICAL METHODS, 2013, 5 (16) :3790-3798
[2]  
Ballabio D, 2009, INFRARED SPECTROSCOPY FOR FOOD QUALITY ANALYSIS AND CONTROL, P83, DOI 10.1016/B978-0-12-374136-3.00004-3
[3]  
Cramer G., 1976, FOOD COSMETICS TOXIC, V16, P255
[4]   Development of a QSAR for worst case estimates of acute toxicity of chemically reactive compounds [J].
Freidig, A. P. ;
Dekkers, S. ;
Verwei, M. ;
Zvinavashe, E. ;
Bessems, J. G. M. ;
van de Sandt, J. J. M. .
TOXICOLOGY LETTERS, 2007, 170 (03) :214-222
[5]  
Jolliffe I T., 2002, Principal Component Analysis, P1, DOI [DOI 10.1007/0-387-22440-8_1, DOI 10.1007/B98835]
[6]   K-NEAREST NEIGHBOR CLASSIFICATION RULE (PATTERN-RECOGNITION) APPLIED TO NUCLEAR MAGNETIC-RESONANCE SPECTRAL INTERPRETATION [J].
KOWALSKI, BR ;
BENDER, CF .
ANALYTICAL CHEMISTRY, 1972, 44 (08) :1405-&
[7]   Genetic algorithms applied to feature selection in PLS regression: how and when to use them [J].
Leardi, R ;
Gonzalez, AL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1998, 41 (02) :195-207
[8]   UNDERSTANDING AND USING GENETIC ALGORITHMS .1. CONCEPTS, PROPERTIES AND CONTEXT [J].
LUCASIUS, CB ;
KATEMAN, G .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 19 (01) :1-33
[9]  
Mason R. L., 2005, Quality Progress, V38, P83
[10]   Correlation of structural class with no-observed-effect levels: A proposal for establishing a threshold of concern [J].
Munro, IC ;
Ford, RA ;
Kennepohl, E ;
Sprenger, JG .
FOOD AND CHEMICAL TOXICOLOGY, 1996, 34 (09) :829-867