Qualitative consensus of QSAR ready biodegradability predictions

被引:26
作者
Ballabio, Davide [1 ]
Biganzoli, Fabrizio [1 ]
Todeschini, Roberto [1 ]
Consonni, Viviana [1 ]
机构
[1] Univ Milano Bicocca, Dept Earth & Environm Sci, Milano Chemometr & QSAR Res Grp, Milan, Italy
关键词
Ready biodegradability; QSAR; consensus; Dempster-Shafer; Bayes; ULTRA EXPERT-SYSTEM; IN-SILICO MODELS; CHEMICAL-STRUCTURES; UNCERTAINTY; PERFORMANCE; REGRESSION;
D O I
10.1080/02772248.2016.1260133
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
REACH requires that all chemicals produced or imported for more than 1 ton/year must be tested for ready biodegradability, a screening test for the assessment of chemical persistence. Since REACH encourages the use of in-silico approaches, several Quantitative Structure-Activity Relationship models have been calibrated to predict this property. Predictions obtained by different models can then be merged by means of specific consensus approaches, because it has been demonstrated that data integration can increase reliability and reduce the effects of contradictory data by averaging results.This study deals with integration of ready biodegradability predictions from eight different models. A set of 416 molecules, external to the training sets of all the considered models, was retrieved from the literature. Predictions were combined with three consensus approaches: majority voting criterion, Dempster-Shafer's theory of evidence and Bayesian consensus with discrete probability distributions. The different approaches were finally evaluated on the basis of the classification performance.On the basis of the obtained results we can conclude that application of consensus methodologies resulted in a reduction of uncertainty associated with biodegradability predictions and showed apparent advantages when sources of information are jointly analysed, without compromising the quality of predictions.
引用
收藏
页码:1193 / 1216
页数:24
相关论文
共 73 条
[1]  
[Anonymous], 1983, INTERPRETATION ANAL
[2]  
[Anonymous], SOFTWARE MOL DESCRIP
[3]  
[Anonymous], 2008, Green paper on the management of bio-waste in the European Union, P1
[4]   Evaluation of artificial intelligence based models for chemical biodegradability prediction [J].
Baker, JR ;
Gamberger, D ;
Mihelcic, JR ;
Sabljic, A .
MOLECULES, 2004, 9 (12) :989-1003
[5]   A MATLAB toolbox for Principal Component Analysis and unsupervised exploration of data structure [J].
Ballabio, Davide .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 149 :1-9
[6]   2D QSAR consensus prediction for high-throughput virtual screening. An application to COX-2 inhibition modeling and screening of the NCI database [J].
Baurin, N ;
Mozziconacci, JC ;
Arnoult, E ;
Chavatte, P ;
Marot, C ;
Morin-Allory, L .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01) :276-285
[7]   KNIME:: The Konstanz Information Miner [J].
Berthold, Michael R. ;
Cebron, Nicolas ;
Dill, Fabian ;
Gabriel, Thomas R. ;
Koetter, Tobias ;
Meinl, Thorsten ;
Ohl, Peter ;
Sieb, Christoph ;
Thiel, Kilian ;
Wiswedel, Bernd .
DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, :319-326
[8]   A Bayesian Approach to Analyzing Ecotoxicological Data [J].
Billoir, Elise ;
Delignette-Muller, Marie Laure ;
Pery, Alexandre R. R. ;
Charles, Sandrine .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2008, 42 (23) :8978-8984
[9]   Predicting ready biodegradability of premanufacture notice chemicals [J].
Boethling, R ;
Lynch, DG ;
Thom, GC .
ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY, 2003, 22 (04) :837-844
[10]   SCREENING-LEVEL MODEL FOR AEROBIC BIODEGRADABILITY BASED ON A SURVEY OF EXPERT KNOWLEDGE [J].
BOETHLING, RS ;
SABLJIC, A .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 1989, 23 (06) :672-679