Experiments in predicting biodegradability

被引:17
作者
Blockeel, H
Dzeroski, S
Kompare, B
Kramer, S
Pfahringer, B
Van Laer, W
机构
[1] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Louvain, Belgium
[2] Jozef Stefan Inst, Dept Intelligent Syst, Ljubljana, Slovenia
[3] Univ Ljubljana, Fac Civil Engn & Geodesy, Ljubljana, Slovenia
[4] Tech Univ Munich, Dept Comp Sci, D-8000 Munich, Germany
[5] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
关键词
D O I
10.1080/08839510490279131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with the use of AI techniques in ecology. More specifically, we present a novel application of inductive logic programming (ILP) in the area of quantitative structure-activity relationships (QSARs). The activity we want to predict is the biodegradability of chemical compounds in water. In particular, the target variable is the half-life for aerobic aqueous biodegradation. Structural descriptions of chemicals in terms of atoms and bonds are derived from the chemicals' SMILES encodings. The definition of substructures is used as background knowledge. Predicting biodegradability is essentially a regression problem, but we also consider a discretized version of the target variable. We thus employ a number of relational classification and regression methods on the relational representation and compare these to propositional methods applied to different propositionalizations of the problem. We also experiment with a prediction technique that consists of merging upper and lower bound predictions into one prediction. Some conclusions are drawn concerning the applicability of machine learning systems and the merging technique in this domain and the evaluation of hypotheses.
引用
收藏
页码:157 / 181
页数:25
相关论文
共 27 条
[1]  
[Anonymous], 1987, INFORMATICA
[2]   Top-down induction of first-order logical decision trees [J].
Blockeel, H ;
De Raedt, L .
ARTIFICIAL INTELLIGENCE, 1998, 101 (1-2) :285-297
[3]  
Breiman L., 1998, CLASSIFICATION REGRE
[4]   NEW TRENDS IN STRUCTURE-BIODEGRADABILITY RELATIONSHIPS [J].
CAMBON, B ;
DEVILLERS, J .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1993, 12 (01) :49-56
[5]  
CLARK P, 1991, LECT NOTES ARTIF INT, V482, P151
[6]   Discovery of frequent DATALOG patterns [J].
Dehaspe, L ;
Toivonen, H .
DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 3 (01) :7-36
[7]  
DERAEDT L, 1995, LECT NOTES ARTIF INT, V997, P80
[8]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923
[9]  
Dzeroski S, 1999, LECT NOTES ARTIF INT, V1634, P80
[10]  
HOWARD P, 1992, USERS GUIDE BIODEGRA