Learning classification rules from data

被引:23
作者
An, A [1 ]
机构
[1] York Univ, Dept Comp Sci, Toronto, ON M3J 1P3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
machine learning; rule induction; classification; data mining; artificial intelligence;
D O I
10.1016/S0898-1221(03)00034-8
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We present ELEM2, a machine learning system that induces classification rules from a set of data based on a heuristic search over a hypothesis space. ELEM2 is distinguished from other rule induction systems in three aspects. First, it uses a new heuristic function to guide the heuristic search. The function reflects the degree of relevance of an attribute-value pair to a target concept and leads to selection of the most relevant pairs for formulating rules. Second, ELEM2 handles inconsistent training examples by defining an unlearnable region of a concept based on the probability distribution of that concept in the training data. The unlearnable region is used as a stopping criterion for the concept learning process, which resolves conflicts without removing inconsistent examples. Third, ELEM2 employs a new rule quality measure in its post-pruning process to prevent rules from overfitting the data. The rule quality formula measures the extent to which a rule can discriminate between the positive and negative examples of a class. We describe features of ELEM2, its rule induction algorithm and its classification procedure. We report experimental results that compare ELEM2 with C4.5 and CN2 on a number of datasets. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:737 / 748
页数:12
相关论文
共 12 条
[1]  
AN A, 1999, P 3 PAC AS C KNOWL D
[2]  
AN A, 1997, THESIS U REGINA REGI
[3]  
CENDROWSKA J, 1988, KNOWLEDGE ACQUISITIO
[4]   PRO-OPIOMELANOCORTIN MESSENGER-RNA SIZE HETEROGENEITY IN ACTH-DEPENDENT CUSHINGS-SYNDROME [J].
CLARK, AJL ;
LAVENDER, PM ;
BESSER, GM ;
REES, LH .
JOURNAL OF MOLECULAR ENDOCRINOLOGY, 1989, 2 (01) :3-9
[5]  
Grzymala-Busse J.W., 1992, Intelligent Decision Support, P3, DOI DOI 10.1007/978-94-015-7975-9_
[6]  
HAMILTON HJ, 1996, CS9606 U REGINA
[7]  
Holte Robert C., 1989, IJCAI
[8]  
Michalski R. S., 1986, Proceedings AAAI-86: Fifth National Conference on Artificial Intelligence, P1041
[9]  
Murphy P.M., UCI Repository of Machine Learning Databases
[10]  
Quinlan J. R., 1983, MACHINE LEARNING ART, V1