Three discretization methods for rule induction

Cited by: 1
Authors
Grzymala-Busse, JW [1]
Stefanowski, J [2]
Affiliations
[1] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
[2] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
DOI
10.1002/1098-111X(200101)16:1<29::AID-INT4>3.0.CO;2-0
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We discuss problems associated with the induction of decision rules from data with numerical attributes. Real-life data frequently contain numerical attributes. Rule induction from numerical data requires an additional step called discretization, in which numerical values are converted into intervals. Most existing discretization methods are applied before rule induction, as part of data preprocessing. Some methods discretize numerical attributes while learning decision rules. We compare the classification accuracy of a discretization method based on conditional entropy, applied before rule induction, with two newly proposed methods incorporated directly into the rule induction algorithm LEM2, where discretization and rule induction are performed at the same time. In all three approaches the same system is used to classify new, unseen data. We conclude that the error rates of the three methods do not differ significantly; however, the rules induced by the two new methods are simpler and stronger. © 2001 John Wiley & Sons, Inc.
Pages: 29-38
Page count: 10