Three discretization methods for rule induction

Cited by: 1
Authors
Grzymala-Busse, JW [1]
Stefanowski, J [2]
Affiliations
[1] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
[2] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
DOI
10.1002/1098-111X(200101)16:1<29::AID-INT4>3.0.CO;2-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We discuss problems associated with the induction of decision rules from data with numerical attributes. Real-life data frequently contain numerical attributes, and rule induction from such data requires an additional step called discretization, in which numerical values are converted into intervals. Most existing discretization methods are applied before rule induction, as part of data preprocessing; some discretize numerical attributes while learning decision rules. We compare the classification accuracy of a discretization method based on conditional entropy, applied before rule induction, with two newly proposed methods incorporated directly into the rule induction algorithm LEM2, in which discretization and rule induction are performed at the same time. In all three approaches the same system is used to classify new, unseen data. We conclude that the error rates of the three methods do not differ significantly; however, the rules induced by the two new methods are simpler and stronger. © 2001 John Wiley & Sons, Inc.
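To make the discretization step concrete, the following is a minimal sketch of choosing a single cutpoint by conditional entropy and mapping values to intervals. It is a generic illustration only, not the procedure evaluated in the paper and not part of LEM2; the function names (`entropy`, `best_cutpoint`, `discretize`) and the toy temperature data are assumptions made for this example.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def best_cutpoint(values, labels):
    """Return (cutpoint, conditional entropy) for the best binary split.

    Hypothetical helper: candidate cutpoints are the midpoints between
    consecutive distinct values; the winner minimizes the entropy of the
    decision conditioned on which side of the cutpoint a case falls.
    Returns None when the attribute has fewer than two distinct values.
    """
    pairs = list(zip(values, labels))
    distinct = sorted(set(values))
    best = None
    for low, high in zip(distinct, distinct[1:]):
        cut = (low + high) / 2
        left = [label for value, label in pairs if value < cut]
        right = [label for value, label in pairs if value >= cut]
        cond = (len(left) * entropy(left) + len(right) * entropy(right)) / len(pairs)
        if best is None or cond < best[1]:
            best = (cut, cond)
    return best


def discretize(value, cutpoints):
    """Map a numerical value to a half-open interval label."""
    bounds = [float("-inf")] + sorted(cutpoints) + [float("inf")]
    for low, high in zip(bounds, bounds[1:]):
        if low <= value < high:
            return f"[{low}, {high})"


# Toy data (assumed for illustration): body temperature and a flu decision.
temperature = [36.5, 37.0, 37.5, 38.5, 39.0, 40.0]
flu = ["no", "no", "no", "yes", "yes", "yes"]

cut, cond_entropy = best_cutpoint(temperature, flu)
intervals = [discretize(t, [cut]) for t in temperature]
```

On this toy data the chosen cutpoint separates the two decision classes completely, so each resulting interval can serve as a symbolic attribute value for a rule learner. Methods that discretize during rule induction, as the two new methods in the abstract do, instead pick interval boundaries while building the rules.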
Pages: 29-38
Number of pages: 10