Ameva: An autonomous discretization algorithm

被引:68
作者
Gonzalez-Abril, L. [1 ]
Cuberos, F. J. [2 ]
Velasco-Morente, Francisco [1 ]
Ortega, J. A. [3 ]
机构
[1] Univ Seville, Appl Econ Dept 1, Seville 41018, Spain
[2] RTVA, Planning Dept, Seville, Spain
[3] Univ Seville, Comp Languages & Syst Dept, Seville, Spain
关键词
Knowledge discovery; Supervised discretization; Machine learning; Genetic algorithm; CHI2; ALGORITHM; ATTRIBUTES;
D O I
10.1016/j.eswa.2008.06.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a new discretization algorithm, called Ameva, which is designed to work with supervised learning algorithms. Ameva maximizes a contingency coefficient based on Chi-square statistics and generates a potentially minimal number of discrete intervals. Its most important advantage, in contrast with several existing discretization algorithms, is that it does not need the user to indicate the number of intervals. We have compared Ameva with one of the most relevant discretization algorithms, CAIM. Tests performed comparing these two algorithms show that discrete attributes generated by the Ameva algorithm always have the lowest number of intervals, and even if the number of classes is high, the same computational complexity is maintained. A comparison between the Ameva and the genetic algorithm approaches has been also realized and there are very small differences between these iterative and combinatorial approaches, except when considering the execution time. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5327 / 5332
页数:6
相关论文
共 26 条
  • [1] Support vector machines for interval discriminant analysis
    Angulo, C.
    Anguita, D.
    Gonzalez-Abril, L.
    Ortega, J. A.
    [J]. NEUROCOMPUTING, 2008, 71 (7-9) : 1220 - 1229
  • [2] [Anonymous], 1993, C4.5: Programs for machine learning
  • [3] [Anonymous], MACHINE LEARNING
  • [4] BERTIER P, 1981, ANAL DONNEES MULTIDI
  • [5] Khiops: A statistical discretization method of continuous attributes
    Boulle, M
    [J]. MACHINE LEARNING, 2004, 55 (01) : 53 - 69
  • [6] CATLETT J, 1991, P EUR WORK SESS LEAR
  • [7] CLASS-DEPENDENT DISCRETIZATION FOR INDUCTIVE LEARNING FROM CONTINUOUS AND MIXED-MODE DATA
    CHING, JY
    WONG, AKC
    CHAN, KCC
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (07) : 641 - 651
  • [8] Clark P., 1989, Machine Learning, V3, P261, DOI 10.1007/BF00116835
  • [9] Cohen WilliamW., 1995, Fast Effective Rule Induction
  • [10] FAYYAD UM, 1992, MACH LEARN, V8, P87, DOI 10.1023/A:1022638503176