A Dynamic Discretization Approach for Constructing Decision Trees with a Continuous Label

Cited by: 20
Authors
Hu, Hsiao-Wei [1 ]
Chen, Yen-Liang [1 ]
Tang, Kwei [2 ]
Affiliations
[1] Natl Cent Univ, Dept Informat Management, Chungli 320, Taiwan
[2] Purdue Univ, Krannert Grad Sch Management, W Lafayette, IN 47907 USA
Keywords
Decision trees; data mining; classification; SELECTION; REGRESSION; ALGORITHM;
DOI
10.1109/TKDE.2009.24
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In traditional decision (classification) tree algorithms, the label is assumed to be a categorical (class) variable. When the label in the data is a continuous variable, two approaches based on existing decision tree algorithms can be used to handle the situation. The first uses a data discretization method in the preprocessing stage to convert the continuous label into a class label defined by a finite set of nonoverlapping intervals and then applies a decision tree algorithm. The second simply applies a regression tree algorithm, using the continuous label directly. Each of these approaches has its own drawbacks. We propose an algorithm that dynamically discretizes the continuous label at each node during the tree induction process. Extensive experiments show that the proposed method outperforms the preprocessing approach, the regression tree approach, and several nontree-based algorithms.
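As an illustration of the first (preprocessing) approach mentioned in the abstract, the sketch below converts a continuous label into class indices over nonoverlapping intervals. It is only a minimal example under stated assumptions: the abstract does not say which discretization method is used, so equal-width binning is assumed here, and the names `equal_width_edges` and `to_class` and the sample labels are hypothetical. It does not implement the paper's dynamic, per-node discretization.

```python
from bisect import bisect_right


def equal_width_edges(labels, k):
    """Return the k-1 interior cut points that split [min, max] into k equal-width intervals.

    Assumption: equal-width binning; the abstract names no specific method.
    """
    lo, hi = min(labels), max(labels)
    width = (hi - lo) / k
    return [lo + i * width for i in range(1, k)]


def to_class(value, edges):
    """Map a continuous label to the index of the interval that contains it.

    Intervals are half-open on the right, so a value equal to a cut point
    falls into the higher interval and no two intervals overlap.
    """
    return bisect_right(edges, value)


# Hypothetical continuous labels, discretized once before any tree is built.
labels = [1.0, 2.5, 4.0, 7.5, 9.0, 10.0]
edges = equal_width_edges(labels, 3)
classes = [to_class(y, edges) for y in labels]
print(edges, classes)
```

The resulting class indices could then be passed to any classification tree learner. Because the cut points are fixed once for the whole data set, they cannot adapt to the subset of records reaching a given node, which is the situation the proposed per-node discretization is meant to address.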
Pages: 1505 - 1514
Page count: 10
Related papers
50 in total
  • [1] A Wrapper Evolutionary Approach for Supervised Multivariate Discretization: A Case Study on Decision Trees
    Ramirez-Gallego, Sergio
    Garcia, Salvador
    Manuel Benitez, Jose
    Herrera, Francisco
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 47 - 58
  • [2] DISCOLEAF: Personalized DIScretization of COntinuous Attributes for LEArning with Federated Decision Trees
    Kwatra, Saloni
    Torra, Vicenc
    PRIVACY IN STATISTICAL DATABASES, PSD 2024, 2024, 14915 : 344 - 357
  • [3] Constructing Decision Trees for Unstructured Data
    Gong, Shucheng
    Liu, Hongyan
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014, 2014, 8933 : 475 - 487
  • [4] A weighted distance-based approach with boosted decision trees for label ranking
    Albano, Alessandro
    Sciandra, Mariangela
    Plaia, Antonella
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [5] A novel approach to cutting decision trees
    Uney-Yuksektepe, Fadime
    CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, 2014, 22 (03) : 553 - 565
  • [6] Decision Trees Constructing over Multiple Data Streams
    Martyna, Jerzy
    MAN-MACHINE INTERACTIONS, 2009, 59 : 191 - 199
  • [7] A dynamic programming based pruning method for decision trees
    Li, XB
    Sweigart, J
    Teng, J
    Donohue, J
    Thombs, L
    INFORMS JOURNAL ON COMPUTING, 2001, 13 (04) : 332 - 344
  • [8] Optimization and analysis of decision trees and rules: dynamic programming approach
    Alkhalid, Abdulaziz
    Amin, Talha
    Chikalov, Igor
    Hussain, Shahid
    Moshkov, Mikhail
    Zielosko, Beata
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2013, 42 (06) : 614 - 634
  • [9] Evolutionary Algorithms for Constructing an Ensemble of Decision Trees
    Dolotov, Evgeny
    Zolotykh, Nikolai
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS (AIST 2019), 2020, 1086 : 9 - 15