Symbolic Regression Enhanced Decision Trees for Classification Tasks

Cited by: 0
Authors
Sen Fong, Kei [1]
Motani, Mehul [1, 2]
Affiliations
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[2] Natl Univ Singapore, Inst Hlth N 1, Inst Digital Med WisDM, Inst Data Sci, Singapore, Singapore
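Funding
National Research Foundation, Singapore
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract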
We introduce a conceptually simple yet effective method to create small, compact decision trees by using splits found via Symbolic Regression (SR). Traditional decision tree (DT) algorithms partition a dataset on axis-parallel splits. When the true boundaries are not along the feature axes, a DT is likely to have a complicated structure and a dense decision boundary. In this paper, we introduce SR-Enhanced DT (SREDT), a method that uses SR to increase the richness of the class of possible DT splits. We evaluate SREDT on both synthetic and real-world datasets. Despite its simplicity, our method produces surprisingly small trees that outperform both DT and oblique DT (ODT) on supervised classification tasks in terms of accuracy and F-score. We show empirically that SREDTs decrease inference time (compared to DT and ODT) and argue that they allow us to obtain more explainable descriptions of the decision process. SREDT also performs competitively against state-of-the-art tabular classification methods, including tree ensembles and deep models. Finally, we introduce a local search mechanism to improve SREDT and evaluate it on 56 PMLB datasets. This mechanism shows improved performance on 77.2% of the datasets, outperforming DT and ODT. In terms of F-score, local SREDT outperforms DT and ODT on 82.5% and 73.7% of the datasets, respectively; in terms of inference time, local SREDT requires 25.8% and 26.6% less inference time than DT and ODT, respectively.
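The contrast the abstract draws between axis-parallel splits and richer, expression-based splits can be illustrated with a minimal sketch. It is not the authors' SREDT implementation: the candidate pool `CANDIDATES` is a hypothetical, hand-written stand-in for the expressions a real SR search would discover, and the circular boundary, sample size, and Gini criterion are all assumptions chosen for illustration.

```python
import random

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def best_split(values, labels):
    """Best single threshold on one scalar feature.

    Returns (weighted_gini, threshold); threshold is None if no split helps.
    """
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best = (gini(labels), None)
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot place a threshold between equal values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if score < best[0]:
            best = (score, (pairs[i - 1][0] + pairs[i][0]) / 2.0)
    return best

# Synthetic data (assumption): the true boundary is a circle, not axis-aligned.
random.seed(0)
X = [(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(400)]
y = [1 if a * a + b * b < 0.5 else 0 for a, b in X]

# Axis-parallel splits: what a traditional DT considers at a node.
axis_score = min(
    best_split([p[0] for p in X], y)[0],
    best_split([p[1] for p in X], y)[0],
)

# Hypothetical candidate expressions standing in for an SR search.
CANDIDATES = {
    "x1 + x2": lambda a, b: a + b,
    "x1 * x2": lambda a, b: a * b,
    "x1^2 + x2^2": lambda a, b: a * a + b * b,
}

# Pick the expression whose best threshold gives the lowest impurity.
sr_name, sr_score = None, float("inf")
for name, f in CANDIDATES.items():
    score, _ = best_split([f(a, b) for a, b in X], y)
    if score < sr_score:
        sr_name, sr_score = name, score

print("best axis-parallel impurity:", round(axis_score, 3))
print("best expression split:", sr_name, "impurity:", round(sr_score, 3))
```

In this toy setting a single threshold on one expression separates the classes, whereas an axis-parallel tree would need many nodes to approximate the same boundary. That is the intuition behind the smaller trees the abstract reports.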
Pages: 12033-12042
Number of pages: 10