Symbolic Regression Enhanced Decision Trees for Classification Tasks

被引:0
|
作者
Sen Fong, Kei [1 ]
Motani, Mehul [1 ,2 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[2] Natl Univ Singapore, Inst Hlth N 1, Inst Digital Med WisDM, Inst Data Sci, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a conceptually simple yet effective method to create small, compact decision trees - by using splits found via Symbolic Regression (SR). Traditional decision tree (DT) algorithms partition a dataset on axis-parallel splits. When the true boundaries are not along the feature axes, DT is likely to have a complicated structure and a dense decision boundary. In this paper, we introduce SR-Enhanced DT (SREDT) - a method which utilizes SR to increase the richness of the class of possible DT splits. We evaluate SREDT on both synthetic and real-world datasets. Despite its simplicity, our method produces surprisingly small trees that outperform both DT and oblique DT (ODT) on supervised classification tasks in terms of accuracy and F-score. We show empirically that SREDTs decrease inference time (compared to DT and ODT) and argue that they allow us to obtain more explainable descriptions of the decision process. SREDT also performs competitively against state-of-the-art tabular classification methods, including tree ensembles and deep models. Finally, we introduce a local search mechanism to im-prove SREDT and evaluate it on 56 PMLB datasets. This mechanism shows improved performance on 77.2% of the datasets, outperforming DT and ODT. In terms of F-Score, local SREDT outperforms DT and ODT in 82.5% and 73.7% of the datasets respectively and in terms of inference time, local SREDT requires 25.8% and 26.6% less inference time than DT and ODT respectively.
引用
收藏
页码:12033 / 12042
页数:10
相关论文
共 50 条
  • [1] Globally optimal fuzzy decision trees for classification and regression
    Suárez, A
    Lutsko, JF
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (12) : 1297 - 1311
  • [2] Symbolic Regression Trees as Embedded Representations
    Caetano, Victor
    Teixeira, Matheus Candido
    Pappa, Gisele Lobo
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 411 - 419
  • [3] Neural-symbolic temporal decision trees for multivariate time series classification
    Pagliarini, Giovanni
    Scaboro, Simone
    Serra, Giuseppe
    Sciavicco, Guido
    Stan, Ionel Eduard
    INFORMATION AND COMPUTATION, 2024, 301
  • [4] Classification and regression trees
    Martin Krzywinski
    Naomi Altman
    Nature Methods, 2017, 14 : 757 - 758
  • [5] Classification and regression trees
    Loh, Wei-Yin
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) : 14 - 23
  • [6] Classification and regression trees
    Speybroeck, N.
    INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2012, 57 (01) : 243 - 246
  • [7] Using classification and regression trees (CART) to support worker decision making
    Johnson, MA
    Brown, CH
    Wells, SJ
    SOCIAL WORK RESEARCH, 2002, 26 (01) : 19 - 29
  • [8] Evaluating Nonlinear Decision Trees for Binary Classification Tasks with Other Existing Methods
    Dhebar, Yashesh
    Gupta, Sparsh
    Deb, Kalyanmoy
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2806 - 2813
  • [9] On full regression decision trees
    Genrikhov I.E.
    Djukova E.V.
    Zhuravlev V.I.
    Pattern Recognition and Image Analysis, 2017, 27 (1) : 1 - 7
  • [10] Decision Trees in the Tasks of Human Prediction
    Menshih, P. G.
    Erokhin, S. D.
    Gorodnichev, M. G.
    2021 SYSTEMS OF SIGNAL SYNCHRONIZATION, GENERATING AND PROCESSING IN TELECOMMUNICATIONS (SYNCHROINFO), 2021,