Recent advances in decision trees: an updated survey

被引:223
作者
Costa, Vinicius G. [1 ]
Pedreira, Carlos E. [1 ]
机构
[1] Fed Univ Rio de Janeiro UFRJ, Syst Engn & Comp Sci Dept, Rio De Janeiro, Brazil
关键词
Decision trees; Machine learning; Interpretable models; Classification algorithms; STRUCTURED CLASSIFICATION; DISCRIMINANT-ANALYSIS; INDUCTION; MACHINE; ALGORITHMS; ENTROPY;
D O I
10.1007/s10462-022-10275-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decision Trees (DTs) are predictive models in supervised learning, known not only for their unquestionable utility in a wide range of applications but also for their interpretability and robustness. Research on the subject is still going strong after almost 60 years since its original inception, and in the last decade, several researchers have tackled key matters in the field. Although many great surveys have been published in the past, there is a gap since none covers the last decade of the field as a whole. This paper proposes a review of the main recent advances in DT research, focusing on three major goals of a predictive learner: issues regarding the fitting of training data, generalization, and interpretability. Moreover, by organizing several topics that have been previously analyzed in isolation, this survey attempts to provide an overview of the field, its key concerns, and future trends, serving as a good entry point for both researchers and newcomers to the machine learning community.
引用
收藏
页码:4765 / 4800
页数:36
相关论文
共 152 条
[41]   Alternating Model Trees [J].
Frank, Eibe ;
Mayo, Michael ;
Kramer, Stefan .
30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, :871-878
[42]  
Freitas Alex A, 2014, ACM SIGKDD Explorations Newslett., V15, DOI DOI 10.1145/2594473.2594475
[43]  
Freund Y, 1999, MACHINE LEARNING, PROCEEDINGS, P124
[44]  
Frosst N., 2017, ARXIV
[45]   A Novel Hyperparameter-Free Approach to Decision Tree Construction That Avoids Overfitting by Design [J].
Garcia Leiv, Rafael ;
Fernandez An, Antonio ;
Mancus, Vincenzo ;
Casari, Paolo .
IEEE ACCESS, 2019, 7 :99978-99987
[46]   Clustering nominal data using unsupervised binary decision trees: Comparisons with the state of the art methods [J].
Ghattas, Badih ;
Michel, Pierre ;
Boyer, Laurent .
PATTERN RECOGNITION, 2017, 67 :177-185
[47]   TOWARDS AUTOMATED MEDICAL DECISIONS [J].
GLESER, MA ;
COLLEN, MF .
COMPUTERS AND BIOMEDICAL RESEARCH, 1972, 5 (02) :180-&
[48]   Optimal decision trees for categorical data via integer programming [J].
Gunluk, Oktay ;
Kalagnanam, Jayant ;
Li, Minhan ;
Menickelly, Matt ;
Scheinberg, Katya .
JOURNAL OF GLOBAL OPTIMIZATION, 2021, 81 (01) :233-260
[49]  
Hastie T., 2009, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, DOI DOI 10.1007/978-0-387-84858-7
[50]  
HEATH D, 1993, IJCAI-93, VOLS 1 AND 2, P1002