A Differential-Evolution-Based Approach to Extract Univariate Decision Trees From Black-Box Models Using Tabular Data

被引:0
作者
Rivera-Lopez, Rafael [1 ]
Ceballos, Hector G. [2 ]
机构
[1] Tecnol Nacl Mexico, Inst Tecnol Veracruz, Dept Sistemas & Comp, Veracruz 91897, Mexico
[2] Tecnol Monterrey, Inst Future Educ, Monterrey 64849, Mexico
关键词
Computational modeling; Closed box; Focusing; Artificial neural networks; Data models; Robustness; Decision trees; Data mining; Random forests; Overfitting; Agnostic model; explainable artificial intelligence; evolutionary computation; decision trees; CLASSIFICATION; OPTIMIZATION; ALGORITHMS;
D O I
10.1109/ACCESS.2024.3498907
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing demand for complex machine learning models has increased the use of black-box models, such as random forests and artificial neural networks, posing significant challenges regarding explainability and interpretability. This manuscript addresses the critical problem of understanding and interpreting decisions from these opaque models, as a lack of interpretability can hinder their adoption in sensitive applications. To tackle this issue, we propose an evolutionary approach to induce univariate decision trees that accurately mimic the behavior of black-box models using tabular data. Our method employs two differential evolution algorithm variants, focusing on building univariate decision trees to enhance model explainability. Key contributions of this work include the development of a fitness function that balances accuracy with tree compactness to reduce overfitting and improve explanability. Additionally, we introduce a novel selection scheme that evaluates candidate solutions using synthetic instances, further enhancing the robustness against variance of the decision trees. Experimental results demonstrate that the proposed approach yields more precise and compact decision trees than traditional methods, significantly improving the explainability of complex machine learning models.
引用
收藏
页码:169850 / 169868
页数:19
相关论文
共 92 条
[11]  
Breiman L., 2017, CLASSIFICATION REGRE, DOI [DOI 10.1201/9781315139470, 10.1201/9781315139470]
[12]  
Brest J, 2017, IEEE C EVOL COMPUTAT, P1311, DOI 10.1109/CEC.2017.7969456
[13]   Bearing Fault Detection and Recognition From Supply Currents With Decision Trees [J].
Briglia, Giovanni ;
Immovilli, Fabio ;
Cocconcelli, Marco ;
Lippi, Marco .
IEEE ACCESS, 2024, 12 :12760-12770
[14]  
Calvo B, 2016, R J, V8, P248
[15]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[16]  
Craven MW, 1996, ADV NEUR IN, V8, P24
[17]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[18]   A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms [J].
Derrac, Joaquin ;
Garcia, Salvador ;
Molina, Daniel ;
Herrera, Francisco .
SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (01) :3-18
[19]   Evolutionary Algorithms for Constructing an Ensemble of Decision Trees [J].
Dolotov, Evgeny ;
Zolotykh, Nikolai .
ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS (AIST 2019), 2020, 1086 :9-15
[20]   Differential Evolution: A Survey and Analysis [J].
Eltaeib, Tarik ;
Mahmood, Ausif .
APPLIED SCIENCES-BASEL, 2018, 8 (10)