A Differential-Evolution-Based Approach to Extract Univariate Decision Trees From Black-Box Models Using Tabular Data

Citations: 0
Authors
Rivera-Lopez, Rafael [1]
Ceballos, Hector G. [2]
Affiliations
[1] Tecnol Nacl Mexico, Inst Tecnol Veracruz, Dept Sistemas & Comp, Veracruz 91897, Mexico
[2] Tecnol Monterrey, Inst Future Educ, Monterrey 64849, Mexico
Keywords
Computational modeling; Closed box; Focusing; Artificial neural networks; Data models; Robustness; Decision trees; Data mining; Random forests; Overfitting; Agnostic model; explainable artificial intelligence; evolutionary computation; CLASSIFICATION; OPTIMIZATION; ALGORITHMS
DOI
10.1109/ACCESS.2024.3498907
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
The growing demand for complex machine learning models has increased the use of black-box models, such as random forests and artificial neural networks, posing significant challenges regarding explainability and interpretability. This manuscript addresses the critical problem of understanding and interpreting decisions from these opaque models, as a lack of interpretability can hinder their adoption in sensitive applications. To tackle this issue, we propose an evolutionary approach to induce univariate decision trees that accurately mimic the behavior of black-box models using tabular data. Our method employs two differential evolution algorithm variants, focusing on building univariate decision trees to enhance model explainability. Key contributions of this work include the development of a fitness function that balances accuracy with tree compactness to reduce overfitting and improve explainability. Additionally, we introduce a novel selection scheme that evaluates candidate solutions using synthetic instances, further enhancing the robustness of the decision trees against variance. Experimental results demonstrate that the proposed approach yields more precise and compact decision trees than traditional methods, significantly improving the explainability of complex machine learning models.
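The abstract mentions a fitness function that balances accuracy with tree compactness but does not state its form. The following is only a minimal sketch of one common way to express such a trade-off, not the authors' formula: it assumes fitness is the fraction of instances on which the surrogate tree agrees with the black-box model, minus a linear penalty on tree size. The names `fidelity`, `fitness`, `n_nodes`, and the weight `alpha` are hypothetical and do not come from the paper.

```python
from typing import Sequence


def fidelity(tree_preds: Sequence[int], black_box_preds: Sequence[int]) -> float:
    """Fraction of instances where the tree reproduces the black-box label."""
    if len(tree_preds) != len(black_box_preds):
        raise ValueError("prediction sequences must have the same length")
    if len(tree_preds) == 0:
        raise ValueError("at least one instance is required")
    matches = sum(t == b for t, b in zip(tree_preds, black_box_preds))
    return matches / len(tree_preds)


def fitness(
    tree_preds: Sequence[int],
    black_box_preds: Sequence[int],
    n_nodes: int,
    alpha: float = 0.01,  # assumed size-penalty weight, not taken from the paper
) -> float:
    """Assumed trade-off: agreement with the black box minus a size penalty."""
    return fidelity(tree_preds, black_box_preds) - alpha * n_nodes


# Two hypothetical candidate trees with equal fidelity but different sizes:
# under this assumed penalty, the smaller tree receives the higher fitness.
black_box = [0, 1, 1, 0]
candidate = [0, 1, 0, 0]
small_tree_score = fitness(candidate, black_box, n_nodes=5)
large_tree_score = fitness(candidate, black_box, n_nodes=25)
```

In a differential evolution loop, a score of this kind would be the quantity compared when deciding whether a trial candidate replaces its parent; how the paper encodes trees and uses synthetic instances during selection is not described in the abstract and is left out here.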
Pages: 169850-169868
Page count: 19