A Differential-Evolution-Based Approach to Extract Univariate Decision Trees From Black-Box Models Using Tabular Data

被引:0
作者
Rivera-Lopez, Rafael [1 ]
Ceballos, Hector G. [2 ]
机构
[1] Tecnol Nacl Mexico, Inst Tecnol Veracruz, Dept Sistemas & Comp, Veracruz 91897, Mexico
[2] Tecnol Monterrey, Inst Future Educ, Monterrey 64849, Mexico
关键词
Computational modeling; Closed box; Focusing; Artificial neural networks; Data models; Robustness; Decision trees; Data mining; Random forests; Overfitting; Agnostic model; explainable artificial intelligence; evolutionary computation; decision trees; CLASSIFICATION; OPTIMIZATION; ALGORITHMS;
D O I
10.1109/ACCESS.2024.3498907
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing demand for complex machine learning models has increased the use of black-box models, such as random forests and artificial neural networks, posing significant challenges regarding explainability and interpretability. This manuscript addresses the critical problem of understanding and interpreting decisions from these opaque models, as a lack of interpretability can hinder their adoption in sensitive applications. To tackle this issue, we propose an evolutionary approach to induce univariate decision trees that accurately mimic the behavior of black-box models using tabular data. Our method employs two differential evolution algorithm variants, focusing on building univariate decision trees to enhance model explainability. Key contributions of this work include the development of a fitness function that balances accuracy with tree compactness to reduce overfitting and improve explanability. Additionally, we introduce a novel selection scheme that evaluates candidate solutions using synthetic instances, further enhancing the robustness against variance of the decision trees. Experimental results demonstrate that the proposed approach yields more precise and compact decision trees than traditional methods, significantly improving the explainability of complex machine learning models.
引用
收藏
页码:169850 / 169868
页数:19
相关论文
共 92 条
[81]   Learning optimal decision trees using constraint programming [J].
Verhaeghe, Helene ;
Nijssen, Siegfried ;
Pesant, Gilles ;
Quimper, Claude-Guy ;
Schaus, Pierre .
CONSTRAINTS, 2020, 25 (3-4) :226-250
[82]  
Vidal T, 2020, PMLR, V119, P9743
[83]   An analysis of explainability methods for convolutional neural networks [J].
Vonder Haar, Lynn ;
Elvira, Timothy ;
Ochoa, Omar .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
[84]  
Wan A., 2021, P 9 INT C LEARN REPR, P1
[85]   Top 10 algorithms in data mining [J].
Wu, Xindong ;
Kumar, Vipin ;
Quinlan, J. Ross ;
Ghosh, Joydeep ;
Yang, Qiang ;
Motoda, Hiroshi ;
McLachlan, Geoffrey J. ;
Ng, Angus ;
Liu, Bing ;
Yu, Philip S. ;
Zhou, Zhi-Hua ;
Steinbach, Michael ;
Hand, David J. ;
Steinberg, Dan .
KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 14 (01) :1-37
[86]   Global Model Interpretation via Recursive Partitioning [J].
Yang, Chengliang ;
Rangarajan, Anand ;
Ranka, Sanjay .
IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, :1563-1570
[87]  
Yarushev Sergey, 2023, Recent Trends in Intelligence Enabled Research: Selected Papers of Fourth Doctoral Symposium, DoSIER 2022. Advances in Intelligent Systems and Computing (1446), P1, DOI 10.1007/978-981-99-1472-2_1
[88]  
Zhang JQ, 2009, ADAPT LEARN OPTIM, V1, P1
[89]   JADE: Self-adaptive Differential Evolution with fast and reliable convergence performance [J].
Zhang, Jingqiao ;
Sanderson, Arthur C. .
2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, :2251-2258
[90]   Methods to balance the exploration and exploitation in Differential Evolution from different scales: A survey [J].
Zhang, Yanyun ;
Chen, Guanyu ;
Cheng, Li ;
Wang, Quanyu ;
Li, Qi .
NEUROCOMPUTING, 2023, 561