Exploring accuracy and interpretability trade-off in tabular learning with novel attention-based models

Cited by: 0
Authors
Kodjo Mawuena Amekoe [1]
Hanane Azzag [3]
Zaineb Chelly Dagdia [1]
Mustapha Lebbah [2]
Gregoire Jaffre [2]
Affiliations
[1] Université Sorbonne Paris Nord
[2] LIPN CNRS UMR
[3] Université Paris-Saclay
[4] DAVID Lab
[5] UVSQ
[6] Groupe BPCE
Keywords
Tabular data; Interpretability; Attention; Robust explanation
DOI
10.1007/s00521-024-10163-9
Abstract
Apart from high accuracy, what interests many researchers and practitioners in real-life tabular learning problems (e.g., fraud detection and credit scoring) is uncovering hidden patterns in the data and/or providing meaningful justification of decisions made by machine learning models. In this regard, an important question arises: should one use inherently interpretable models, or explain full-complexity models such as XGBoost or Random Forest with post hoc tools? Opting for the second choice is typically supported by the accuracy metric, but it is not always evident that the performance gap is sufficiently significant, especially considering the current trend of accurate and inherently interpretable models, as well as other real-life evaluation metrics such as the faithfulness, stability, and computational cost of explanations. In this work, we show through benchmarking on 45 datasets that the relative accuracy loss is less than 4% on average when using intelligible models such as the explainable boosting machine. Furthermore, we propose a simple use of model ensembling to improve the expressiveness of TabSRALinear, a novel attention-based inherently interpretable solution, and demonstrate both theoretically and empirically that it is a viable option for (1) generating stable or robust explanations and (2) incorporating human knowledge during the training phase. Source code is available at https://github.com/anselmeamekoe/TabSRA.
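The "relative accuracy loss" figure quoted in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact definition used in the paper, so the formula below (loss relative to the full-complexity model's score, averaged over datasets) is an assumption, and the function names and scores are hypothetical and not taken from the TabSRA repository.

```python
def relative_accuracy_loss(reference_score, interpretable_score):
    """Fraction of the reference model's score lost by the interpretable model.

    Assumed definition: (reference - interpretable) / reference.
    """
    if reference_score <= 0:
        raise ValueError("reference_score must be positive")
    return (reference_score - interpretable_score) / reference_score


def mean_relative_loss(score_pairs):
    """Average the per-dataset relative loss over (reference, interpretable) pairs."""
    losses = [relative_accuracy_loss(ref, interp) for ref, interp in score_pairs]
    return sum(losses) / len(losses)


# Made-up scores for three hypothetical datasets (not results from the paper).
example_pairs = [(0.90, 0.88), (0.80, 0.80), (0.75, 0.72)]
print(f"{mean_relative_loss(example_pairs):.2%}")
```

Averaging per-dataset ratios, rather than taking the ratio of averaged scores, keeps datasets with low absolute scores from being drowned out; whether the paper aggregates this way is not stated in the abstract.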
Pages: 18583-18611
Page count: 28
Related papers
14 in total
  • [1] The performance-interpretability trade-off: a comparative study of machine learning models
    André Assis
    Jamilson Dantas
    Ermeson Andrade
    Journal of Reliable Intelligent Environments, 2025, 11 (1)
  • [2] The accuracy versus interpretability trade-off in fraud detection model
    Nesvijevskaia, Anna
    Ouillade, Sophie
    Guilmin, Pauline
    Zucker, Jean-Daniel
    DATA & POLICY, 2021, 3
  • [3] Tackling the Accuracy-Interpretability Trade-off: Interpretable Deep Learning Models for Satellite Image-based Real Estate Appraisal
    Kucklick, Jan-Peter
    Mueller, Oliver
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2023, 14 (01)
  • [4] Leveraging the Trade-off between Accuracy and Interpretability in a Hybrid Intelligent System
    Wang, Di
    Quek, Chai
    Tan, Ah-Hwee
    Miao, Chunyan
    Ng, Geok See
    Zhou, You
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 55 - 60
  • [5] Balancing the trade-off between accuracy and interpretability in software defect prediction
    Toshiki Mori
    Naoshi Uchihira
    Empirical Software Engineering, 2019, 24 : 779 - 825
  • [6] Balancing the trade-off between accuracy and interpretability in software defect prediction
    Mori, Toshiki
    Uchihira, Naoshi
    EMPIRICAL SOFTWARE ENGINEERING, 2019, 24 (02) : 779 - 825
  • [7] Benchmarking Attention-Based Interpretability of Deep Learning in Multivariate Time Series Predictions
    Baric, Domjan
    Fumic, Petar
    Horvatic, Davor
    Lipic, Tomislav
    ENTROPY, 2021, 23 (02) : 1 - 23
  • [8] Predicting supply chain risks using machine learning: The trade-off between performance and interpretability
    Baryannis, George
    Dani, Samir
    Antoniou, Grigoris
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 993 - 1004
  • [9] On Exploring Attention-based Explanation for Transformer Models in Text Classification
    Liu, Shengzhong
    Le, Franck
    Chakraborty, Supriyo
    Abdelzaher, Tarek
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1193 - 1203
  • [10] A genetic programming approach for real-time crash prediction to solve trade-off between interpretability and accuracy
    Ma, Xiaochi
    Lu, Jian
    Liu, Xian
    Qu, Weibin
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2023, 15 (04) : 421 - 443