Explanation of machine learning models using shapley additive explanation and application for real data in hospital

被引:264
|
作者
Nohara, Yasunobu [1 ]
Matsumoto, Koutarou [2 ]
Soejima, Hidehisa [3 ]
Nakashima, Naoki [4 ]
机构
[1] Kumamoto Univ, Kumamoto, Japan
[2] Kurume Univ, Fukuoka, Japan
[3] Saiseikai Kumamoto Hosp, Kumamoto, Japan
[4] Kyushu Univ Hosp, Fukuoka, Japan
关键词
Shapley additive explanation; Machine learning; Interpretability; Feature importance; Feature packing;
D O I
10.1016/j.cmpb.2021.106584
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: When using machine learning techniques in decision-making processes, the interpretability of the models is important. In the present paper, we adopted the Shapley additive explanation (SHAP), which is based on fair profit allocation among many stakeholders depending on their contribution, for interpreting a gradient-boosting decision tree model using hospital data. Methods: For better interpretability, we propose two novel techniques as follows: (1) a new metric of feature importance using SHAP and (2) a technique termed feature packing, which packs multiple similar features into one grouped feature to allow an easier understanding of the model without reconstruction of the model. We then compared the explanation results between the SHAP framework and existing methods using cerebral infarction data from our hospital. Results: The interpretation by SHAP was mostly consistent with that by the existing methods. We showed how the A/G ratio works as an important prognostic factor for cerebral infarction using proposed techniques. Conclusion: Our techniques are useful for interpreting machine learning models and can uncover the underlying relationships between features and outcome. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Explanation of Machine Learning Models Using Improved Shapley Additive Explanation
    Nohara, Yasunobu
    Matsumoto, Koutarou
    Soejima, Hidehisa
    Nakashima, Naoki
    ACM-BCB'19: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, 2019, : 546 - 546
  • [2] Machine Learning for Data Center Optimizations: Feature Selection Using Shapley Additive exPlanation (SHAP)
    Gebreyesus, Yibrah
    Dalton, Damian
    Nixon, Sebastian
    De Chiara, Davide
    Chinnici, Marta
    FUTURE INTERNET, 2023, 15 (03)
  • [3] Axial Compression Prediction and GUI Design for CCFST Column Using Machine Learning and Shapley Additive Explanation
    Liu, Xuerui
    Wu, Yanqi
    Zhou, Yisong
    BUILDINGS, 2022, 12 (05)
  • [4] Explainable cancer factors discovery: Shapley additive explanation for machine learning models demonstrates the best practices in the case of pancreatic cancer
    Su, Liuyan
    Hounye, Alphonse Houssou
    Pan, Qi
    Miao, Kexin
    Wang, Jiaoju
    Hou, Muzhou
    Xiong, Li
    PANCREATOLOGY, 2024, 24 (03) : 404 - 423
  • [5] Interpretability of SurvivalBoost upon Shapley Additive Explanation value on medical data
    Wang, Yating
    Su, Jinxia
    Zhao, Xuejing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (07) : 3058 - 3067
  • [6] Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation
    Liu, Lianhua
    Bi, Bo
    Cao, Li
    Gui, Mei
    Ju, Feng
    FRONTIERS IN ENDOCRINOLOGY, 2024, 15
  • [7] Shapley additive explanation on machine learning predictions of fatigue lifetimes in piston aluminum alloys under different manufacturing and loading conditions
    Matin, Mahmood
    Azadi, Mohammad
    FRATTURA ED INTEGRITA STRUTTURALE-FRACTURE AND STRUCTURAL INTEGRITY, 2024, 18 (68): : 357 - 370
  • [8] Machine learning and Shapley Additive Explanation-based interpretable prediction of the electrocatalytic performance of N-doped carbon materials
    Tan, Shiteng
    Wang, Ruikun
    Song, Gaoke
    Qi, Shulong
    Zhang, Kai
    Zhao, Zhenghui
    Yin, Qianqian
    FUEL, 2024, 355
  • [9] Investigating the application of a commercial and residential energy consumption prediction model for urban Planning scenarios with Machine Learning and Shapley Additive explanation methods
    Amiri, Shideh Shams
    Mueller, Maya
    Hoque, Simi
    ENERGY AND BUILDINGS, 2023, 287
  • [10] Predicting pile bearing capacity using gene expression programming with SHapley Additive exPlanation interpretation
    Adil Khan
    Majid Khan
    Waseem Akhtar Khan
    Muhammad Ali Afridi
    Khawaja Atif Naseem
    Ayesha Noreen
    Discover Civil Engineering, 2 (1):