Explanation of machine learning models using shapley additive explanation and application for real data in hospital

被引:264
|
作者
Nohara, Yasunobu [1 ]
Matsumoto, Koutarou [2 ]
Soejima, Hidehisa [3 ]
Nakashima, Naoki [4 ]
机构
[1] Kumamoto Univ, Kumamoto, Japan
[2] Kurume Univ, Fukuoka, Japan
[3] Saiseikai Kumamoto Hosp, Kumamoto, Japan
[4] Kyushu Univ Hosp, Fukuoka, Japan
关键词
Shapley additive explanation; Machine learning; Interpretability; Feature importance; Feature packing;
D O I
10.1016/j.cmpb.2021.106584
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: When using machine learning techniques in decision-making processes, the interpretability of the models is important. In the present paper, we adopted the Shapley additive explanation (SHAP), which is based on fair profit allocation among many stakeholders depending on their contribution, for interpreting a gradient-boosting decision tree model using hospital data. Methods: For better interpretability, we propose two novel techniques as follows: (1) a new metric of feature importance using SHAP and (2) a technique termed feature packing, which packs multiple similar features into one grouped feature to allow an easier understanding of the model without reconstruction of the model. We then compared the explanation results between the SHAP framework and existing methods using cerebral infarction data from our hospital. Results: The interpretation by SHAP was mostly consistent with that by the existing methods. We showed how the A/G ratio works as an important prognostic factor for cerebral infarction using proposed techniques. Conclusion: Our techniques are useful for interpreting machine learning models and can uncover the underlying relationships between features and outcome. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Application of Machine Learning Predictive Models for Early Detection of Glaucoma Using Real World Data
    Raju, Murugesan
    Shanmugam, Krishna P.
    Shyu, Chi-Ren
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [22] Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions
    Rodriguez-Perez, Raquel
    Bajorath, Juergen
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2020, 34 (10) : 1013 - 1026
  • [23] Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions
    Raquel Rodríguez-Pérez
    Jürgen Bajorath
    Journal of Computer-Aided Molecular Design, 2020, 34 : 1013 - 1026
  • [24] Explaining deep learning-based activity schedule models using SHapley Additive exPlanations
    Koushik, Anil
    Manoj, M.
    Nezamuddin, N.
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2025, 17 (03): : 442 - 457
  • [25] Method Agnostic Model Class Reliance (MAMCR) Explanation of Multiple Machine Learning Models
    Gunasekaran, Abirami
    Chen, Minsi
    Hill, Richard
    McCabe, Keith
    SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 56 - 71
  • [26] Identifying major climate extreme indices driver of stream flow discharge variability using machine learning and SHaply Additive Explanation
    Isa, Zaharaddeen
    Abdussalam, Auwal F.
    Sawa, Bulus Ajiya
    Ibrahim, Muktar
    Isa, Umar Abdulkadir
    Babati, Abu-Hanifa
    SUSTAINABLE WATER RESOURCES MANAGEMENT, 2023, 9 (04)
  • [27] Identifying major climate extreme indices driver of stream flow discharge variability using machine learning and SHaply Additive Explanation
    Zaharaddeen Isa
    Auwal F. Abdussalam
    Bulus Ajiya Sawa
    Muktar Ibrahim
    Umar Abdulkadir Isa
    Abu-Hanifa Babati
    Sustainable Water Resources Management, 2023, 9
  • [28] Shapley Additive Explanation Method for Assessing Motorized Two-Wheeler Level of Service at Signalized Intersections
    Biswal, Manisha
    Bhuyan, Prasanta Kumar
    URBAN MOBILITY RESEARCH IN INDIA, UMI 2022, 2023, 361 : 381 - 389
  • [29] Attribution analysis to Co-planning renewable energy and storage capacity based on Shapley Additive Explanation
    Chen, Zili
    Zhou, Ming
    Wu, Zhaoyuan
    Yang, Linyan
    Yue, Hao
    ENERGY, 2025, 325
  • [30] Landslide Modeling in a Tropical Mountain Basin Using Machine Learning Algorithms and Shapley Additive Explanations
    Vega, Johnny
    Sepulveda-Murillo, Fabio Humberto
    Parra, Melissa
    AIR SOIL AND WATER RESEARCH, 2023, 16