Explainable Machine Learning for Credit Risk Management When Features are Dependent

被引:1
|
作者
Do, Thanh Thuy [1 ]
Babaei, Golnoosh [2 ]
Pagnottoni, Paolo [3 ]
机构
[1] Univ Insubria, Dept Econ, Via Monte Generoso 71, I-21100 Varese, Italy
[2] Univ Pavia, Dept Engn, Pavia, Italy
[3] Univ Pavia, Dept Econ & Management, Pavia, Italy
基金
欧盟地平线“2020”;
关键词
Feature dependence; Shapley values; machine learning; explainability; PREDICTIONS;
D O I
10.1080/15366367.2023.2261186
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Complex Machine Learning (ML) models used to support decision-making in peer-to-peer (P2P) lending often lack clear, accurate, and interpretable explanations. While the game-theoretic concept of Shapley values and its computationally efficient variant Kernel SHAP may be employed for this aim, similarly to other existing methods, the latter makes the assumption that the features are independent. The assumption of uncorrelated features in credit risk management is fairly restrictive and, thus, prediction explanations coming from correlated features might result in highly misleading Shapley values, even when considering simple models. We therefore propose an evaluation of different dependent-feature estimation methods of Kernel SHAP for classification purposes in credit risk management. We show that dependent-feature estimation of Shapley values can improve the understanding of true prediction explanations, their robustness and is essential for better identifying the most relevant variables to default predictions coming from black-box ML models. We propose estimation of feature-dependent Shapley values for P2P credit risk managementWe consider different linear and non-linear predictive models with varying degrees of dependenceDependent feature estimation of Shapley values can improve prediction explanations and their robustnessLoan amount and interest rate are the most determinant features to loan default prediction explanations
引用
收藏
页码:315 / 340
页数:26
相关论文
共 50 条
  • [31] Explainable Machine Learning in Deployment
    Bhatt, Umang
    Xiang, Alice
    Sharma, Shubham
    Weller, Adrian
    Taly, Ankur
    Jia, Yunhan
    Ghosh, Joydeep
    Puri, Ruchir
    Moura, Jose M. F.
    Eckersley, Peter
    FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 648 - 657
  • [32] Credit default prediction of Chinese real estate listed companies based on explainable machine learning
    Ma, Yuanyuan
    Zhang, Pingping
    Duan, Shaodong
    Zhang, Tianjie
    FINANCE RESEARCH LETTERS, 2023, 58
  • [33] Enhancing transparency and fairness in automated credit decisions: an explainable novel hybrid machine learning approach
    Nwafor, Chioma Ngozi
    Nwafor, Obumneme
    Brahma, Sanjukta
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [34] Explainable machine learning for labquake prediction using catalog-driven features
    Karimpouli, Sadegh
    Caus, Danu
    Grover, Harsh
    Martinez-Garzon, Patricia
    Bohnhoff, Marco
    Beroza, Gregory C.
    Dresen, Georg
    Goebel, Thomas
    Weigel, Tobias
    Kwiatek, Grzegorz
    EARTH AND PLANETARY SCIENCE LETTERS, 2023, 622
  • [35] Two machine learning models based on explainable features to reduce PSQA workload
    Miori, Gloria
    Chieregato, Matteo
    Maio, Rosaria
    Andreoli, Francesca
    Galelli, Marco
    RADIOTHERAPY AND ONCOLOGY, 2024, 194 : S4451 - S4455
  • [36] An Explainable Machine Learning Model for Material Backorder Prediction in Inventory Management
    Ntakolia, Charis
    Kokkotis, Christos
    Karlsson, Patrik
    Moustakidis, Serafeim
    SENSORS, 2021, 21 (23)
  • [37] Leveraging explainable machine learning for enhanced management of lake water quality
    Hasani, Sajad Soleymani
    Arias, Mauricio E.
    Nguyen, Hung Q.
    Tarabih, Osama M.
    Welch, Zachariah
    Zhang, Qiong
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 370
  • [38] An explainable machine learning pipeline for backorder prediction in inventory management systems
    Charis, Ntakolia
    Christos, Kokkotis
    Serafeim, Moustakidis
    Elpiniki, Papageorgiou
    25TH PAN-HELLENIC CONFERENCE ON INFORMATICS WITH INTERNATIONAL PARTICIPATION (PCI2021), 2021, : 229 - 234
  • [39] RISK STRATIFICATION FOR PATIENTS WITH MYASTHENIA GRAVIS: AN EXPLAINABLE MACHINE LEARNING MODEL
    Zhong, Huahua
    Ruan, Zhe
    Lv, Zhiguo
    Zheng, Xueying
    Xi, Jianying
    Song, Jie
    Yan, Chong
    Luo, Lijun
    Chu, Lan
    Tan, Song
    Zhang, Chao
    Bu, Bitao
    Luo, Sushan
    Chang, Ting
    Zhao, Chongbo
    MUSCLE & NERVE, 2022, 66 : S65 - S65
  • [40] Predicting the risk of diabetic retinopathy using explainable machine learning algorithms
    Islam, Md. Merajul
    Rahman, Md. Jahanur
    Rabby, Md. Symun
    Alam, Md. Jahangir
    Pollob, S. M. Ashikul Islam
    Ahmed, N. A. M. Faisal
    Tawabunnahar, Most.
    Roy, Dulal Chandra
    Shin, Junpil
    Maniruzzaman, Md.
    DIABETES & METABOLIC SYNDROME-CLINICAL RESEARCH & REVIEWS, 2023, 17 (12)