Explainable Machine Learning for Credit Risk Management When Features are Dependent

被引：1

作者：

Do, Thanh Thuy ^{[1
]}

Babaei, Golnoosh ^{[2
]}

Pagnottoni, Paolo ^{[3
]}

机构：

[1] Univ Insubria, Dept Econ, Via Monte Generoso 71, I-21100 Varese, Italy

[2] Univ Pavia, Dept Engn, Pavia, Italy

[3] Univ Pavia, Dept Econ & Management, Pavia, Italy

来源：

MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES | 2024年 / 22卷 / 04期

基金：

欧盟地平线“2020”;

关键词：

Feature dependence; Shapley values; machine learning; explainability; PREDICTIONS;

D O I：

10.1080/15366367.2023.2261186

中图分类号：

C [社会科学总论];

学科分类号：

03 ; 0303 ;

摘要：

Complex Machine Learning (ML) models used to support decision-making in peer-to-peer (P2P) lending often lack clear, accurate, and interpretable explanations. While the game-theoretic concept of Shapley values and its computationally efficient variant Kernel SHAP may be employed for this aim, similarly to other existing methods, the latter makes the assumption that the features are independent. The assumption of uncorrelated features in credit risk management is fairly restrictive and, thus, prediction explanations coming from correlated features might result in highly misleading Shapley values, even when considering simple models. We therefore propose an evaluation of different dependent-feature estimation methods of Kernel SHAP for classification purposes in credit risk management. We show that dependent-feature estimation of Shapley values can improve the understanding of true prediction explanations, their robustness and is essential for better identifying the most relevant variables to default predictions coming from black-box ML models. We propose estimation of feature-dependent Shapley values for P2P credit risk managementWe consider different linear and non-linear predictive models with varying degrees of dependenceDependent feature estimation of Shapley values can improve prediction explanations and their robustnessLoan amount and interest rate are the most determinant features to loan default prediction explanations

引用

页码：315 / 340

页数：26

共 50 条

[31] Explainable Machine Learning in Deployment
Bhatt, Umang
Xiang, Alice
Sharma, Shubham
Weller, Adrian
Taly, Ankur
Jia, Yunhan
Ghosh, Joydeep
Puri, Ruchir
Moura, Jose M. F.
Eckersley, Peter
FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 648 - 657
[32] Credit default prediction of Chinese real estate listed companies based on explainable machine learning
Ma, Yuanyuan
Zhang, Pingping
Duan, Shaodong
Zhang, Tianjie
FINANCE RESEARCH LETTERS, 2023, 58
[33] Enhancing transparency and fairness in automated credit decisions: an explainable novel hybrid machine learning approach
Nwafor, Chioma Ngozi
Nwafor, Obumneme
Brahma, Sanjukta
SCIENTIFIC REPORTS, 2024, 14 (01):
[34] Explainable machine learning for labquake prediction using catalog-driven features
Karimpouli, Sadegh
Caus, Danu
Grover, Harsh
Martinez-Garzon, Patricia
Bohnhoff, Marco
Beroza, Gregory C.
Dresen, Georg
Goebel, Thomas
Weigel, Tobias
Kwiatek, Grzegorz
EARTH AND PLANETARY SCIENCE LETTERS, 2023, 622
[35] Two machine learning models based on explainable features to reduce PSQA workload
Miori, Gloria
Chieregato, Matteo
Maio, Rosaria
Andreoli, Francesca
Galelli, Marco
RADIOTHERAPY AND ONCOLOGY, 2024, 194 : S4451 - S4455
[36] An Explainable Machine Learning Model for Material Backorder Prediction in Inventory Management
Ntakolia, Charis
Kokkotis, Christos
Karlsson, Patrik
Moustakidis, Serafeim
SENSORS, 2021, 21 (23)
[37] Leveraging explainable machine learning for enhanced management of lake water quality
Hasani, Sajad Soleymani
Arias, Mauricio E.
Nguyen, Hung Q.
Tarabih, Osama M.
Welch, Zachariah
Zhang, Qiong
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 370
[38] An explainable machine learning pipeline for backorder prediction in inventory management systems
Charis, Ntakolia
Christos, Kokkotis
Serafeim, Moustakidis
Elpiniki, Papageorgiou
25TH PAN-HELLENIC CONFERENCE ON INFORMATICS WITH INTERNATIONAL PARTICIPATION (PCI2021), 2021, : 229 - 234
[39] RISK STRATIFICATION FOR PATIENTS WITH MYASTHENIA GRAVIS: AN EXPLAINABLE MACHINE LEARNING MODEL
Zhong, Huahua
Ruan, Zhe
Lv, Zhiguo
Zheng, Xueying
Xi, Jianying
Song, Jie
Yan, Chong
Luo, Lijun
Chu, Lan
Tan, Song
Zhang, Chao
Bu, Bitao
Luo, Sushan
Chang, Ting
Zhao, Chongbo
MUSCLE & NERVE, 2022, 66 : S65 - S65
[40] Predicting the risk of diabetic retinopathy using explainable machine learning algorithms
Islam, Md. Merajul
Rahman, Md. Jahanur
Rabby, Md. Symun
Alam, Md. Jahangir
Pollob, S. M. Ashikul Islam
Ahmed, N. A. M. Faisal
Tawabunnahar, Most.
Roy, Dulal Chandra
Shin, Junpil
Maniruzzaman, Md.
DIABETES & METABOLIC SYNDROME-CLINICAL RESEARCH & REVIEWS, 2023, 17 (12)

← 1 2 3 4 5 →