An Explainable ADASYN-Based Focal Loss Approach for Credit Assessment

Cited by: 0

Authors:

Shahee, Shaukat Ali [1]

Patel, Rujavi [2]

Affiliations:

[1] Indian Inst Management Kashipur, Informat Technol & Syst, Kashipur, India

[2] Indian Inst Management Kashipur, Admiss Dept, Kashipur, India

Source:

JOURNAL OF FORECASTING | 2025

Keywords:

ADASYN; class imbalance; credit risk assessment; focal loss; interpretable machine learning; SHAP (Shapley additive explanations); NEURAL-NETWORK; OVERSAMPLING TECHNIQUE; FEATURE-SELECTION; CLASSIFICATION; MACHINE

DOI:

10.1002/for.3252

Chinese Library Classification:

F [Economics]

Subject classification code:

02

Abstract:

The integration of deep learning techniques with financial technology (fintech) has revolutionized credit risk analysis, a critical component of financial risk management. A pervasive challenge in credit risk assessment is the skewed distribution of data, which hinders accurate predictions, particularly for minority-class instances. Various solutions to class imbalance have been proposed in the literature, albeit with limitations. Focal loss is a well-known loss function for handling class imbalance by tuning the hyperparameter $\gamma$. However, an imbalance still remains in the number of hard-to-learn observations between the classes. In this paper, we propose integrating ADASYN with focal loss to mitigate class imbalance and enhance credit scoring accuracy. ADASYN systematically generates synthetic data based on hard-to-learn examples to counter skewed distributions, while focal loss prioritizes the training of challenging examples, fostering more balanced model performance. The approach has been tested on real-world imbalanced datasets and credit assessment data, and the outcomes have been compared against a range of sampling technique and loss function combinations. The results show that the proposed strategy outperforms the other approaches. Although improving the accuracy of credit risk analysis is critical, model interpretability is just as important for enabling financial analysts to make sound decisions. To address this, we measure the global and local contributions of each feature using SHAP (Shapley additive explanations). According to global interpretability, the top 4 parameters influencing credit risk assessment are checking account status, loan purpose, borrower age, credit history, and interest rate/installment rate. Moreover, local interpretability analysis reveals quantitative and directional differences in feature contributions. These findings not only broaden our knowledge of credit assessment services but also highlight how important a role they could play in attracting new clients and generating income. This paper also highlights how the proposed approach may be scaled to other imbalanced real-world datasets, improving model performance in terms of AUC, G-mean, and F-measure.
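
As an illustration of how focal loss prioritizes hard examples, the following minimal Python sketch implements the standard binary focal loss, $-\alpha_t (1 - p_t)^{\gamma} \log p_t$. The function name `binary_focal_loss`, the defaults `gamma=2.0` and `alpha=0.25`, and the toy inputs are assumptions for illustration only; the abstract does not state the paper's settings or implementation.

```python
import math


def binary_focal_loss(y_true, p_pred, gamma=2.0, alpha=0.25, eps=1e-12):
    """Mean binary focal loss over a batch.

    y_true: iterable of 0/1 labels.
    p_pred: iterable of predicted probabilities for class 1.
    gamma:  focusing parameter; larger values down-weight easy examples more.
    alpha:  weight for the positive class (1 - alpha for the negative class).
    """
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1.0 - eps)          # clip to avoid log(0)
        p_t = p if y == 1 else 1.0 - p           # probability given to the true class
        a_t = alpha if y == 1 else 1.0 - alpha   # class-dependent weight
        total += -a_t * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(y_true)


# Hypothetical toy comparison: a well-classified vs. a misclassified positive.
easy = binary_focal_loss([1], [0.9])
hard = binary_focal_loss([1], [0.1])
print(f"easy example loss: {easy:.6f}, hard example loss: {hard:.6f}")
```

With `gamma=0` and `alpha=0.5` the function reduces to half the ordinary cross-entropy; raising `gamma` shrinks the contribution of well-classified examples relative to misclassified ones.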

Pages: 18
