An Explainable ADASYN-Based Focal Loss Approach for Credit Assessment

Authors
Shahee, Shaukat Ali [1]
Patel, Rujavi [2]
Affiliations
[1] Indian Inst Management Kashipur, Informat Technol & Syst, Kashipur, India
[2] Indian Inst Management Kashipur, Admiss Dept, Kashipur, India
Keywords
ADASYN; class imbalance; credit risk assessment; focal loss; interpretable machine learning; SHAP (Shapley additive explanations); neural network; oversampling technique; feature selection; classification; machine
DOI
10.1002/for.3252
Chinese Library Classification
F [Economics]
Discipline classification code
02
Abstract
The integration of deep learning techniques with financial technology (fintech) has revolutionized credit risk analysis, a critical component of financial risk management. A pervasive challenge in credit risk assessment is the skewed class distribution of the data, which hinders accurate prediction, particularly for minority-class instances. Various solutions to class imbalance have been proposed in the literature, each with limitations. Focal loss is a well-known loss function for handling class imbalance by tuning the hyperparameter $\gamma$. However, an imbalance remains in the number of hard-to-learn observations between the classes. In this paper, we propose integrating ADASYN with focal loss to mitigate class imbalance and improve credit scoring accuracy. ADASYN generates synthetic data based on hard-to-learn examples to counter skewed distributions, while focal loss prioritizes the training of challenging examples, fostering more balanced model performance. The approach was tested on real-world imbalanced datasets and credit assessment data, and the outcomes were compared against a range of sampling-technique and loss-function combinations. The results show that the proposed strategy outperforms the other approaches. Although improving the accuracy of credit risk analysis is critical, model interpretability is just as important for enabling financial analysts to make sound decisions. To address this, we measured the global and local contributions of each feature using SHAP (Shapley additive explanations). According to the global interpretability analysis, the most influential features for credit risk assessment are checking account status, loan purpose, borrower age, credit history, and interest rate/installment rate. Moreover, the local interpretability analysis reveals differences in both the magnitude and the direction of feature contributions.
These revelations not only broaden our knowledge of credit assessment services but also highlight how important a role they could play in attracting new clients and generating income. This paper also highlights how the suggested approach may be scaled to other imbalanced real-world datasets, demonstrating how it can improve model performance in terms of AUC, G-mean, and F-measure.
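To make the role of $\gamma$ concrete, the following is a minimal NumPy sketch of the standard binary focal loss, $FL(p_t) = -\alpha_t (1 - p_t)^{\gamma} \log(p_t)$. The abstract does not give the paper's exact formulation or settings, so the function name `binary_focal_loss`, the default values `gamma=2.0` and `alpha=0.25`, and the toy labels and probabilities are all assumptions chosen for illustration.

```python
import numpy as np


def binary_focal_loss(y_true, p_pred, gamma=2.0, alpha=0.25, eps=1e-7):
    """Per-example binary focal loss (illustrative sketch, not the paper's code).

    y_true : array of 0/1 labels
    p_pred : array of predicted probabilities for class 1
    gamma  : focusing parameter; gamma = 0 gives weighted cross-entropy
    alpha  : weight on the positive class (assumed default)
    """
    y_true = np.asarray(y_true, dtype=float)
    # Clip probabilities so that log() never receives 0.
    p = np.clip(np.asarray(p_pred, dtype=float), eps, 1.0 - eps)
    # p_t is the probability the model assigns to the true class.
    p_t = np.where(y_true == 1, p, 1.0 - p)
    alpha_t = np.where(y_true == 1, alpha, 1.0 - alpha)
    # (1 - p_t)^gamma shrinks the loss of well-classified examples.
    return -alpha_t * (1.0 - p_t) ** gamma * np.log(p_t)


# Toy data (hypothetical): one easy and one hard example per class.
y = np.array([1, 1, 0, 0])
p = np.array([0.9, 0.3, 0.1, 0.7])
losses = binary_focal_loss(y, p)
print(losses)
```

With `gamma = 0` the modulating factor disappears and the loss reduces to class-weighted cross-entropy. Larger values of `gamma` shift more of the total loss onto examples the model currently gets wrong, which is the behavior the abstract describes as prioritizing challenging examples.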
Pages: 18