Making Deep Learning-Based Predictions for Credit Scoring Explainable

Cited by: 30
Authors
Dastile, Xolani [1]
Celik, Turgay [2, 3, 4]
Affiliations
[1] Univ Witwatersrand, Sch Comp Sci & Appl Math, ZA-2000 Johannesburg, South Africa
[2] Univ Witwatersrand, Sch Elect & Informat Engn, ZA-2000 Johannesburg, South Africa
[3] Univ Witwatersrand, Wits Inst Data Sci, ZA-2000 Johannesburg, South Africa
[4] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610031, Peoples R China
Keywords
Deep learning; Predictive models; Machine learning; Data models; Two dimensional displays; Tools; Neural networks; Credit scoring; convolutional neural network; explainable artificial intelligence; MACHINE; MODEL
DOI
10.1109/ACCESS.2021.3068854
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Credit scoring has become an important risk management tool for money lending institutions. Over the years, statistical and classical machine learning models have been the most researched risk management tools in the credit scoring literature, and recently the focus has turned to deep learning models. This transition is due to the better performance shown by deep learning models in different domains. Despite the superior performance of deep learning models, there is still a need to explain how these models make their predictions. The non-transparent nature of deep learning models has created a bottleneck for their use in credit scoring. Explanations of decisions are important for lending institutions, since automated decisions generated by non-transparent models are required to be explained. The other issue in using deep learning models, specifically 2D Convolutional Neural Networks (CNNs), in credit scoring is the need to have the data in image format. We propose an explainable deep learning model for credit scoring which can harness the performance benefits offered by deep learning and yet comply with the legislative requirements for automated decision-making processes. The proposed method converts tabular datasets into images, thus allowing the application of 2D CNNs in credit scoring. Each pixel of the image corresponds to a feature bin of the tabular dataset. The predictions from the 2D CNNs were explained using state-of-the-art explanation methods. Furthermore, the explanations were evaluated using a sanity check methodology, and the performances of the explanation methods were compared quantitatively. The proposed explainable deep learning model outperforms the other credit scoring methods on publicly available credit scoring datasets.
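
The abstract states that tabular data are converted into images in which each pixel corresponds to a feature bin, but it does not give the procedure. The sketch below is one possible reading, not the authors' method: it assumes equal-width binning per feature and a one-hot layout with one image row per feature and one column per bin. The function names (`bin_index`, `row_to_image`, `table_to_images`), the bin count, and the toy data are all hypothetical.

```python
from typing import List, Sequence


def bin_index(value: float, low: float, high: float, n_bins: int) -> int:
    """Return the equal-width bin (0 .. n_bins-1) that `value` falls into."""
    if high <= low:
        return 0  # constant feature: everything lands in the first bin
    frac = (value - low) / (high - low)
    # Clamp so that value == high (or an out-of-range value) stays in range.
    return min(max(int(frac * n_bins), 0), n_bins - 1)


def row_to_image(row: Sequence[float], lows: Sequence[float],
                 highs: Sequence[float], n_bins: int) -> List[List[int]]:
    """Encode one record as a (n_features x n_bins) binary image.

    Pixel (i, j) is 1 when feature i falls into bin j, else 0.
    """
    image = []
    for value, low, high in zip(row, lows, highs):
        pixels = [0] * n_bins
        pixels[bin_index(value, low, high, n_bins)] = 1
        image.append(pixels)
    return image


def table_to_images(table: Sequence[Sequence[float]],
                    n_bins: int = 4) -> List[List[List[int]]]:
    """Convert every row of a numeric table into a binary image.

    Assumption: bin ranges are taken from the per-column min and max.
    """
    columns = list(zip(*table))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [row_to_image(row, lows, highs, n_bins) for row in table]


# Toy data (hypothetical): two numeric features per applicant.
table = [[20, 1000.0], [40, 3000.0], [60, 5000.0]]
images = table_to_images(table, n_bins=4)
```

Each resulting image could then be fed to a 2D CNN as a single-channel input. In practice, categorical features, missing values, and the choice of bin boundaries would need separate handling that this sketch does not attempt.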
Pages: 50426-50440
Page count: 15