Making Deep Learning-Based Predictions for Credit Scoring Explainable

被引：30

作者：

Dastile, Xolani ^{[1
]}

Celik, Turgay ^{[2
,3
,4
]}

机构：

[1] Univ Witwatersrand, Sch Comp Sci & Appl Math, ZA-2000 Johannesburg, South Africa

[2] Univ Witwatersrand, Sch Elect & Informat Engn, ZA-2000 Johannesburg, South Africa

[3] Univ Witwatersrand, Wits Inst Data Sci, ZA-2000 Johannesburg, South Africa

[4] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610031, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Deep learning; Predictive models; Machine learning; Data models; Two dimensional displays; Tools; Neural networks; Credit scoring; convolutional neural network; deep learning; explainable artificial intelligence; MACHINE; MODEL;

D O I：

10.1109/ACCESS.2021.3068854

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Credit scoring has become an important risk management tool for money lending institutions. Over the years, statistical and classical machine learning models have been the most researched risk management tools in credit scoring literature, and recently the focus has turned to deep learning models. This transition is due to better performances that are shown by deep learning models in different domains. Despite deep learning models' superior performances, there is still a need for explaining how these models make their predictions. The non-transparency nature of deep learning models has created a bottleneck for their use in credit scoring. Explanations of decisions are important for lending institutions since it is a requirement for automated decisions that are generated by non-transparent models to be explained. The other issue in using deep learning models, specifically 2D Convolutional Neural Networks (CNNs), in credit scoring is the need to have the data in image format. We propose an explainable deep learning model for credit scoring which can harness the performance benefits offered by deep learning and yet comply with the legislation requirements for the automated decision-making processes. The proposed method converts tabular datasets into images and thus allowing the application of 2D CNNs in credit scoring. Each pixel of the image corresponds to a feature bin of the tabular dataset. The predictions from the 2D CNNs were explained using state-of-the-art explanation methods. Furthermore, explanations were evaluated using a sanity check methodology and also performances of the explanation methods were compared quantitatively. The proposed explainable deep learning model outperforms the other credit scoring methods on publicly available credit scoring datasets.

引用

页码：50426 / 50440

页数：15

共 46 条

[1] Classifiers consensus system approach for credit scoring
Ala'raj, Maher
Abbod, Maysam F.
[J]. KNOWLEDGE-BASED SYSTEMS, 2016, 104 : 89 - 105
[2] [Anonymous], 2018, Advances in Neural Information Processing Systems (NeurIPS
[3] Basel, 2006, INT CONVERGENCE CAPI
[4] Statistical modeling: The two cultures
Breiman, L
[J]. STATISTICAL SCIENCE, 2001, 16 (03) : 199 - 215
[5] Brier Glenn W., 1950, Monthly Weather Review, V78, P1, DOI 10.1175/1520-0493(1950)078<0001:vofeit>2.0.co
[6] 2
[7] Chari S., 2020, ARXIV200307523
[8] The devil is in the details: an evaluation of recent feature encoding methods
Chatfield, Ken
Lempitsky, Victor
Vedaldi, Andrea
Zisserman, Andrew
[J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
[9] Chollet F., 2018, Deep Learning with Python, DOI DOI 10.1007/978-1-4842-2766-4
[10] Statistical and machine learning models in credit scoring: A systematic literature survey
Dastile, Xolani
Celik, Turgay
Potsane, Moshe
[J]. APPLIED SOFT COMPUTING, 2020, 91

← 1 2 3 4 5 →