Machine learning interpretability for a stress scenario generation in credit scoring based on counterfactuals

被引：18

作者：

Bueff, Andreas C. ^{[1
]}

Cytrynski, Mateusz ^{[2
]}

Calabrese, Raffaella ^{[2
]}

Jones, Matthew ^{[3
]}

Roberts, John ^{[3
]}

Moore, Jonathon ^{[3
]}

Brown, Iain ^{[4
,5
]}

机构：

[1] Univ Edinburgh, Sch Informat, 10 Crichton St, Edinburgh EH8 9AB, Scotland

[2] Univ Edinburgh, Business Sch, 29 Buccleuch Pl, Edinburgh EH8 9JS, Scotland

[3] Nationwide Bldg Soc, Nationwide Bldg Soc Headquarters,Pipers Way, Swindon SN3 1TA, Wiltshire, England

[4] SAS, Wittington House, Wittington House,Henley Rd, Marlow SL7 2EB, Buckinghamshire, England

[5] Univ Southampton, Hartley Lib B12, Univ Rd,Highfield, Southampton SO17 1BJ, Hampshire, England

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2022年 / 202卷

基金：

英国工程与自然科学研究理事会;

关键词：

OR in banking; Interpretable ML; Credit scoring; Stress scenario; CLASSIFICATION ALGORITHMS; MODELS;

D O I：

10.1016/j.eswa.2022.117271

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To boost the application of machine learning (ML) techniques for credit scoring models, the blackbox problem should be addressed. The primary aim of this paper is to propose a measure based on counterfactuals to evaluate the interpretability of a ML credit scoring technique. Counterfactuals assist with understanding the model with regard to the classification decision boundaries and evaluate model robustness. The second contribution is the development of a data perturbation technique to generate a stress scenario.We apply these two proposals to a dataset on UK unsecured personal loans to compare logistic regression and stochastic gradient boosting (SBG). We show that training a blackbox model (SGB) as conditioned on our data perturbation technique can provide insight into model performance under stressed scenarios. The empirical results show that our interpretability measure is able to capture the classification decision boundary, unlike AUC and the classification accuracy widely used in the banking sector.

引用

页数：14

共 54 条

[1] Genetic programming for credit scoring: The case of Egyptian public sector banks [J].

Abdou, Hussein A. .

EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (09) :11402-11417

[2] Credit Risk Analysis Using Machine and Deep Learning Models [J].

Addo, Peter Martey ;

Guegan, Dominique ;

Hassani, Bertrand .

RISKS, 2018, 6 (02)

[3]

[Anonymous], 2011, OCC Bulletin 2011-12

[4]

[Anonymous], 2018, COMPUT RES REPOS

[5]

Baesens B, 2016, CREDIT RISK ANAL MEA

[6]

Bank of England FCA, 2019, MACH LEARN UK FIN SE

[7] Forecasting and stress testing credit card default using dynamic models [J].

Bellotti, Tony ;

Crook, Jonathan .

INTERNATIONAL JOURNAL OF FORECASTING, 2013, 29 (04) :563-574

[8]

Bracke P., 2019, RISK MANAGEMENT ANAL

[9] An experimental comparison of classification algorithms for imbalanced credit scoring data sets [J].

Brown, Iain ;

Mues, Christophe .

EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) :3446-3453

[10] Explainable Machine Learning in Credit Risk Management [J].

Bussmann, Niklas ;

Giudici, Paolo ;

Marinelli, Dimitri ;

Papenbrock, Jochen .

COMPUTATIONAL ECONOMICS, 2021, 57 (01) :203-216

← 1 2 3 4 5 6 →