Machine learning predictivity applied to consumer creditworthiness

Cited by: 16
Authors
Aniceto, Maisa Cardoso [1]
Barboza, Flavio [2]
Kimura, Herbert [1]
Affiliations
[1] Univ Brasilia, Dept Management, Campus Darcy Ribeiro North Wing, BR-70910900 Brasilia, DF, Brazil
[2] Univ Fed Uberlandia, Sch Business & Management, Av Joao Naves de Avila 2121, BR-38400902 Uberlandia, MG, Brazil
Keywords
Machine learning; Credit risk; Consumer lending; Default prediction; Performance analysis; FEATURE-SELECTION; CREDIT; DEFAULT; INFORMATION; MODELS
DOI
10.1186/s43093-020-00041-w
Chinese Library Classification (CLC) number
F [Economics]
Discipline classification code
02
Abstract
Credit risk evaluation plays an important role for financial institutions, since lending may result in real and immediate losses. In particular, default prediction is one of the most challenging activities in managing credit risk. This study analyzes the adequacy of borrower classification models using a Brazilian bank's loan database and explores machine learning techniques. We develop Support Vector Machine, Decision Tree, Bagging, AdaBoost and Random Forest models, and compare their predictive accuracy with a benchmark based on a Logistic Regression model. Comparisons are based on the usual classification performance metrics. Our results show that Random Forest and AdaBoost perform better than the other models. Moreover, Support Vector Machine models perform poorly with both linear and nonlinear kernels. Our findings suggest that there are value-creating opportunities for banks to improve default prediction models by exploring machine learning techniques.
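The abstract says the models are compared on usual classification performance metrics but does not list them. As a minimal sketch, assuming those metrics include accuracy, precision, recall and F1 computed from a confusion matrix, the comparison between a candidate model and a benchmark could look like the following. The function name `classification_metrics`, the label encoding (1 = default, 0 = non-default) and the toy labels are illustrative assumptions, not the authors' code or data.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall and F1 for binary labels.

    `positive` marks the class of interest (assumed here: 1 = default).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one observation is required")

    # Confusion-matrix counts.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    tn = len(y_true) - tp - fp - fn

    accuracy = (tp + tn) / len(y_true)
    # Ratios with an empty denominator are reported as 0.0 by convention here.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Toy labels, purely illustrative (not taken from the paper's loan database).
    y_true = [1, 0, 0, 1, 0, 1, 0, 0]
    predictions = {
        "benchmark (hypothetical)": [0, 0, 0, 0, 0, 1, 0, 0],
        "candidate (hypothetical)": [1, 0, 0, 0, 0, 1, 1, 0],
    }
    for name, y_pred in predictions.items():
        print(name, classification_metrics(y_true, y_pred))
```

Because defaults are usually the minority class in loan data, accuracy alone can favor a model that rarely predicts default, which is why recall and F1 on the default class are reported alongside it in this sketch.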
Pages: 14
Related papers
35 in total
[1]   Credit Risk Analysis Model in Microfinance Institutions in Peru Through the use of Bayesian Networks [J].
Alarcon Morales, Eduardo ;
Mora Ramos, Brian ;
Armas Aguirre, Jimmy ;
Mauricio Sanchez, David .
2019 CONGRESO INTERNACIONAL DE INNOVACION Y TENDENCIAS EN INGENIERIA (CONIITI), 2019
[2]   Classification Algorithms in Financial Application: Credit Risk Analysis on Legal Entities [J].
Assef, Fernanda ;
Steiner, Maria Teresinha ;
Steiner Neto, Pedro Jose ;
Franco, David Gabriel de Barros .
IEEE LATIN AMERICA TRANSACTIONS, 2019, 17 (10) :1733-1740
[3]   MONOTONICITY MAINTENANCE IN INFORMATION-THEORETIC MACHINE LEARNING ALGORITHMS [J].
BENDAVID, A .
MACHINE LEARNING, 1995, 19 (01) :29-43
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Central Bank of Brazil, 2007, ANN REPORT
[7]  
Central Bank of Brazil, 2020, CONS PERS LOAN
[8]   The Relevance of Soft Information for Predicting Small Business Credit Default: Evidence from a Social Bank [J].
Cornee, Simon .
JOURNAL OF SMALL BUSINESS MANAGEMENT, 2019, 57 (03) :699-719
[9]   Instance sampling in credit scoring: An empirical study of sample size and balancing [J].
Crone, Sven F. ;
Finlay, Steven .
INTERNATIONAL JOURNAL OF FORECASTING, 2012, 28 (01) :224-238
[10]  
Damrongsakmethee T, 2019, ADV INTELLIGENT SYST, P216