Forecasting Loan Default in Europe with Machine Learning

Cited by: 21
Authors
Barbaglia, Luca [1]
Manzan, Sebastiano [2]
Tosetti, Elisa [3]
Affiliations
[1] European Commission, Joint Research Centre, Via E. Fermi 2749, I-21027 Ispra (VA), Italy
[2] Baruch College, Zicklin School of Business, New York, NY, USA
[3] Ca' Foscari University of Venice, Venice, Italy
Keywords
big data; credit risk; loan default; machine learning; regional analysis; mortgage default; negative equity; models; delinquency; risk; regularization; termination; regression; crisis; tests
DOI
10.1093/jjfinec/nbab010
Chinese Library Classification number
F8 [Public Finance and Finance]
Discipline classification code
0202
Abstract
We use a dataset of 12 million residential mortgages to investigate loan default behavior in several European countries. We model the occurrence of default as a function of borrower characteristics, loan-specific variables, and local economic conditions. We compare the performance of a set of machine learning algorithms with that of logistic regression and find that they provide significantly better predictions. The most important variables in explaining loan default are the interest rate and local economic characteristics. The existence of substantial geographical heterogeneity in variable importance points to the need for regionally tailored risk-assessment policies in Europe.
Pages: 569-596
Page count: 28