Deep learning for survival and competing risk modelling

Cited by: 22
Authors
Blumenstock, Gabriel [1]
Lessmann, Stefan [1]
Seow, Hsin-Vonn [2]
Affiliations
[1] Humboldt Univ, Sch Business & Econ, Unter Linden 6, D-10099 Berlin, Germany
[2] Univ Nottingham Malaysia, Nottingham Univ Business Sch, Semenyih, Malaysia
Keywords
survival analysis; competing risk model; deep learning; mortgage risk; CREDIT RISK; DEFAULT; REGRESSION; TERMINATION; PREPAYMENT; IF;
DOI
10.1080/01605682.2020.1838960
Chinese Library Classification (CLC) number
C93 [Management Science];
Discipline classification codes
12 ; 1201 ; 1202 ; 120202 ;
Abstract
The article examines novel machine learning techniques for survival analysis in a credit risk modelling context. Using a large dataset of US mortgages, we evaluate the adequacy of DeepHit, a deep learning-based competing risk model, and random survival forests. The observed results provide strong evidence that both models predict default and prepayment risk more accurately than statistical benchmarks in the form of the Cox proportional hazard model and the Fine and Gray model. The superiority of the machine learning models is robust across different periods including stressed periods. We also find machine learning models do not require larger amounts of training data than the statistical benchmarks. Finally, we extend methods for estimating feature importance scores to deep neural networks for survival analysis and clarify which covariates determine the estimated survival functions of DeepHit. An online companion with additional results is available in .
Pages: 26-38
Number of pages: 13
Related papers
30 in total
[1]   Credit Risk Analysis Using Machine and Deep Learning Models [J].
Addo, Peter Martey ;
Guegan, Dominique ;
Hassani, Bertrand .
RISKS, 2018, 6 (02)
[2]   A time-dependent discrimination index for survival data [J].
Antolini, L ;
Boracchi, P ;
Biganzoli, E .
STATISTICS IN MEDICINE, 2005, 24 (24) :3927-3944
[3]   Practical recommendations for reporting Fine-Gray model analyses for competing risk data [J].
Austin, Peter C. ;
Fine, Jason P. .
STATISTICS IN MEDICINE, 2017, 36 (27) :4391-4400
[4]   Neural network survival analysis for personal loan data [J].
Baesens, B ;
Van Gestel, T ;
Stepanova, M ;
Van den Poel, D ;
Vanthienen, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2005, 56 (09) :1089-1098
[5]  
Banasik J, 1999, J OPER RES SOC, V50, P1185
[6]   Credit scoring with macroeconomic variables using survival analysis [J].
Bellotti, T. ;
Crook, J. .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 (12) :1699-1707
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Cao R, 2009, SORT-STAT OPER RES T, V33, P3
[9]   The termination of commercial mortgage contracts through prepayment and default: A proportional hazard approach with competing risks [J].
Ciochetti, BA ;
Deng, YH ;
Gao, B ;
Yao, R .
REAL ESTATE ECONOMICS, 2002, 30 (04) :595-633
[10]  
Cohen B H., 2017, BIS Quarterly Review