Deep learning for survival and competing risk modelling

被引:22
作者
Blumenstock, Gabriel [1 ]
Lessmann, Stefan [1 ]
Seow, Hsin-Vonn [2 ]
机构
[1] Humboldt Univ, Sch Business & Econ, Unter Linden 6, D-10099 Berlin, Germany
[2] Univ Nottingham Malaysia, Nottingham Univ Business Sch, Semenyih, Malaysia
关键词
survival analysis; competing risk model; deep learning; mortgage risk; CREDIT RISK; DEFAULT; REGRESSION; TERMINATION; PREPAYMENT; IF;
D O I
10.1080/01605682.2020.1838960
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
The article examines novel machine learning techniques for survival analysis in a credit risk modelling context. Using a large dataset of US mortgages, we evaluate the adequacy of DeepHit, a deep learning-based competing risk model, and random survival forests. The observed results provide strong evidence that both models predict default and prepayment risk more accurately than statistical benchmarks in the form of the Cox proportional hazard model and the Fine and Gray model. The superiority of the machine learning models is robust across different periods including stressed periods. We also find machine learning models do not require larger amounts of training data than the statistical benchmarks. Finally, we extend methods for estimating feature importance scores to deep neural networks for survival analysis and clarify which covariates determine the estimated survival functions of DeepHit. An online companion with additional results is available in .
引用
收藏
页码:26 / 38
页数:13
相关论文
共 30 条
[11]  
Collobert R., 2008, P 25 INT C MACH LEAR, P160, DOI DOI 10.1145/1390156.1390177.ICML08
[12]   Mortgage termination: An empirical hazard model with a stochastic term structure [J].
Deng, YH .
JOURNAL OF REAL ESTATE FINANCE AND ECONOMICS, 1997, 14 (03) :309-331
[13]   Mortgage Prepayment and Default Behavior with Embedded Forward Contract Risks in China's Housing Market [J].
Deng, Yongheng ;
Liu, Peng .
JOURNAL OF REAL ESTATE FINANCE AND ECONOMICS, 2009, 38 (03) :214-240
[14]   Macro-Economic Factors in Credit Risk Calculations: Including Time-Varying Covariates in Mixture Cure Models [J].
Dirick, Lore ;
Bellotti, Tony ;
Claeskens, Gerda ;
Baesens, Bart .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2019, 37 (01) :40-53
[15]   Time to default in credit scoring using survival analysis: a benchmark study [J].
Dirick, Lore ;
Claeskens, Gerda ;
Baesens, Bart .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2017, 68 (06) :652-665
[16]  
Eder B., 2019, 26 C CRED RISK CRED
[17]   A proportional hazards model for the subdistribution of a competing risk [J].
Fine, JP ;
Gray, RJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) :496-509
[18]  
Fisher A, 2019, J MACH LEARN RES, V20
[19]   Variable importance in binary regression trees and forests [J].
Ishwaran, Hemant .
ELECTRONIC JOURNAL OF STATISTICS, 2007, 1 :519-537
[20]   Random survival forests for competing risks [J].
Ishwaran, Hemant ;
Gerds, Thomas A. ;
Kogalur, Udaya B. ;
Moore, Richard D. ;
Gange, Stephen J. ;
Lau, Bryan M. .
BIOSTATISTICS, 2014, 15 (04) :757-773