Gradient boosting survival tree with applications in credit scoring

被引:25
|
作者
Bai, Miaojun [1 ]
Zheng, Yan [1 ]
Shen, Yun [1 ]
机构
[1] 360 DigiTech Inc, Shanghai, Peoples R China
关键词
Boosting; survival tree ensemble; credit scoring; survival analysis;
D O I
10.1080/01605682.2021.1919035
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Credit scoring plays a vital role in the field of consumer finance. Survival analysis provides an advanced solution to the credit-scoring problem by quantifying the probability of survival time. In order to deal with highly heterogeneous industrial data collected in Chinese market of consumer finance, we propose a nonparametric ensemble tree model called gradient boosting survival tree (GBST) that extends the survival tree models with a gradient boosting algorithm. The survival tree ensemble is learned by minimising the negative log-likelihood in an additive manner. The proposed model optimises the survival probability simultaneously for each time period, which can reduce the overall error significantly. Finally, as a test of the applicability, we apply the GBST model to quantify the credit risk with large-scale real market datasets. The results show that the GBST model outperforms the existing survival models measured by the concordance index (C-index), Kolmogorov-Smirnov (KS) index, as well as by the area under the receiver operating characteristic curve (AUC) of each time period.
引用
收藏
页码:39 / 55
页数:17
相关论文
共 50 条
  • [21] CATE: Contrastive augmentation and tree-enhanced embedding for credit scoring
    Gao, Ying
    Xiao, Haolang
    Zhan, Choujun
    Liang, Lingrui
    Cai, Wentian
    Hu, Xiping
    INFORMATION SCIENCES, 2023, 651
  • [22] A new credit scoring method based on rough sets and decision tree
    Zhou, XiYue
    Zhang, DeFu
    Jiang, Yi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 1081 - 1089
  • [23] Tree-based heterogeneous cascade ensemble model for credit scoring
    Liu, Wanan
    Fan, Hong
    Xia, Meng
    INTERNATIONAL JOURNAL OF FORECASTING, 2023, 39 (04) : 1593 - 1614
  • [24] Applications of Artificial Intelligence Technologies in Credit Scoring: a Survey of Literature
    Chen, Bili
    Zeng, Wenhua
    Lin, Yangbin
    2014 10TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2014, : 658 - 664
  • [25] A novel tree-based dynamic heterogeneous ensemble method for credit scoring
    Xia, Yufei
    Zhao, Junhao
    He, Lingyun
    Li, Yinguo
    Niu, Mengyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
  • [26] An Application of Locally Linear Model Tree Algorithm for Predictive Accuracy of Credit Scoring
    Siami, Mohammad
    Gholamian, Mohammad Reza
    Basiri, Javad
    Fathian, Mohammad
    MODEL AND DATA ENGINEERING, 2011, 6918 : 133 - +
  • [27] REGULARISATION IN DISCRETE SURVIVAL MODELS: A COMPARISON OF LASSO AND GRADIENT BOOSTING
    Bere, Alphonce
    Sithuba, Godfrey H.
    Mabvuu, Coster
    Mashabela, Retang
    Sigauke, Caston
    Kyei, Kwabena
    SOUTH AFRICAN STATISTICAL JOURNAL, 2021, 55 (01) : 29 - 44
  • [28] Time to default in credit scoring using survival analysis: a benchmark study
    Dirick, Lore
    Claeskens, Gerda
    Baesens, Bart
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2017, 68 (06) : 652 - 665
  • [29] A Fast Survival Support Vector Regression Approach to Large Scale Credit Scoring via Safe Screening
    Wang, Hong
    Hong, Ling
    BIG DATA, 2024,
  • [30] FROM CREDIT SCORING TO REGULATORY SCORING: COMPARING CREDIT SCORING MODELS FROM A REGULATORY PERSPECTIVE
    Xia, Yufei
    Liao, Zijun
    Xu, Jun
    LI, Yinguo
    TECHNOLOGICAL AND ECONOMIC DEVELOPMENT OF ECONOMY, 2022, 28 (06) : 1954 - 1990