共 64 条
Enterprise credit risk prediction using supply chain information: A decision tree ensemble model based on the differential sampling rate, Synthetic Minority Oversampling Technique and AdaBoost
被引:29
作者:
Yao, Gang
[1
]
Hu, Xiaojian
[1
,2
]
Zhou, Taiyun
[3
]
Zhang, Yue
[1
]
机构:
[1] Hefei Univ Technol, Sch Management, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Key Lab Proc Optimizat & Intelligent Decis Making, Minist Educ, Hefei, Peoples R China
[3] Anhui Agr Univ, Sch Econ & Management, Hefei, Peoples R China
基金:
中国国家自然科学基金;
关键词:
class imbalance;
ensemble learning;
enterprise credit risk prediction;
supply chain;
FINANCIAL DISTRESS PREDICTION;
SCORING MODEL;
SMOTE;
RATIOS;
SECTOR;
D O I:
10.1111/exsy.12953
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
The spread of enterprise credit risk in the supply chain may lead to large-scale bankruptcy and credit crises, which are related to national economic and social stability and financial system security. Therefore, enterprise credit risk in the supply chain context is not only a concern for banking financial institutions, credit rating agencies and enterprise managers but also the focus of governments. This article develops a DTE-DSA (decision tree [DT] ensemble model using the differential sampling rate, Synthetic Minority Oversampling Technique [SMOTE] and AdaBoost) prediction framework integrating supply chain information to predict enterprise credit risk. The empirical test shows that using supply chain information can significantly improve the prediction score. The DTE-DSA model has the best prediction effect in dealing with class imbalance problems. Compared with single classifier models-such as logistic regression, k-nearest neighbours, support vector machine, DT and DT using the SMOTE-as well as ensemble models-such as extremely randomized trees, random forest, rotation forest, extreme gradient boosting, gradient boosting DT and DT ensemble model using AdaBoost-the DTE-DSA model not only has the best prediction score but also has a more stable performance. The comprehensive use of supply chain information and the DTE-DSA model can result in the highest prediction score, with an area under the curve of 0.9016 and a Kolmogorov-Smirnov statistic of 0.7369. Further analysis of the variables of importance enhances the interpretability of the model and obtains relevant management insights.
引用
收藏
页数:29
相关论文