Forecasting China bond default with severe class-imbalanced data: A simple learning model with causal inference

被引:1
作者
Peng, Michael [1 ]
Stern, Elisheva R. [2 ]
Hu, Hanwen [1 ]
机构
[1] Greifenberg Analyt, 5307 Victoria Pl, Vancouver, BC, Canada
[2] Ursinus Coll, 601 E Main St, Collegeville, PA 19426 USA
关键词
China bond market; Default predictions; Credit risk; Machine-learning; Class imbalance; Ensemble method; Causal inference; Model interpretability; FINANCIAL RATIOS; PREDICTION; CLASSIFICATION; OPTIMIZATION; BANKRUPTCY; RISK;
D O I
10.1016/j.econmod.2024.106985
中图分类号
F [经济];
学科分类号
02 ;
摘要
We develop a parsimonious machine-learning model to forecast bond defaults in China, addressing class imbalance and endogeneity issues widely overlooked in similar research. Using data from 2014-2023, we construct over 70 potential predictors and refine the standard ensemble method by training models on class- balanced sub-samples generated by bootstrapping before aggregating them for the final prediction. Besides superior model performance, the study's contribution lies in transcending the associative nature of comparable studies by conducting sensitivity tests and causal inference to improve interpretability and robustness. One economic insight is that China's institutional constraints, like ownership type (and the associated implicit guarantee), create "common causes"which introduce bias and distortions in traditional models. An important policy implication is that the risk of state-owned firms, whose connections afford them to assume a higher debt ratio, are well underestimated, particularly during economic downturns or deregulation, whereas a bailout is less likely.
引用
收藏
页数:16
相关论文
共 49 条
[1]  
Alkhawaldeh Ibraheem M, 2023, World J Methodol, V13, P373, DOI 10.5662/wjm.v13.i5.373
[2]   FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND PREDICTION OF CORPORATE BANKRUPTCY [J].
ALTMAN, EI .
JOURNAL OF FINANCE, 1968, 23 (04) :589-609
[3]   Are hazard models superior to traditional bankruptcy prediction approaches? A comprehensive test [J].
Bauer, Julian ;
Agarwal, Vineet .
JOURNAL OF BANKING & FINANCE, 2014, 40 :432-442
[4]   CatBoost model and artificial intelligence techniques for corporate failure prediction [J].
Ben Jabeur, Sami ;
Gharib, Cheima ;
Mefteh-Wali, Salma ;
Ben Arfi, Wissal .
TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2021, 166
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   An experimental comparison of classification algorithms for imbalanced credit scoring data sets [J].
Brown, Iain ;
Mues, Christophe .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) :3446-3453
[7]  
Chava S., 2004, Review of Finance, V8, P537, DOI 10.1093/rof/8.4.537
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]   A Class of Discrete Transformation Survival Models With Application to Default Probability Prediction [J].
Ding, A. Adam ;
Tian, Shaonan ;
Yu, Yan ;
Guo, Hui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (499) :990-1003