A Hybrid Algorithm-Level Ensemble Model for Imbalanced Credit Default Prediction in the Energy Industry

被引:3
|
作者
Wang, Kui [1 ]
Wan, Jie [2 ,3 ]
Li, Gang [4 ]
Sun, Hao [2 ,3 ]
机构
[1] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Sci & Dev, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Publ Policy & Management, Beijing 100049, Peoples R China
[4] Northeastern Univ, Sch Business Adm, Shenyang 110819, Peoples R China
关键词
credit default prediction; energy industry; class imbalance; cost-sensitive; threshold method; FINANCIAL RATIOS; DISCRIMINANT-ANALYSIS;
D O I
10.3390/en15145206
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Credit default prediction for the energy industry is essential to promoting the healthy development of the energy industry in China. While previous studies have constructed various credit default prediction models with brilliant performance, the class-imbalance problem in the credit default dataset cannot be ignored, where the numbers of credit default cases are usually much smaller than the number of non-default ones. To address the class-imbalance problem, we proposed a novel CT-XGBoost model, which adds to XGBoost with two algorithm-level methods for class imbalance, including the cost-sensitive strategy and threshold method. Based on the credit default dataset consisting of energy corporates in western China, which suffers from the class-imbalance problem, the CT-XGBoost model achieves better performance than the conventional models. The results indicate that the proposed model can efficiently alleviate the inherent class-imbalance problem in the credit default dataset. Moreover, we analyze how the prediction performance is influenced by different parameter settings in the cost-sensitive strategy and threshold method. This study can help market investors and regulators precisely assess the credit risk in the energy industry and provides theoretical guidance to solving the class-imbalance problem in credit default prediction.
引用
收藏
页数:18
相关论文
共 21 条
  • [11] Balanced training of a hybrid ensemble method for imbalanced datasets: a case of emergency department readmission prediction
    Artetxe, Arkaitz
    Grana, Manuel
    Beristain, Andoni
    Rios, Sebastian
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (10) : 5735 - 5744
  • [12] Balanced training of a hybrid ensemble method for imbalanced datasets: a case of emergency department readmission prediction
    Arkaitz Artetxe
    Manuel Graña
    Andoni Beristain
    Sebastián Ríos
    Neural Computing and Applications, 2020, 32 : 5735 - 5744
  • [13] Prediction of credit risk with an ensemble model: a correlation-based classifier selection approach
    Xiong, Zhibin
    Huang, Jun
    JOURNAL OF MODELLING IN MANAGEMENT, 2022, 17 (04) : 1078 - 1097
  • [14] Hybrid Model for Credit Risk Prediction: An Application of Neural Network Approaches
    Chi, Guotai
    Uddin, Mohammad Shamsu
    Abedin, Mohammad Zoynul
    Yuan, Kunpeng
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2019, 28 (05)
  • [15] Class imbalance Bayesian model averaging for consumer loan default prediction: The role of soft credit information
    Weng, Futian
    Zhu, Miao
    Buckle, Mike
    Hajek, Petr
    Abedin, Mohammad Zoynul
    RESEARCH IN INTERNATIONAL BUSINESS AND FINANCE, 2025, 74
  • [16] Proposing an innovative real-time monitoring approach for the parameter stability of credit-default prediction model
    Liu, Linzi
    Lai, Xin
    Cai, Xiaofang
    Huang, Wei
    Miao, Weimin
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2024, 75 (11) : 2115 - 2126
  • [17] An Ensemble Data-Model-Label Three-Level Regularization Framework for Imbalanced Intelligent Fault Diagnosis
    Luo, Yixiong
    Shi, Jianhua
    Tan, Jinbiao
    Ren, Zijie
    Wan, Jiafu
    Safran, Mejdl
    Alqahtani, Salman A.
    IEEE TRANSACTIONS ON RELIABILITY, 2024, : 1 - 13
  • [18] A new hybrid ensemble model with voting-based outlier detection and balanced sampling for credit scoring
    Zhang, Wenyu
    Yang, Dongqi
    Zhang, Shuai
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 174
  • [19] A hybrid approach to integrate genetic algorithm into dual scoring model in enhancing the performance of credit scoring model
    Chi, Bo-Wen
    Hsu, Chiun-Chieh
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) : 2650 - 2661
  • [20] Enterprise credit risk prediction using supply chain information: A decision tree ensemble model based on the differential sampling rate, Synthetic Minority Oversampling Technique and AdaBoost
    Yao, Gang
    Hu, Xiaojian
    Zhou, Taiyun
    Zhang, Yue
    EXPERT SYSTEMS, 2022, 39 (06)