Research on User Default Prediction Algorithm Based on Adjusted Homogenous and Heterogeneous Ensemble Learning

被引:0
作者
Lu, Yao [1 ]
Wang, Kui [2 ]
Sun, Hui [1 ]
Qu, Hanwen [3 ]
Chen, Jiajia [4 ]
Liu, Wei [5 ]
Chang, Chenjie [3 ]
机构
[1] Xinjiang Univ, Sch Econ & Management, Urumqi 830046, Peoples R China
[2] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
[3] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi 830046, Peoples R China
[4] Xinjiang Changji Vocat & Tech Coll, Changji 831100, Peoples R China
[5] Xinjiang Univ, Coll Software, Urumqi 830046, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 13期
关键词
default prediction; ensemble learning; homogeneous ensemble; heterogeneous ensemble; CREDIT;
D O I
10.3390/app14135711
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the field of risk assessment, the traditional econometric models are generally used to assess credit risk. And with the introduction of the "dual-carbon" goals to promote the development of a low-carbon economy, the scale of green credit in China has rapidly expanded. But with the advent of the big data era, due to the poor interpretability of a traditional single machine learning model, it is difficult to capture nonlinear relationships, and there are shortcomings in prediction accuracy and robustness. This paper selects the adjusted ensemble learning model based on the homogeneous and heterogeneous factors for user default prediction, which can efficiently process large quantities of high-dimensional data. This article adjusts each model to adapt to the task and innovatively compares various models. In this paper, the missing value filling method, feature selection, and ensemble model are studied and discussed, and the optimal ensemble model is obtained. When comparing the predictions of single models and ensemble models, the accuracy, sensitivity, specificity, F1-Score, Kappa, and MCC of Categorical Features Gradient Boosting (CatBoost) and Random undersampling Boosting (RUSBoost) all reach 100%. The experimental results prove that the algorithm based on adjusted homogeneous and heterogeneous ensemble learning can predict the user default efficiently and accurately. This paper also provides some references for establishing a risk assessment index system.
引用
收藏
页数:11
相关论文
共 27 条
  • [1] Combining weighted SMOTE with ensemble learning for the class-imbalanced prediction of small business credit risk
    Abedin, Mohammad Zoynul
    Guotai, Chi
    Hajek, Petr
    Zhang, Tong
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3559 - 3579
  • [2] Credit default prediction using a support vector machine and a probabilistic neural network
    Abedin, Mohammad Zoynul
    Guotai, Chi
    Colombage, Sisira
    Fahmida-E-Moula
    [J]. JOURNAL OF CREDIT RISK, 2018, 14 (02): : 1 - 27
  • [3] Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction
    Alonso Robisco, Andres
    Carbo Martinez, Jose Manuel
    [J]. FINANCIAL INNOVATION, 2022, 8 (01)
  • [4] A Longitudinal Systematic Review of Credit Risk Assessment and Credit Default Predictors
    Calli, Busra Alma
    Coskun, Erman
    [J]. SAGE OPEN, 2021, 11 (04):
  • [5] [曹莹 Cao Ying], 2013, [自动化学报, Acta Automatica Sinica], V39, P745
  • [6] A survey on ensemble learning
    Dong, Xibin
    Yu, Zhiwen
    Cao, Wenming
    Shi, Yifan
    Ma, Qianli
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (02) : 241 - 258
  • [7] Gao X., 2021, Res. Sq, DOI [10.21203/rs.3.rs-724813/v1, DOI 10.21203/RS.3.RS-724813/V1]
  • [8] Stacking Ensemble-Based Intelligent Machine Learning Model for Predicting Post-COVID-19 Complications
    Gupta, Aditya
    Jain, Vibha
    Singh, Amritpal
    [J]. NEW GENERATION COMPUTING, 2022, 40 (04) : 987 - 1007
  • [9] A novel hybrid ensemble model based on tree-based method and deep learning method for default prediction
    He, Hongliang
    Fan, Yanli
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
  • [10] Impact of green credit on industrial structure in China: theoretical mechanism and empirical analysis
    Hu, Yiqin
    Jiang, Hongying
    Zhong, Zhangqi
    [J]. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2020, 27 (10) : 10506 - 10519