Multi-class imbalanced enterprise credit evaluation based on asymmetric bagging combined with light gradient boosting machine

被引:23
|
作者
Sun, Jie [1 ]
Li, Jie [1 ]
Fujita, Hamido [2 ,3 ,4 ]
机构
[1] Tianjin Univ Finance & Econ, Sch Accounting, Tianjin 300222, Peoples R China
[2] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Univ Granada, Andalusian Res Inst Data Sci & Computat Intelligen, Granada, Spain
[4] Iwate Prefectural Univ, Fac Software & Informat Sci, Iwate, Japan
基金
中国国家自然科学基金;
关键词
Credit evaluation; Multi -class classification; Class imbalance; Asymmetric bagging; Light gradient boosting machine; SUPPORT VECTOR MACHINES; FINANCIAL DISTRESS; FEATURE-SELECTION; RANDOM SUBSPACE; LISTING STATUS; CLASSIFICATION; ENSEMBLE; PREDICTION; RATINGS; RATIOS;
D O I
10.1016/j.asoc.2022.109637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing research on multi-class imbalanced enterprise credit evaluation modeling has been built on data-level imbalance processing methods and single classifier approaches. Using the oneversus-one (OVO) decomposition and fusion method to dispose multi-class classification, this paper proposes two new credit evaluation ensemble models by combining the asymmetric bagging (AB) and the light gradient boosting machine (LightGBM). Based on a multi-class imbalanced dataset of Chinese enterprises that issue corporate bonds from 2014 through 2020, this study conducts a series of empirical experiments for multi-class imbalanced enterprise credit evaluation. The experimental results demonstrate that our proposed models can significantly outperform the benchmark models, which integrate the OVO and one-versus-all decomposition and fusion method respectively with the random under sampling LightGBM, random over sampling LightGBM, synthetic minority over sampling technique LightGBM and the AB decision tree. In addition, the proposed models can carry out analysis on feature importance, which provides decision-making basis for enterprise stakeholders. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 3 条
  • [1] Imbalanced enterprise credit evaluation with DTE-SBD: Decision tree ensemble based on SMOTE and bagging with differentiated sampling rates
    Sun, Jie
    Lang, Jie
    Fujita, Hamido
    Li, Hui
    INFORMATION SCIENCES, 2018, 425 : 76 - 91
  • [2] A novel method for predicting debris flow hazard: a multi-strategy fusion approach based on the light gradient boosting machine framework
    Wang, Tianlong
    Ge, Qi
    Ma, Tianxing
    Chen, Hao
    Luo, Rui
    Wang, Xu
    Zhang, Keying
    Chu, Zhaowei
    Ni, Xiaohui
    Sun, Hongyue
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2025,
  • [3] Value-Added Carp Products: Multi-Class Evaluation of Crisp Grass Carp by Machine Learning-Based Analysis of Blood Indexes
    Fu, Bing
    Kaneko, Gen
    Xie, Jun
    Li, Zhifei
    Tian, Jingjing
    Gong, Wangbao
    Zhang, Kai
    Xia, Yun
    Yu, Ermeng
    Wang, Guangjun
    FOODS, 2020, 9 (11)