A Novel Ensemble Credit Scoring Model Based on Extreme Learning Machine and Generalized Fuzzy Soft Sets

Citations: 6
Authors
Xu, Dayu [1]
Zhang, Xuyao [2]
Hu, Junguo [1]
Chen, Jiahao [3]
Affiliations
[1] Zhejiang A&F Univ, Coll Informat Engn, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang A&F Univ, Coll Econ & Management, Hangzhou, Zhejiang, Peoples R China
[3] Duke Univ, Fuqua Sch Business, Durham, NC 27706 USA
Funding
National Natural Science Foundation of China;
Keywords
FEATURE-SELECTION; GENETIC ALGORITHM; RISK-ASSESSMENT;
DOI
10.1155/2020/7504764
Chinese Library Classification
T [Industrial Technology];
Subject classification code
08;
Abstract
This paper proposes a credit scoring model that combines ensemble learning, classification, and feature selection (FS) on balanced training data. The model proceeds in stages. First, the collected credit data are preprocessed. Second, a feature selection algorithm based on the adaptive elastic net removes weakly related or uncorrelated variables to yield high-quality training data. Third, a novel ensemble strategy balances the imbalanced training set for each extreme learning machine (ELM) classifier. Finally, a new weighting method, based on generalized fuzzy soft sets (GFSS) theory, weights the individual ELM classifiers in the ensemble according to their classification accuracy. A novel cosine-based distance measure for GFSS is also proposed to calculate the weight of each ELM classifier. To assess the proposed ensemble credit scoring model, we ran comparative experiments on real-world credit data sets. The analysis, results, and statistical tests indicate that the proposed model improves average accuracy, area under the curve (AUC), H-measure, and Brier score compared with all other single classifiers and ensemble approaches tested.
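The balancing and weighting stages described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: each ELM is a random tanh hidden layer with least-squares output weights, balancing is done by random undersampling of the majority class, and the classifier weights are simply proportional to training accuracy, which stands in for the GFSS cosine-based weighting (not reproduced here). Feature selection is omitted, and all function names and parameter values are hypothetical.

```python
import numpy as np

def train_elm(X, y, n_hidden, rng):
    """Fit one ELM: fixed random hidden layer, least-squares output weights."""
    W = rng.normal(size=(X.shape[1], n_hidden))
    b = rng.normal(size=n_hidden)
    H = np.tanh(X @ W + b)
    beta = np.linalg.pinv(H) @ y          # Moore-Penrose solution
    return W, b, beta

def elm_score(model, X):
    """Score in [0, 1]; higher means more likely class 1."""
    W, b, beta = model
    return np.clip(np.tanh(X @ W + b) @ beta, 0.0, 1.0)

def balanced_subset(y, rng):
    """All minority samples plus an equal-sized random draw from the majority."""
    pos = np.flatnonzero(y == 1)
    neg = np.flatnonzero(y == 0)
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    drawn = rng.choice(majority, size=len(minority), replace=False)
    return np.concatenate([minority, drawn])

def fit_ensemble(X, y, n_models=5, n_hidden=20, seed=0):
    """Train each ELM on its own balanced subset and weight it by accuracy."""
    rng = np.random.default_rng(seed)
    models, accs = [], []
    for _ in range(n_models):
        idx = balanced_subset(y, rng)
        model = train_elm(X[idx], y[idx], n_hidden, rng)
        acc = np.mean((elm_score(model, X) >= 0.5) == (y == 1))
        models.append(model)
        accs.append(acc)
    accs = np.asarray(accs, dtype=float)
    # Assumption: accuracy-proportional weights replace the GFSS-based weights.
    if accs.sum() > 0:
        weights = accs / accs.sum()
    else:
        weights = np.full(n_models, 1.0 / n_models)
    return models, weights

def predict_ensemble(models, weights, X):
    """Weighted average of the individual ELM scores."""
    scores = np.stack([elm_score(m, X) for m in models])
    return weights @ scores

def brier_score(y, p):
    """Mean squared difference between predicted score and 0/1 label."""
    return float(np.mean((p - y) ** 2))
```

Drawing a fresh balanced subset per classifier gives each ELM different majority-class examples, which is one simple way to obtain diversity across the ensemble; other balancing schemes would fit the same structure.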
Pages: 12