A Novel Ensemble Credit Scoring Model Based on Extreme Learning Machine and Generalized Fuzzy Soft Sets

被引:6
作者
Xu, Dayu [1 ]
Zhang, Xuyao [2 ]
Hu, Junguo [1 ]
Chen, Jiahao [3 ]
机构
[1] Zhejiang A&F Univ, Coll Informat Engn, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang A&F Univ, Coll Econ & Management, Hangzhou, Zhejiang, Peoples R China
[3] Duke Univ, Fuqua Sch Business, Durham, NC 27706 USA
基金
中国国家自然科学基金;
关键词
FEATURE-SELECTION; GENETIC ALGORITHM; RISK-ASSESSMENT;
D O I
10.1155/2020/7504764
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper mainly discusses the hybrid application of ensemble learning, classification, and feature selection (FS) algorithms simultaneously based on training data balancing for helping the proposed credit scoring model perform more effectively, which comprises three major stages. Firstly, it conducts preprocessing for collected credit data. Then, an efficient feature selection algorithm based on adaptive elastic net is employed to reduce the weakly related or uncorrelated variables to get high-quality training data. Thirdly, a novel ensemble strategy is proposed to make the imbalanced training data set balanced for each extreme learning machine (ELM) classifier. Finally, a new weighting method for single ELM classifiers in the ensemble model is established with respect to their classification accuracy based on generalized fuzzy soft sets (GFSS) theory. A novel cosine-based distance measurement algorithm of GFSS is also proposed to calculate the weights of each ELM classifier. To confirm the efficiency of the proposed ensemble credit scoring model, we implemented experiments with real-world credit data sets for comparison. The process of analysis, outcomes, and mathematical tests proved that the proposed model is capable of improving the effectiveness of classification in average accuracy, area under the curve (AUC), H-measure, and Brier's score compared to all other single classifiers and ensemble approaches.
引用
收藏
页数:12
相关论文
共 45 条
  • [1] Using neural network rule extraction and decision tables for credit-risk evaluation
    Baesens, B
    Setiono, R
    Mues, C
    Vanthienen, J
    [J]. MANAGEMENT SCIENCE, 2003, 49 (03) : 312 - 329
  • [2] Cost-sensitive Feature Selection for Support Vector Machines
    Benitez-Pena, S.
    Blanquero, R.
    Carrizosa, E.
    Ramirez-Cobo, P.
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2019, 106 : 169 - 178
  • [3] Extreme learning machines for credit scoring: An empirical evaluation
    Beque, Artem
    Lessmann, Stefan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 86 : 42 - 53
  • [4] Cui L., 2018, NEUROCOMPUTING, V336, P26, DOI [10.1016/j.neucom.2018.06.0812-s2.0-85056363172, DOI 10.1016/J.NEUCOM.2018.06.0812-S2.0-85056363172]
  • [5] A feature selection enabled hybrid-bagging algorithm for credit risk evaluation
    Dahiya, Shashi
    Handa, S. S.
    Singh, N. P.
    [J]. EXPERT SYSTEMS, 2017, 34 (06)
  • [6] Model combination for credit risk assessment: A stacked generalization approach
    Doumpos, Michael
    Zopounidis, Constantin
    [J]. ANNALS OF OPERATIONS RESEARCH, 2007, 151 (01) : 289 - 306
  • [7] Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE
    Douzas, Georgios
    Bacao, Fernando
    Last, Felix
    [J]. INFORMATION SCIENCES, 2018, 465 : 1 - 20
  • [8] Self-Organizing Map Oversampling (SOMO) for imbalanced data set learning
    Douzas, Georgios
    Bacao, Fernando
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 82 : 40 - 52
  • [9] Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending
    Emekter, Riza
    Tu, Yanbin
    Jirasakuldech, Benjamas
    Lu, Min
    [J]. APPLIED ECONOMICS, 2015, 47 (01) : 54 - 70
  • [10] Enhancing PROMETHEE method with intuitionistic fuzzy soft sets
    Feng, Feng
    Xu, Zeshui
    Fujita, Hamido
    Liang, Meiqi
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (07) : 1071 - 1104