A Novel Ensemble Credit Scoring Model Based on Extreme Learning Machine and Generalized Fuzzy Soft Sets

被引:8
作者
Xu, Dayu [1 ]
Zhang, Xuyao [2 ]
Hu, Junguo [1 ]
Chen, Jiahao [3 ]
机构
[1] Zhejiang A&F Univ, Coll Informat Engn, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang A&F Univ, Coll Econ & Management, Hangzhou, Zhejiang, Peoples R China
[3] Duke Univ, Fuqua Sch Business, Durham, NC 27706 USA
基金
中国国家自然科学基金;
关键词
FEATURE-SELECTION; GENETIC ALGORITHM; RISK-ASSESSMENT;
D O I
10.1155/2020/7504764
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper mainly discusses the hybrid application of ensemble learning, classification, and feature selection (FS) algorithms simultaneously based on training data balancing for helping the proposed credit scoring model perform more effectively, which comprises three major stages. Firstly, it conducts preprocessing for collected credit data. Then, an efficient feature selection algorithm based on adaptive elastic net is employed to reduce the weakly related or uncorrelated variables to get high-quality training data. Thirdly, a novel ensemble strategy is proposed to make the imbalanced training data set balanced for each extreme learning machine (ELM) classifier. Finally, a new weighting method for single ELM classifiers in the ensemble model is established with respect to their classification accuracy based on generalized fuzzy soft sets (GFSS) theory. A novel cosine-based distance measurement algorithm of GFSS is also proposed to calculate the weights of each ELM classifier. To confirm the efficiency of the proposed ensemble credit scoring model, we implemented experiments with real-world credit data sets for comparison. The process of analysis, outcomes, and mathematical tests proved that the proposed model is capable of improving the effectiveness of classification in average accuracy, area under the curve (AUC), H-measure, and Brier's score compared to all other single classifiers and ensemble approaches.
引用
收藏
页数:12
相关论文
共 45 条
[1]   Using neural network rule extraction and decision tables for credit-risk evaluation [J].
Baesens, B ;
Setiono, R ;
Mues, C ;
Vanthienen, J .
MANAGEMENT SCIENCE, 2003, 49 (03) :312-329
[2]   Cost-sensitive Feature Selection for Support Vector Machines [J].
Benitez-Pena, S. ;
Blanquero, R. ;
Carrizosa, E. ;
Ramirez-Cobo, P. .
COMPUTERS & OPERATIONS RESEARCH, 2019, 106 :169-178
[3]   Extreme learning machines for credit scoring: An empirical evaluation [J].
Beque, Artem ;
Lessmann, Stefan .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 86 :42-53
[4]  
Cui L., 2018, NEUROCOMPUTING, V336, P26, DOI [10.1016/j.neucom.2018.06.0812-s2.0-85056363172, DOI 10.1016/J.NEUCOM.2018.06.0812-S2.0-85056363172]
[5]   A feature selection enabled hybrid-bagging algorithm for credit risk evaluation [J].
Dahiya, Shashi ;
Handa, S. S. ;
Singh, N. P. .
EXPERT SYSTEMS, 2017, 34 (06)
[6]   Model combination for credit risk assessment: A stacked generalization approach [J].
Doumpos, Michael ;
Zopounidis, Constantin .
ANNALS OF OPERATIONS RESEARCH, 2007, 151 (01) :289-306
[7]   Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE [J].
Douzas, Georgios ;
Bacao, Fernando ;
Last, Felix .
INFORMATION SCIENCES, 2018, 465 :1-20
[8]   Self-Organizing Map Oversampling (SOMO) for imbalanced data set learning [J].
Douzas, Georgios ;
Bacao, Fernando .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 82 :40-52
[9]   Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending [J].
Emekter, Riza ;
Tu, Yanbin ;
Jirasakuldech, Benjamas ;
Lu, Min .
APPLIED ECONOMICS, 2015, 47 (01) :54-70
[10]   Enhancing PROMETHEE method with intuitionistic fuzzy soft sets [J].
Feng, Feng ;
Xu, Zeshui ;
Fujita, Hamido ;
Liang, Meiqi .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (07) :1071-1104