A Novel Ensemble Credit Scoring Model Based on Extreme Learning Machine and Generalized Fuzzy Soft Sets

被引:8
作者
Xu, Dayu [1 ]
Zhang, Xuyao [2 ]
Hu, Junguo [1 ]
Chen, Jiahao [3 ]
机构
[1] Zhejiang A&F Univ, Coll Informat Engn, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang A&F Univ, Coll Econ & Management, Hangzhou, Zhejiang, Peoples R China
[3] Duke Univ, Fuqua Sch Business, Durham, NC 27706 USA
基金
中国国家自然科学基金;
关键词
FEATURE-SELECTION; GENETIC ALGORITHM; RISK-ASSESSMENT;
D O I
10.1155/2020/7504764
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper mainly discusses the hybrid application of ensemble learning, classification, and feature selection (FS) algorithms simultaneously based on training data balancing for helping the proposed credit scoring model perform more effectively, which comprises three major stages. Firstly, it conducts preprocessing for collected credit data. Then, an efficient feature selection algorithm based on adaptive elastic net is employed to reduce the weakly related or uncorrelated variables to get high-quality training data. Thirdly, a novel ensemble strategy is proposed to make the imbalanced training data set balanced for each extreme learning machine (ELM) classifier. Finally, a new weighting method for single ELM classifiers in the ensemble model is established with respect to their classification accuracy based on generalized fuzzy soft sets (GFSS) theory. A novel cosine-based distance measurement algorithm of GFSS is also proposed to calculate the weights of each ELM classifier. To confirm the efficiency of the proposed ensemble credit scoring model, we implemented experiments with real-world credit data sets for comparison. The process of analysis, outcomes, and mathematical tests proved that the proposed model is capable of improving the effectiveness of classification in average accuracy, area under the curve (AUC), H-measure, and Brier's score compared to all other single classifiers and ensemble approaches.
引用
收藏
页数:12
相关论文
共 45 条
[21]   Two new flavones glycosides with antimicrobial activities from Clerodendrum formicarum Gurke (Lamiaceae) [J].
Mahamat, Achi ;
Gbaweng, Abel Joel Yaya ;
Fotsing, Maurice Tagatsing ;
Talla, Emmanuel ;
Fekam, Fabrice Boyom ;
Henoumont, Celine ;
Sophie, Laurent ;
Mbafor, Joseph Tanyi .
NATURAL PRODUCT RESEARCH, 2021, 35 (06) :951-959
[22]  
Maji P. K., 2001, Journal of Fuzzy Mathematics, V9, P589
[23]   Generalised fuzzy soft sets [J].
Majumdar, Pinaki ;
Samanta, S. K. .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2010, 59 (04) :1425-1432
[24]   Two-level classifier ensembles for credit risk assessment [J].
Marques, A. I. ;
Garcia, V. ;
Sanchez, J. S. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) :10916-10922
[25]   Exploring the behaviour of base classifiers in credit scoring ensembles [J].
Marques, A. I. ;
Garcia, V. ;
Sanchez, J. S. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (11) :10244-10250
[26]   Soft set theory - First results [J].
Molodtsov, D .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1999, 37 (4-5) :19-31
[27]   Diversified Sensitivity-Based Undersampling for Imbalance Classification Problems [J].
Ng, Wing W. Y. ;
Hu, Junjie ;
Yeung, Daniel S. ;
Yin, Shaohua ;
Roli, Fabio .
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (11) :2402-2412
[28]   Genetic algorithm-based heuristic for feature selection in credit risk assessment [J].
Oreski, Stjepan ;
Oreski, Goran .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) :2052-2064
[29]   Two-stage consumer credit risk modelling using heterogeneous ensemble learning [J].
Papouskova, Monika ;
Hajek, Petr .
DECISION SUPPORT SYSTEMS, 2019, 118 :33-45
[30]   Algorithms for interval-valued fuzzy soft sets in emergency decision making based on WDBA and CODAS with new information measure [J].
Peng, Xindong ;
Garg, Harish .
COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 119 :439-452