A comparative assessment of ensemble learning for credit scoring

被引:350
作者
Wang, Gang [1 ,2 ]
Hao, Jinxing [1 ,3 ]
Ma, Jian [1 ]
Jiang, Hongbing [1 ]
机构
[1] City Univ Hong Kong, Dept Informat Syst, Kowloon, Hong Kong, Peoples R China
[2] Hefei Univ Technol, Sch Management, Hefei 230009, Anhui, Peoples R China
[3] BeiHang Univ, Sch Econ & Management, Beijing 100083, Peoples R China
关键词
Credit scoring; Ensemble learning; Bagging; Boosting; Stacking; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; MODELS; RISK;
D O I
10.1016/j.eswa.2010.06.048
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Both statistical techniques and Artificial Intelligence (Al) techniques have been explored for credit scoring, an important finance activity. Although there are no consistent conclusions on which ones are better, recent studies suggest combining multiple classifiers, i.e., ensemble learning, may have a better performance. In this study, we conduct a comparative assessment of the performance of three popular ensemble methods, i.e., Bagging. Boosting, and Stacking, based on four base learners, i.e., Logistic Regression Analysis (LRA), Decision Tree (DT), Artificial Neural Network (ANN) and Support Vector Machine (SVM). Experimental results reveal that the three ensemble methods can substantially improve individual base learners. In particular, Bagging performs better than Boosting across all credit datasets. Stacking and Bagging DT in our experiments, get the best performance in terms of average accuracy, type I error and type II error. (c) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:223 / 230
页数:8
相关论文
共 28 条
[1]  
[Anonymous], 2007, Uci machine learning repository
[2]   Benchmarking state-of-the-art classification algorithms for credit scoring [J].
Baesens, B ;
Van Gestel, T ;
Viaene, S ;
Stepanova, M ;
Suykens, J ;
Vanthienen, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) :627-635
[3]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[4]  
Buta P., 1994, AI Expert, V9, P34
[5]   COMPOSITE CLASSIFIER SYSTEM-DESIGN - CONCEPTS AND METHODOLOGY [J].
DASARATHY, BV ;
SHEELA, BV .
PROCEEDINGS OF THE IEEE, 1979, 67 (05) :708-713
[6]   A comparison of neural networks and linear scoring models in the credit union environment [J].
Desai, VS ;
Crook, JN ;
Overstreet, GA .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1996, 95 (01) :24-37
[7]  
Dietterich TG, 1997, AI MAG, V18, P97
[8]  
Freund Y., 1996, INT C MACH LEARN ICM, V6, P148, DOI DOI 10.5555/3091696.3091715
[9]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67
[10]  
Hand DJ, 1997, J R STAT SOC A STAT, V160, P523