A comparative study on base classifiers in ensemble methods for credit scoring

被引:148
作者
Abelian, Joaquin [1 ]
Castellano, Javier G. [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
关键词
Credit scoring; Ensembles of classifiers; Base classifier; Decision trees; Imprecise Dirichlet model; Uncertainty measures; SUPPORT VECTOR MACHINE; BANKRUPTCY PREDICTION; NEURAL-NETWORKS; TREES; PROBABILITIES; PERFORMANCE; ALGORITHMS; ENTROPY;
D O I
10.1016/j.eswa.2016.12.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last years, the application of artificial intelligence methods on credit risk assessment has meant an improvement over classic methods. Small improvements in the systems about credit scoring and bankruptcy prediction can suppose great profits. Then, any improvement represents a high interest to banks and financial institutions. Recent works show that ensembles of classifiers achieve the better results for this kind of tasks. In this paper, it is extended a previous work about the selection of the best base classifier used in ensembles on credit data sets. It is shown that a very simple base classifier, based on imprecise probabilities and uncertainty measures, attains a better trade-off among some aspects of interest for this type of studies such as accuracy and area under ROC curve (AUC). The AUC measure can be considered as a more appropriate measure in this grounds, where the different type of errors have different costs or consequences. The results shown here present to this simple classifier as an interesting choice to be used as base classifier in ensembles for credit scoring and bankruptcy prediction, proving that not only the individual performance of a classifier is the key point to be selected for an ensemble scheme. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 64 条
[31]   ON THE RATIONALE OF MAXIMUM-ENTROPY METHODS [J].
JAYNES, ET .
PROCEEDINGS OF THE IEEE, 1982, 70 (09) :939-952
[32]  
Kearns M., 1988, ml class project
[33]  
Klir GJ., 2005, Uncertainty and information, DOI [10.1002/0471755575, DOI 10.1002/0471755575]
[34]   A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines [J].
Lee, TS ;
Chen, IF .
EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (04) :743-752
[35]   Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research [J].
Lessmann, Stefan ;
Baesens, Bart ;
Seow, Hsin-Vonn ;
Thomas, Lyn C. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 247 (01) :124-136
[36]  
LICHMAN M., 2013, UCI MACHINE LEARNING
[37]   The subprime credit crisis and contagion in financial markets [J].
Longstaff, Francis A. .
JOURNAL OF FINANCIAL ECONOMICS, 2010, 97 (03) :436-450
[38]  
Louzada F., 2016, ARXIV E PRINTS
[39]  
Makowski P., 1985, CREDIT WORLD, V75, P30
[40]   Credal-C4.5: Decision tree based on imprecise probabilities to classify noisy data [J].
Mantas, Carlos J. ;
Abellan, Joaquin .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (10) :4625-4637