Genetic algorithm-based heuristic for feature selection in credit risk assessment

Cited by: 323
Authors
Oreski, Stjepan [1]
Oreski, Goran [1]
Affiliation
[1] Bank Karlovac, Karlovac 47000, Croatia
Keywords
Artificial intelligence; Genetic algorithms; Classification; Credit risk assessment; Incremental feature selection; Neural network
DOI
10.1016/j.eswa.2013.09.004
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a novel heuristic algorithm, the hybrid genetic algorithm with neural networks (HGA-NN), which is used to identify an optimum feature subset and to increase classification accuracy and scalability in credit risk assessment. The algorithm rests on the following basic hypothesis: the high-dimensional input feature space can be preliminarily restricted to only the important features. This preliminary restriction uses fast feature-ranking algorithms and earlier experience. Additionally, enhancements are made in the creation of the initial population and by introducing an incremental stage in the genetic algorithm. The performance of the proposed HGA-NN classifier is evaluated on a real-world credit dataset collected at a Croatian bank, and the findings are further validated on another real-world credit dataset selected from the UCI database. The classification accuracy is compared with that reported in the literature. The experimental results achieved with the proposed HGA-NN classifier are promising for feature selection and classification in retail credit risk assessment and indicate that the classifier is a useful addition to existing data mining techniques. © 2013 Elsevier Ltd. All rights reserved.
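The general idea described in the abstract (restrict the feature space with a fast ranking, then search feature subsets with a genetic algorithm whose fitness is classifier accuracy) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' HGA-NN: the data are synthetic, a nearest-centroid classifier stands in for the neural network, the ranking score is a simple class-mean difference, the incremental stage is omitted, and all function names and parameter values are hypothetical.

```python
import random

def make_data(n=200, n_features=12, n_informative=3, seed=0):
    """Synthetic stand-in for a credit dataset; only the first features carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = rng.randint(0, 1)
        X.append([rng.gauss(1.5 * label if j < n_informative else 0.0, 1.0)
                  for j in range(n_features)])
        y.append(label)
    return X, y

def rank_features(X, y):
    """Fast filter ranking (assumed score): absolute difference of class means."""
    scores = []
    for j in range(len(X[0])):
        pos = [r[j] for r, t in zip(X, y) if t == 1]
        neg = [r[j] for r, t in zip(X, y) if t == 0]
        scores.append(abs(sum(pos) / len(pos) - sum(neg) / len(neg)))
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)

def accuracy(mask, feats, train, valid):
    """Fitness: validation accuracy of a nearest-centroid classifier
    (a simple stand-in for the neural network in the paper)."""
    cols = [f for f, bit in zip(feats, mask) if bit]
    if not cols:
        return 0.0
    (Xtr, ytr), (Xva, yva) = train, valid
    cent = {}
    for c in (0, 1):
        rows = [r for r, t in zip(Xtr, ytr) if t == c]
        cent[c] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    hits = 0
    for r, t in zip(Xva, yva):
        d = {c: sum((r[j] - m) ** 2 for j, m in zip(cols, cent[c])) for c in (0, 1)}
        hits += (min(d, key=d.get) == t)
    return hits / len(yva)

def ga_select(X, y, keep=6, pop_size=20, generations=15, seed=1):
    """Genetic search over bit masks on the top-`keep` ranked features."""
    rng = random.Random(seed)
    half = len(X) // 2
    train, valid = (X[:half], y[:half]), (X[half:], y[half:])
    feats = rank_features(*train)[:keep]  # preliminary restriction of the search space

    def fit(mask):
        return accuracy(mask, feats, train, valid)

    # Initial population: one seeded individual with all retained features, rest random.
    pop = [[1] * keep] + [[rng.randint(0, 1) for _ in range(keep)]
                          for _ in range(pop_size - 1)]
    best = max(pop, key=fit)
    for _ in range(generations):
        def pick():
            # Binary tournament selection.
            a, b = rng.sample(pop, 2)
            return a if fit(a) >= fit(b) else b
        children = [best[:]]  # elitism: carry the best individual over unchanged
        while len(children) < pop_size:
            p1, p2 = pick(), pick()
            cut = rng.randrange(1, keep)          # one-point crossover
            child = p1[:cut] + p2[cut:]
            children.append([bit ^ (rng.random() < 0.05) for bit in child])  # bit-flip mutation
        pop = children
        best = max(pop, key=fit)
    return sorted(f for f, bit in zip(feats, best) if bit), fit(best)

if __name__ == "__main__":
    selected, acc = ga_select(*make_data())
    print("selected feature indices:", selected, "validation accuracy:", acc)
```

Because the same validation split drives the search and produces the reported score, the accuracy printed here is optimistic; a real evaluation would hold out a separate test set or use cross-validation.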
Pages: 2052-2064
Page count: 13