Principal Component Analysis and ReliefF Cascaded with Decision Tree for Credit Scoring

被引:12
作者
Damrongsakmethee, Thitimanan [1 ]
Neagoe, Victor-Emil [1 ]
机构
[1] Univ Politehn Bucuresti, Dept Appl Elect & Informat Engn, Fac Elect Telecommun & Informat Technol, Bucharest, Romania
来源
ARTIFICIAL INTELLIGENCE METHODS IN INTELLIGENT ALGORITHMS | 2019年 / 985卷
关键词
Feature selection; Credit scoring; PCA; ReliefF; Decision tree; SVM;
D O I
10.1007/978-3-030-19810-7_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of this paper is to propose a credit scoring approval model using a feature selection technique performed by Principal Component Analysis (PCA) and ReliefF algorithm followed by a decision tree classifier. As a reference classifier, we have chosen Support Vector Machine (SVM). The performance of our proposed model has been tested using the German credit dataset. The experimental results of the proposed signal processing cascade for the credit scoring lead to the best accuracy of 91.67%, while classifiers without feature selection show the best accuracy of only 75.35%. On the other side, using the same combination of feature selection (PCA and ReliefF) but cascaded with SVM classifier, one has obtained an accuracy of only 85.15%. The experimental results confirm the accuracy of the proposed model, and at the same time they show the importance of feature selection and its optimization for credit scoring decision systems.
引用
收藏
页码:85 / 95
页数:11
相关论文
共 20 条
[1]  
Abdelmoula A.K., 2015, J ACCOUNT MANAG INF, V14, P79
[2]   A Weighted Feature Selection Method for Instance-Based Classification [J].
Agre, Gennady ;
Dzhondzhorov, Anton .
ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2016, 2016, 9883 :14-25
[3]  
Anaei S.M., 2016, INT J MOD TRENDS ENG, V3, P123
[4]   Probabilistic modeling and visualization for bankruptcy prediction [J].
Antunes, Francisco ;
Ribeiro, Bernardete ;
Pereira, Francisco .
APPLIED SOFT COMPUTING, 2017, 60 :831-843
[5]   Implementing ReliefF filters to extract meaningful features from genetic lifetime datasets [J].
Beretta, Lorenzo ;
Santaniello, Alessandro .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (02) :361-369
[6]  
Browne D., 2016, CREDIT SCORING FEATU
[7]  
Damrongsakmethee T., 2017, INDIAN J SCI TECHNOL, V10, P1, DOI DOI 10.17485/ijst/2017/v10i39/119861
[8]   Predicting Financial Distress: A Comparison of Survival Analysis and Decision Tree Techniques [J].
Gepp, Adrian ;
Kumar, Kuldeep .
ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 :396-404
[9]  
Gupta A., 2015, INT J SCI TECHNOLOGY, V4, P85
[10]  
Ha V.-S., 2016, 7 INT C MECH IND MAN, P1