A new credit scoring method based on rough sets and decision tree

被引:0
作者
Zhou, XiYue [1 ]
Zhang, DeFu [1 ]
Jiang, Yi [1 ]
机构
[1] Xiamen Univ, Dept Comp Sci, Xiamen 361005, Peoples R China
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS | 2008年 / 5012卷
关键词
data mining; credit scoring; rough sets; decision tree; attribute reduction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit scoring is a very typical classification problem in Data Mining. Many classification methods have been presented in the literatures to tackle this problem. The decision tree method is a particularly effective method to build a classifier from the sample data. Decision tree classification method has higher prediction accuracy for the problems of classification, and can automatically generate classification rules. However, the original sample data sets used to generate the decision tree classification model often contain many noise or redundant data. These data will have a great impact on the prediction accuracy of the classifier. Therefore, it is necessary and very important to preprocess the original sample data. On this issue, a very effective approach is the rough sets. In rough sets theory, a basic problem that can be tackled using rough sets approach is reduction of redundant attributes. This paper presents a new credit scoring approach based on combination of rough sets theory and decision tree theory. The results of this study indicate that the process of reduction of attribute is very effective and our approach has good performance in terms of prediction accuracy.
引用
收藏
页码:1081 / 1089
页数:9
相关论文
共 14 条
[1]   A comparison of neural networks and linear scoring models in the credit union environment [J].
Desai, VS ;
Crook, JN ;
Overstreet, GA .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1996, 95 (01) :24-37
[2]  
HAMILTION AG, 1988, LOGIC MATH
[3]  
Henley W.E., 1995, THESIS OPEN U MILTON
[4]   A k-nearest-neighbour classifier for assessing consumer credit risk [J].
Henley, WE ;
Hand, DJ .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1996, 45 (01) :77-95
[5]  
Hu QH, 2007, LECT NOTES COMPUT SC, V4426, P96
[6]  
Huang C. L., 2006, EXPERT SYSTEMS APPL
[7]  
KANTARDZIC M, 2002, DATA MINING CONCEPT
[8]  
Koza JR, 1992, GENETIC PROGRAMMING
[9]  
Murphy PM, 2001, UCI REPOSITORY MACHI
[10]   ROUGH SETS [J].
PAWLAK, Z .
INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05) :341-356