Credit scoring by feature-weighted support vector machines

被引:16
作者
Shi, Jian [1 ,2 ]
Zhang, Shu-you [1 ]
Qiu, Le-miao [1 ]
机构
[1] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou 310027, Zhejiang, Peoples R China
[2] Changshu Inst Technol, Sch Elect & Automat Engn, Changshu 215500, Peoples R China
来源
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS | 2013年 / 14卷 / 03期
基金
中国国家自然科学基金;
关键词
Credit scoring model; Support vector machine (SVM); Feature weight; Random forest; CLASSIFICATION; SELECTION;
D O I
10.1631/jzus.C1200205
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent finance and debt crises have made credit risk management one of the most important issues in financial research. Reliable credit scoring models are crucial for financial agencies to evaluate credit applications and have been widely studied in the field of machine learning and statistics. In this paper, a novel feature-weighted support vector machine (SVM) credit scoring model is presented for credit risk assessment, in which an F-score is adopted for feature importance ranking. Considering the mutual interaction among modeling features, random forest is further introduced for relative feature importance measurement. These two feature-weighted versions of SVM are tested against the traditional SVM on two real-world datasets and the research results reveal the validity of the proposed method.
引用
收藏
页码:197 / 204
页数:8
相关论文
共 19 条
[1]   Empirical characterization of random forest variable importance measures [J].
Archer, Kelfie J. ;
Kirnes, Ryan V. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (04) :2249-2260
[2]   Benchmarking state-of-the-art classification algorithms for credit scoring [J].
Baesens, B ;
Van Gestel, T ;
Viaene, S ;
Stepanova, M ;
Suykens, J ;
Vanthienen, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) :627-635
[3]   Support vector machines for credit scoring and discovery of significant features [J].
Bellotti, Tony ;
Crook, Jonathan .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3302-3308
[4]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Chen YW, 2006, STUD FUZZ SOFT COMP, V207, P315
[7]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[8]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422
[9]   Credit scoring with a data mining approach based on support vector machines [J].
Huang, Cheng-Lung ;
Chen, Mu-Chen ;
Wang, Chieh-Jen .
EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (04) :847-856
[10]   Comprehensible credit scoring models using rule extraction from support vector machines [J].
Martens, David ;
Baesens, Bart ;
Van Gestel, Tony ;
Vanthienen, Jan .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 183 (03) :1466-1476