A data-distribution based imbalanced data classification method for credit scoring using neural networks

被引:0
作者
Zhang, Dailing [1 ]
Xu, Wei [1 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
来源
2013 SIXTH INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING (BIFE) | 2014年
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
credit scroing; imbalanced data; data-distribution; classification;
D O I
10.1109/BIFE.2013.116
中图分类号
F [经济];
学科分类号
02 ;
摘要
Credit scoring is always a hot topic for the researchers because of its profitability. In this paper, we proposed a novel data-distribution based imbalanced data classification method to construct the credit scoring model using BP neural networks. The method distinguished itself by focusing on the distribution of the data and artificially changes the probabilities of the sampling for the purpose of centralizing the edge samples. The German Credit Dataset is applied for verifying the effectiveness of the method, and the experiment results show that the classifiers constructed by the proposed method performs better for the imbalanced credit data classification.
引用
收藏
页码:557 / 561
页数:5
相关论文
共 12 条
[1]   CREDIT SCORING, STATISTICAL TECHNIQUES AND EVALUATION CRITERIA: A REVIEW OF THE LITERATURE [J].
Abdou, Hussein A. ;
Pointon, John .
INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2011, 18 (2-3) :59-88
[2]   Classifying credit ratings for Asian banks using integrating feature selection and the CPDA-based rough sets approach [J].
Chen, You-Shyang .
KNOWLEDGE-BASED SYSTEMS, 2012, 26 :259-270
[3]   Multiple classifier architectures and their application to credit risk assessment [J].
Finlay, Steven .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, 210 (02) :368-378
[4]  
Hand DJ, 1997, J R STAT SOC A STAT, V160, P523
[5]   Learning from Imbalanced Data [J].
He, Haibo ;
Garcia, Edwardo A. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1263-1284
[6]   Credit rating analysis with support vector machines and neural networks: a market comparative study [J].
Huang, Z ;
Chen, HC ;
Hsu, CJ ;
Chen, WH ;
Wu, SS .
DECISION SUPPORT SYSTEMS, 2004, 37 (04) :543-558
[7]   A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines [J].
Lee, TS ;
Chen, IF .
EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (04) :743-752
[8]   Toward integrating feature selection algorithms for classification and clustering [J].
Liu, H ;
Yu, L .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) :491-502
[9]   Financial distress prediction based on serial combination of multiple classifiers [J].
Sun, Jie ;
Li, Hui .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :8659-8666
[10]   Two credit scoring models based on dual strategy ensemble trees [J].
Wang, Gang ;
Ma, Jian ;
Huang, Lihua ;
Xu, Kaiquan .
KNOWLEDGE-BASED SYSTEMS, 2012, 26 :61-68