Sample selection algorithms for credit risk modelling through data mining techniques

被引:7
作者
Protopapadakis, Eftychios [1 ]
Niklis, Dimitrios [2 ]
Doumpos, Michalis [3 ]
Doulamis, Anastasios [1 ]
Zopounidis, Constantin [3 ,4 ]
机构
[1] Natl Tech Univ Athens, Sch Rural & Surveying Engn, 9 Iroon Polytech Str, Zografos 15780, Greece
[2] Western Macedonia Univ Appl Sci, Dept Accounting & Finance, Koila Kozani 50100, Greece
[3] Tech Univ Crete, Sch Prod Engn & Management, Financial Engn Lab, Univ Campus, Khania 73100, Greece
[4] Audencia Business Sch, 8 Route Joneliere,BP 31222, F-44312 Nantes 3, France
关键词
credit risk modelling; data mining; sampling; classification; CLASSIFICATION ALGORITHMS; PREDICTION;
D O I
10.1504/IJDMMM.2019.098969
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit risk assessment is a very challenging and important problem in the domain of financial risk management. The development of reliable credit rating/scoring models is of paramount importance in this area. There are different algorithms and approaches for constructing such models to classify credit applicants (firms or individuals) into risk classes. Reliable sample selection is crucial for this task. The aim of this paper is to examine the effectiveness of sample selection schemes in combination with different classifiers for constructing reliable default prediction models. We consider different algorithms to select representative cases and handle class imbalances. Empirical results are reported for a dataset of Greek companies from the commercial sector.
引用
收藏
页码:103 / 128
页数:26
相关论文
共 51 条
[1]   CREDIT SCORING, STATISTICAL TECHNIQUES AND EVALUATION CRITERIA: A REVIEW OF THE LITERATURE [J].
Abdou, Hussein A. ;
Pointon, John .
INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2011, 18 (2-3) :59-88
[2]  
Ali A, 2015, Int J Adv Soft Comput Appl, V7, P176
[3]  
Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
[4]  
[Anonymous], 2007, P 30 ANN INT ACM SIG, DOI DOI 10.1145/1277741.1277927
[5]  
[Anonymous], 2012, IEEE T SYST MAN CY C, DOI DOI 10.1109/TSMCC.2011.2161285
[6]   Financial statement recasting and credit risk assessment [J].
Batta, George ;
Ganguly, Ananda ;
Rosett, Joshua .
ACCOUNTING AND FINANCE, 2014, 54 (01) :47-82
[7]   An experimental comparison of classification algorithms for imbalanced credit scoring data sets [J].
Brown, Iain ;
Mues, Christophe .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) :3446-3453
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]   SMOTEBoost: Improving prediction of the minority class in boosting [J].
Chawla, NV ;
Lazarevic, A ;
Hall, LO ;
Bowyer, KW .
KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2003, PROCEEDINGS, 2003, 2838 :107-119
[10]   Credit risk measurement and early warning of SMEs: An empirical study of listed SMEs in China [J].
Chen, Xiaohong ;
Wang, Xiaoding ;
Wu, Desheng Dash .
DECISION SUPPORT SYSTEMS, 2010, 49 (03) :301-310