Effects of classification, feature selection, and resampling methods on bankruptcy prediction of small and medium-sized enterprises

被引:12
作者
Papikova, Lenka [1 ]
Papik, Mario [1 ]
机构
[1] Comenius Univ, Fac Management, Dept Econ & Finance, Odbojarov 10,POB 95, Bratislava 82005, Slovakia
关键词
bankruptcy prediction; data mining; feature selection; resampling; small and medium-sized enterprises; FINANCIAL DISTRESS; FAILURE PREDICTION; GENETIC ALGORITHM; LISTED COMPANIES; ROUGH SET; RATIOS; ENSEMBLE; CLASSIFIERS; SENTIMENT; WRAPPER;
D O I
10.1002/isaf.1521
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Small and medium-sized enterprises are the pillars of an economy, and their poor performance has a negative impact on living standards of population and country development. This study analyzes real-life data of 89,851 small and medium-sized enterprises, out of which 295 have declared bankruptcy. The analysis is performed via 27 financial ratios. The study framework combines seven classifications and three resampling and seven feature selection methods. Out of all classification methods applied, CatBoost has achieved the best results for all combinations of resampling and feature selection methods. CatBoost surpassed the results of other classification methods for the area under curve parameter, achieving a value of 99.95%. The application of resampling methods on different classification models has not identified a statistically significant level of improvement in any of the resampling methods. This finding has also been observed for feature selection methods. Based on these findings, we assume that individual resampling and feature selection methods do not improve model performance compared with the original imbalanced sample's results. Our results suggest that, even though the data sample may be significantly imbalanced with a minority of bankrupt companies, most classification algorithms can handle this imbalance and achieve interesting results. Moreover, our findings provide broad practical application for all stakeholders who could need to detect bankrupting companies.
引用
收藏
页码:254 / 281
页数:28
相关论文
共 83 条
[1]   Bankruptcy forecasting:: An empirical comparison of AdaBoost and neural networks [J].
Alfaro, Esteban ;
Garcia, Noelia ;
Gamez, Matias ;
Elizondo, David .
DECISION SUPPORT SYSTEMS, 2008, 45 (01) :110-122
[2]   Financial and nonfinancial variables as long-horizon predictors of bankruptcy [J].
Altman, Edward I. ;
Iwanicz-Drozdowska, Malgorzata ;
Laitinen, Erkki K. ;
Suvas, Arto .
JOURNAL OF CREDIT RISK, 2016, 12 (04) :49-78
[3]   FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND PREDICTION OF CORPORATE BANKRUPTCY [J].
ALTMAN, EI .
JOURNAL OF FINANCE, 1968, 23 (04) :589-609
[4]   Conventional and neural network target-matching methods dynamics: The information technology mergers and acquisitions market in the USA [J].
Anagnostopoulos, Ioannis ;
Rizeq, Anas .
INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2021, 28 (02) :97-118
[5]   Large-Scale Machine Learning for Business Sector Prediction [J].
Angenent, Mitch N. ;
Barata, Antonio Pereira ;
Takes, Frank W. .
PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, :1143-1146
[6]  
[Anonymous], 2019, World Economic Outlook Database
[7]   Probabilistic modeling and visualization for bankruptcy prediction [J].
Antunes, Francisco ;
Ribeiro, Bernardete ;
Pereira, Francisco .
APPLIED SOFT COMPUTING, 2017, 60 :831-843
[8]   FINANCIAL RATIOS AS PREDICTORS OF FAILURE [J].
BEAVER, WH .
JOURNAL OF ACCOUNTING RESEARCH, 1966, 4 :71-111
[9]   Bankruptcy Prediction using the XGBoost Algorithm and Variable Importance Feature Engineering [J].
Ben Jabeur, Sami ;
Stef, Nicolae ;
Carmona, Pedro .
COMPUTATIONAL ECONOMICS, 2023, 61 (02) :715-741
[10]   CatBoost model and artificial intelligence techniques for corporate failure prediction [J].
Ben Jabeur, Sami ;
Gharib, Cheima ;
Mefteh-Wali, Salma ;
Ben Arfi, Wissal .
TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2021, 166