Improving financial bankruptcy prediction in a highly imbalanced class distribution using oversampling and ensemble learning: a case from the Spanish market

被引:55
作者
Faris, Hossam [1 ]
Abukhurma, Ruba [1 ]
Almanaseer, Waref [1 ]
Saadeh, Mohammed [1 ]
Mora, Antonio M. [2 ,3 ]
Castillo, Pedro A. [3 ,4 ]
Aljarah, Ibrahim [1 ]
机构
[1] Univ Jordan, King Abdullah II Sch Informat Technol, Amman, Jordan
[2] Univ Granada, ETSIIT, Dept Signal Theory Telemat & Commun, Granada, Spain
[3] Univ Granada, CITIC, Granada, Spain
[4] Univ Granada, ETSIIT, Dept Comp Architecture & Technol, Granada, Spain
关键词
Financial distress; Prediction; Ensemble learning; Financial crisis; FEATURE-SELECTION; CORPORATE BANKRUPTCY; ROTATION FOREST; RATIOS; MACHINE; ALGORITHM; MODELS; SMOTE; SETS;
D O I
10.1007/s13748-019-00197-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bankruptcy is one of the most critical financial problems that reflects the company's failure. From a machine learning perspective, the problem of bankruptcy prediction is considered a challenging one mainly because of the highly imbalanced distribution of the classes in the datasets. Therefore, developing an efficient prediction model that is able to detect the risky situation of a company is a challenging and complex task. To tackle this problem, in this paper, we propose a hybrid approach that combines the synthetic minority oversampling technique with ensemble methods. Moreover, we apply five different feature selection methods to find out what are the most dominant attributes on bankruptcy prediction. The proposed approach is evaluated based on a real dataset collected from Spanish companies. The conducted experiments show promising results, which prove that the proposed approach can be used as an efficient alternative in case of highly imbalanced datasets.
引用
收藏
页码:31 / 53
页数:23
相关论文
共 87 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]  
Ahmadi F., 2012, INT J COMPUTER APPL, V44, P34
[3]  
Alejo R., 2013, Advances in Intelligent Systems and Computing, P1, DOI [DOI 10.1007/978-3-319-00569-0_1, DOI 10.1007/978-3-319-00569-01]
[4]   Comparing multiobjective evolutionary ensembles for minimizing type I and II errors for bankruptcy prediction [J].
Alfaro-Cid, E. ;
Castillo, P. A. ;
Esparcia, A. ;
Sharman, K. ;
Merelo, J. J. ;
Prieto, A. ;
Mora, A. M. ;
Laredo, J. L. J. .
2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, :2902-+
[5]   Feature Selection Using Information Gain for Improved Structural-Based Alert Correlation [J].
Alhaj, Taqwa Ahmed ;
Siraj, Maheyzah Md ;
Zainal, Anazida ;
Elshoush, Huwaida Tagelsir ;
Elhaj, Fatin .
PLOS ONE, 2016, 11 (11)
[6]   FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND PREDICTION OF CORPORATE BANKRUPTCY [J].
ALTMAN, EI .
JOURNAL OF FINANCE, 1968, 23 (04) :589-609
[7]  
[Anonymous], 2000, Proceedings of the Seventeenth International Conferenceon Machine Learning, DOI DOI 10.5555/645529.657793
[8]  
[Anonymous], 1997, Icml
[9]  
Aoki S, 2004, APPLICATION OF ECONOPHYSICS, PROCEEDINGS, P299
[10]   Predicting corporate bankruptcy: where we stand? [J].
Aziz, M. Adnan ;
Dar, Humayon A. .
CORPORATE GOVERNANCE-THE INTERNATIONAL JOURNAL OF BUSINESS IN SOCIETY, 2006, 6 (01) :18-+