Data Sample Selection Issues for Bankruptcy Prediction

Cited by: 14

Authors:

Tian, Shaonan [1]

Yu, Yan [2]

Zhou, Ming [3]

Affiliations:

[1] San Jose State Univ, Decis Sci, San Jose, CA 95192 USA

[2] Univ Cincinnati, Business Analyt, Cincinnati, OH 45221 USA

[3] San Jose State Univ, Operat & Supply Chain Management, San Jose, CA 95192 USA

Source:

RISK HAZARDS & CRISIS IN PUBLIC POLICY | 2015 / Vol. 6 / Issue 01

Keywords:

bankruptcy forecasting; binary classification; logistic regression; neural networks; support vector machines

DOI:

10.1002/rhc3.12071

Chinese Library Classification:

C93 [Management Science]; D035 [State Administration]; D523 [Administrative Management]; D63 [State Administration]

Discipline classification codes:

12; 1201; 1202; 120202; 1204; 120401

Abstract:

Bankruptcy prediction is of paramount interest to both academics and practitioners. This paper devotes special care to an important aspect of bankruptcy prediction modeling: data sample selection. To investigate the effect of different data selection methods, three models are adopted: logistic regression, Neural Networks (NNET), and Support Vector Machines (SVM), the latter two of which have recently gained some popularity in applications. A Monte Carlo simulation study and an empirical analysis on an updated bankruptcy database are conducted to explore the effect of different data sample selection methods. By comparing out-of-sample predictive performance, we conclude that if forecasting the probability of bankruptcy is of interest, the complete data sampling technique provides more accurate results. However, if a binary bankruptcy decision or classification is desired, the choice-based sampling technique may still be suitable. In particular, NNET and SVM fitted on choice-based data samples correctly predict more bankruptcy observations and yield a lower asymmetric misclassification rate. In addition, the cut-off probability must be adjusted for each choice-based data sample. An appropriate choice of cut-off probability depends on the specification of the cost ratio between the Type I error and the Type II error. The optimal cut-off probability proposed in this work is a function of the data sample selection method and the cost ratio.
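The dependence of the cut-off probability on the cost ratio and on the sampling scheme can be illustrated with a minimal sketch. This is not the formula proposed in the paper, which the abstract does not state; it is the standard expected-cost decision rule combined with the usual odds correction for choice-based (case-control) samples. The function names are hypothetical, and the sketch assumes that a Type I error means classifying a bankrupt firm as healthy and that the cost ratio is cost(Type I) / cost(Type II).

```python
def cost_cutoff(cost_ratio: float) -> float:
    """Cut-off on the population probability scale.

    Assumption: cost_ratio = cost(Type I) / cost(Type II), where a Type I
    error is a missed bankruptcy. Flagging a firm is cheaper in expectation
    when p * cost_I > (1 - p) * cost_II, i.e. when p > 1 / (1 + cost_ratio).
    """
    if cost_ratio <= 0:
        raise ValueError("cost_ratio must be positive")
    return 1.0 / (1.0 + cost_ratio)


def _check_rate(value: float, name: str) -> None:
    if not 0.0 < value < 1.0:
        raise ValueError(f"{name} must lie strictly between 0 and 1")


def to_population_prob(p_sample: float, pop_rate: float, sample_rate: float) -> float:
    """Map a probability fitted on a choice-based sample to the population scale.

    pop_rate is the bankruptcy share in the population and sample_rate the
    share in the estimation sample. Oversampling bankruptcies inflates the
    fitted odds by (sample odds / population odds), so that factor is undone.
    """
    for value, name in ((p_sample, "p_sample"), (pop_rate, "pop_rate"),
                        (sample_rate, "sample_rate")):
        _check_rate(value, name)
    odds_sample = p_sample / (1.0 - p_sample)
    inflation = (sample_rate / (1.0 - sample_rate)) / (pop_rate / (1.0 - pop_rate))
    odds_pop = odds_sample / inflation
    return odds_pop / (1.0 + odds_pop)


def sample_cutoff(cost_ratio: float, pop_rate: float, sample_rate: float) -> float:
    """Equivalent cut-off applied directly to sample-scale probabilities."""
    _check_rate(pop_rate, "pop_rate")
    _check_rate(sample_rate, "sample_rate")
    base = cost_cutoff(cost_ratio)
    # Population-scale threshold odds are 1 / cost_ratio; inflate them by
    # the same factor that the choice-based sample applies to fitted odds.
    odds_threshold = base / (1.0 - base)
    inflation = (sample_rate / (1.0 - sample_rate)) / (pop_rate / (1.0 - pop_rate))
    k = odds_threshold * inflation
    return k / (1.0 + k)


if __name__ == "__main__":
    # Illustrative numbers only: 1% bankruptcies in the population,
    # a balanced (50/50) matched sample, and several cost ratios.
    for ratio in (1.0, 10.0, 50.0):
        print(ratio, cost_cutoff(ratio), sample_cutoff(ratio, 0.01, 0.5))
```

Under these assumptions, a balanced sample drawn from a population with few bankruptcies calls for a much higher cut-off than 0.5 when the two error costs are equal, and the cut-off falls as a missed bankruptcy becomes relatively more expensive.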


Pages: 91-116

Page count: 26
