An enforced support vector machine model for construction contractor default prediction

被引:55
作者
Tserng, H. Ping [2 ]
Lin, Gwo-Fong [2 ]
Tsai, L. Ken [1 ]
Chen, Po-Cheng [2 ]
机构
[1] Natl Council Struct Engn Assoc, Taipei 11070, Taiwan
[2] Natl Taiwan Univ, Dept Civil Engn, Taipei 10764, Taiwan
关键词
Contractor analysis; Default prediction; Support vector machine; FINANCIAL DISTRESS PREDICTION; BANKRUPTCY PREDICTION; NEURAL-NETWORKS; DISCRIMINANT-ANALYSIS; COMPANY FAILURE; RATIOS; PERFORMANCE; RISK;
D O I
10.1016/j.autcon.2011.05.007
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The financial health of construction contractors is critical in successfully completing a project, and thus default prediction is highly concerned by owners and other stakeholders. In other industries many previous studies employ support vector machine (SVM) or other Artificial Neural Networks (ANN) methods for corporate default prediction using the sample-matching method, which produces sample selection biases. In order to avoid the sample selection biases, this paper used all available firm-years samples during the sample period. Yet this brings a new challenge: the number of non-defaulted samples greatly exceeds the defaulted samples, which is referred to as between-class imbalance. Although the SVM algorithm is a powerful learning process, it cannot always be applied to data with extreme distribution characteristics. This paper proposes an enforced support vector machine-based model (ESVM model) for the default prediction in the construction industry, using all available firm-years data in our sample period to solve the between-class imbalance. The traditional logistic regression model is provided as a benchmark to evaluate the forecasting ability of the ESVM model. All financial variables related to the prediction of contractor default risk as well as 7 variables selected by the Multivariate Discriminant Analysis (MDA) stepwise method are put in the models for comparison. The empirical results of this paper show that the ESVM model always outperforms the logistic regression model, and is more convenient to use because it is relatively independent of the selection of variables. Thus, we recommend the proposed ESVM model as an alternative to the traditionally used logistic model. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1242 / 1249
页数:8
相关论文
共 58 条
[1]  
Abidali A.F., 1995, CONSTR MANAG ECON, V13, P189, DOI [10.1080/01446199500000023, DOI 10.1080/01446199500000023]
[2]   Comparing the performance of market-based and accounting-based bankruptcy prediction models [J].
Agarwal, Vineet ;
Taffler, Richard .
JOURNAL OF BANKING & FINANCE, 2008, 32 (08) :1541-1551
[3]  
Ahn H, 2006, LECT NOTES COMPUT SC, V4234, P420
[4]   Applying support vector machines to imbalanced datasets [J].
Akbani, R ;
Kwek, S ;
Japkowicz, N .
MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 :39-50
[5]   FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND PREDICTION OF CORPORATE BANKRUPTCY [J].
ALTMAN, EI .
JOURNAL OF FINANCE, 1968, 23 (04) :589-609
[6]  
[Anonymous], P INT JOINT C NEUR N
[7]   Bankruptcy prediction for credit risk using neural networks: A survey and new results [J].
Atiya, AF .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (04) :929-935
[8]   FINANCIAL RATIOS AS PREDICTORS OF FAILURE [J].
BEAVER, WH .
JOURNAL OF ACCOUNTING RESEARCH, 1966, 4 :71-111
[9]   Support vector machines for credit scoring and discovery of significant features [J].
Bellotti, Tony ;
Crook, Jonathan .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3302-3308
[10]  
Bharath S.T., 2004, FORECASTING DEFAULT