Dynamic ensemble classification for credit scoring using soft probability

被引：81

作者：

Feng, Xiaodong ^{[1
]}

Xiao, Zhi ^{[1
]}

Zhong, Bo ^{[2
]}

Qiu, Jing ^{[1
]}

Dong, Yuanxiang ^{[3
]}

机构：

[1] Chongqing Univ, Sch Econ & Business Adm, Chongqing 400044, Peoples R China

[2] Chongqing Univ, Coll Math & Stat, Chongqing 400044, Peoples R China

[3] Shanxi Univ Finance & Econ, Sch Management Sci & Engn, Taiyuan 030006, Shanxi, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2018年 / 65卷

基金：

中国国家自然科学基金;

关键词：

Credit scoring; Dynamic ensemble classification; Selective ensemble; Soft probability; Machine learning; SUPPORT VECTOR MACHINE; NEURAL-NETWORK; BANKRUPTCY PREDICTION; CORPORATE BANKRUPTCY; RISK-ASSESSMENT; SELECTION; MODELS; CLASSIFIERS; PERFORMANCE; SYSTEMS;

D O I：

10.1016/j.asoc.2018.01.021

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, classification ensembles or multiple classifier systems have been widely applied to credit scoring, and they achieve significantly better performance than individual classifiers do. Selective ensembles, an important part of this group of systems, are a promising field of research. However, none of them considers the relative costs of Type I error and Type II error for credit scoring when selecting classifiers, which bring higher risks for the financial institutions. Moreover, earlier dynamic selective ensembles usually select and combine classifiers for each test sample dynamically based on classifiers performance in the validation set, regardless of their behaviors in the testing set. To fill the gap and overcome the limitations, we propose a new dynamic ensemble classification method for credit scoring based on soft probability. In this method, the classifiers are first selected based on their classification ability and the relative costs of Type I error and Type II error in the validation set. With the selected classifiers, we combine different classifiers for the samples in the testing set based on their classification results to get an interval probability of default by using soft probability. The proposed method is compared with some well-known individual classifiers and ensemble classification methods, including five selective ensembles, for credit scoring by using ten real-world data sets and seven performance indicators. Through these analyses and statistical tests, the experimental results demonstrate the ability and efficiency of the proposed method to improve prediction performance against the benchmark models. (c) 2018 Elsevier B.V. All rights reserved.

引用

页码：139 / 151

页数：13

共 62 条

[11] META-DES: A dynamic ensemble selection framework using meta-learning [J].

Cruz, Rafael M. O. ;

Sabourin, Robert ;

Cavalcanti, George D. C. ;

Ren, Tsang Ing .

PATTERN RECOGNITION, 2015, 48 (05) :1925-1935

[12]

Demsar J, 2006, J MACH LEARN RES, V7, P1

[13] The use of multiple measurements in taxonomic problems [J].

Fisher, RA .

ANNALS OF EUGENICS, 1936, 7 :179-188

[14] An insight into the experimental design for credit risk and corporate bankruptcy prediction systems [J].

Garcia, Vicente ;

Marques, Ana I. ;

Salvador Sanchez, J. .

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2015, 44 (01) :159-189

[15] When is the area under the receiver operating characteristic curve an appropriate measure of classifier performance? [J].

Hand, D. J. ;

Anagnostopoulos, C. .

PATTERN RECOGNITION LETTERS, 2013, 34 (05) :492-495

[16] Measuring classifier performance: a coherent alternative to the area under the ROC curve [J].

Hand, David J. .

MACHINE LEARNING, 2009, 77 (01) :103-123

[17]

Hand DJ, 1997, J R STAT SOC A STAT, V160, P523

[18] Credit scoring using the clustered support vector machine [J].

Harris, Terry .

EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (02) :741-750

[19]

Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601

[20] A data driven ensemble classifier for credit scoring analysis [J].

Hsieh, Nan-Chen ;

Hung, Lun-Ping .

EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (01) :534-545

← 1 2 3 4 5 6 7 →