Using semi-supervised classifiers for credit scoring

被引：27

作者：

Kennedy, K. ^{[1
]}

Mac Namee, B. ^{[1
]}

Delany, S. J. ^{[1
]}

机构：

[1] Dublin Inst Technol, Appl Intelligence Res Ctr, Dublin 8, Ireland

来源：

JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY | 2013年 / 64卷 / 04期

关键词：

banking; credit scoring; low-default portfolio; supervised classification; one-class classification; benchmarking; BANKRUPTCY PREDICTION; MODELS; PROBABILITIES;

D O I：

10.1057/jors.2011.30

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

In credit scoring, low-default portfolios (LDPs) are those for which very little default history exists. This makes it problematic for financial institutions to estimate a reliable probability of a customer defaulting on a loan. Banking regulation (Basel II Capital Accord), and best practice, however, necessitate an accurate and valid estimate of the probability of default. In this article the suitability of semi-supervised one-class classification (OCC) algorithms as a solution to the LDP problem is evaluated. The performance of OCC algorithms is compared with the performance of supervised two-class classification algorithms. This study also investigates the suitability of over sampling, which is a common approach to dealing with LDPs. Assessment of the performance of one-and two-class classification algorithms using nine real-world banking data sets, which have been modified to replicate LDPs, is provided. Our results demonstrate that only in the near or complete absence of defaulters should semi-supervised OCC algorithms be used instead of supervised two-class classification algorithms. Furthermore, we demonstrate for data sets whose class labels are unevenly distributed that optimising the threshold value on classifier output yields, in many cases, an improvement in classification performance. Finally, our results suggest that oversampling produces no overall improvement to the best performing two-class classification algorithms. Journal of the Operational Research Society (2013) 64, 513-529. doi:10.1057/jors.2011.30

引用

页码：513 / 529

页数：17

共 50 条

[1] A Semi-supervised Approach for Reject Inference in Credit Scoring Using SVMs
Maldonado, Sebastian
Paredes, Gonzalo
ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2010, 6171 : 558 - 571
[2] Reject inference in credit scoring using Semi-supervised Support Vector Machines
Li, Zhiyong
Tian, Ye
Li, Me
Zhou, Fanyin
Yang, Wei
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 74 : 105 - 114
[3] A new corporate credit scoring system using semi-supervised discriminant analysis
Huang, Shian-Chang
AFRICAN JOURNAL OF BUSINESS MANAGEMENT, 2011, 5 (22): : 9355 - 9362
[4] Credit scoring based on semi-supervised generalized additive logistic regression
Fang K.
Chen Z.
Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2020, 40 (02): : 392 - 402
[5] Transductive Semi-Supervised Metric Network for Reject Inference in Credit Scoring
Guo, Zhiyu
Ao, Xiang
He, Qing
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1675 - 1684
[6] Semi-supervised Instance Matching Using Boosted Classifiers
Kejriwa, Mayank
Miranker, Daniel P.
SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, ESWC 2015, 2015, 9088 : 388 - 402
[7] Semi-supervised Gaussian Process Classifiers
Sindhwani, Vikas
Chu, Wei
Keerthi, S. Sathiya
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1059 - 1064
[8] Scalable Semi-Supervised Aggregation of Classifiers
Balsubramani, Akshay
Freund, Yoav
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
[9] Penalizing proposals using classifiers for semi-supervised object detection
Hazra, Somnath
Dasgupta, Pallab
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
[10] Automatic image annotation using a semi-supervised ensemble of classifiers
Marin-Castro, Heidy
Sucar, Enrique
Morales, Eduardo
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2007, 4756 : 487 - 495

← 1 2 3 4 5 →