Using semi-supervised classifiers for credit scoring

被引:27
|
作者
Kennedy, K. [1 ]
Mac Namee, B. [1 ]
Delany, S. J. [1 ]
机构
[1] Dublin Inst Technol, Appl Intelligence Res Ctr, Dublin 8, Ireland
关键词
banking; credit scoring; low-default portfolio; supervised classification; one-class classification; benchmarking; BANKRUPTCY PREDICTION; MODELS; PROBABILITIES;
D O I
10.1057/jors.2011.30
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In credit scoring, low-default portfolios (LDPs) are those for which very little default history exists. This makes it problematic for financial institutions to estimate a reliable probability of a customer defaulting on a loan. Banking regulation (Basel II Capital Accord), and best practice, however, necessitate an accurate and valid estimate of the probability of default. In this article the suitability of semi-supervised one-class classification (OCC) algorithms as a solution to the LDP problem is evaluated. The performance of OCC algorithms is compared with the performance of supervised two-class classification algorithms. This study also investigates the suitability of over sampling, which is a common approach to dealing with LDPs. Assessment of the performance of one-and two-class classification algorithms using nine real-world banking data sets, which have been modified to replicate LDPs, is provided. Our results demonstrate that only in the near or complete absence of defaulters should semi-supervised OCC algorithms be used instead of supervised two-class classification algorithms. Furthermore, we demonstrate for data sets whose class labels are unevenly distributed that optimising the threshold value on classifier output yields, in many cases, an improvement in classification performance. Finally, our results suggest that oversampling produces no overall improvement to the best performing two-class classification algorithms. Journal of the Operational Research Society (2013) 64, 513-529. doi:10.1057/jors.2011.30
引用
收藏
页码:513 / 529
页数:17
相关论文
共 50 条
  • [1] A Semi-supervised Approach for Reject Inference in Credit Scoring Using SVMs
    Maldonado, Sebastian
    Paredes, Gonzalo
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2010, 6171 : 558 - 571
  • [2] Reject inference in credit scoring using Semi-supervised Support Vector Machines
    Li, Zhiyong
    Tian, Ye
    Li, Me
    Zhou, Fanyin
    Yang, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 74 : 105 - 114
  • [3] A new corporate credit scoring system using semi-supervised discriminant analysis
    Huang, Shian-Chang
    AFRICAN JOURNAL OF BUSINESS MANAGEMENT, 2011, 5 (22): : 9355 - 9362
  • [4] Credit scoring based on semi-supervised generalized additive logistic regression
    Fang K.
    Chen Z.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2020, 40 (02): : 392 - 402
  • [5] Transductive Semi-Supervised Metric Network for Reject Inference in Credit Scoring
    Guo, Zhiyu
    Ao, Xiang
    He, Qing
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1675 - 1684
  • [6] Semi-supervised Instance Matching Using Boosted Classifiers
    Kejriwa, Mayank
    Miranker, Daniel P.
    SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, ESWC 2015, 2015, 9088 : 388 - 402
  • [7] Semi-supervised Gaussian Process Classifiers
    Sindhwani, Vikas
    Chu, Wei
    Keerthi, S. Sathiya
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1059 - 1064
  • [8] Scalable Semi-Supervised Aggregation of Classifiers
    Balsubramani, Akshay
    Freund, Yoav
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [9] Penalizing proposals using classifiers for semi-supervised object detection
    Hazra, Somnath
    Dasgupta, Pallab
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
  • [10] Automatic image annotation using a semi-supervised ensemble of classifiers
    Marin-Castro, Heidy
    Sucar, Enrique
    Morales, Eduardo
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2007, 4756 : 487 - 495