Credit scoring by one-class classification driven dynamical ensemble learning

Cited by: 5
Authors
Li, Hao [1]
Qiu, Hao [1]
Sun, Shu [1]
Chang, Jun [1]
Tu, Wenting [1]
Affiliations
[1] Shanghai Univ Finance & Econ, Shanghai, Peoples R China
Keywords
Credit scoring; machine learning; ensemble learning; reject inference
DOI
10.1080/01605682.2021.1944824
CLC number
C93 [Management Science]
Discipline classification codes
12; 1201; 1202; 120202
Abstract
Enabling machines to estimate the credit scores of loan applicants is highly useful. Conventional methodologies typically train Credit Scoring (CS) models on data from clients who passed a previous credit examination (i.e., who were considered adequately creditworthy and took out a loan). However, CS models trained on data from applicants with good credit backgrounds may not work well for new applicants with plain or ambiguous credit backgrounds. Previous work typically alleviates this problem with reject inference and semi-supervised learning techniques. In this article, we propose a novel approach called "One-class Classification Driven Dynamical Ensemble Learning" (OCDDEL). Unlike reject inference or semi-supervised learning, OCDDEL does not use inferred labels of past rejected applications. Instead, it relies only on past accepted applications and their true labels. It builds a dynamical ensemble model that handles different test applications in different ways. To determine the ensemble weights for a specific test case, OCDDEL learns a one-class classifier that separates test applications into groups according to their similarity to the training applicants. An experimental evaluation on two real-world datasets demonstrates the effectiveness of our approach.
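The weighting idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which one-class classifier, base models, or weighting rule OCDDEL uses, so everything below is an assumption made for illustration. The `CentroidOneClass` detector (a simple distance-to-centroid rule), the two placeholder base scorers, the sample data, and the fixed weight table are all hypothetical.

```python
import math
import statistics


class CentroidOneClass:
    """Toy one-class model (assumed stand-in): flags rows far from the
    centroid of the accepted (training) applicants."""

    def fit(self, rows, quantile=0.95):
        cols = list(zip(*rows))
        self.mean = [statistics.fmean(c) for c in cols]
        # Fall back to 1.0 when a feature has zero spread, so the
        # standardized distance stays finite.
        self.std = [statistics.pstdev(c) or 1.0 for c in cols]
        scores = sorted(self.distance(r) for r in rows)
        # Threshold taken from the sorted training distances at `quantile`.
        idx = min(len(scores) - 1, int(quantile * len(scores)))
        self.threshold = scores[idx]
        return self

    def distance(self, row):
        # Euclidean distance in per-feature standardized units.
        return math.sqrt(sum(((x - m) / s) ** 2
                             for x, m, s in zip(row, self.mean, self.std)))

    def is_similar(self, row):
        return self.distance(row) <= self.threshold


def dynamic_ensemble_score(row, occ, base_models, weights_by_group):
    """Pick ensemble weights per test case from its one-class group."""
    group = "similar" if occ.is_similar(row) else "dissimilar"
    weights = weights_by_group[group]
    return sum(w * model(row) for w, model in zip(weights, base_models))


# Hypothetical accepted applicants (two made-up features each).
accepted = [[0.9, 0.8], [1.0, 0.9], [1.1, 1.0], [0.95, 0.85], [1.05, 0.95]]
occ = CentroidOneClass().fit(accepted)

# Placeholder base scorers; real ones would be trained CS models.
def specialist(row):
    return 0.9

def cautious(row):
    return 0.5

# Assumed weight table: trust the specialist only for familiar applicants.
weights = {"similar": (0.8, 0.2), "dissimilar": (0.2, 0.8)}

familiar = dynamic_ensemble_score([1.0, 0.9], occ, [specialist, cautious], weights)
unfamiliar = dynamic_ensemble_score([5.0, -3.0], occ, [specialist, cautious], weights)
```

In this sketch, an applicant who resembles the accepted population is scored mostly by the model trained on that population, while an unfamiliar applicant shifts weight to the more cautious scorer. Only labels from accepted applications are needed, in line with the abstract's description.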
Pages: 181-190
Number of pages: 10