Modified Average of the Base-Level Models in the Hill-Climbing Bagged Ensemble Selection Algorithm for Credit Scoring

被引:5
作者
Handhika, Tri [1 ]
Fahrurozi, Achmad [1 ]
Zen, Revaldo Ilfestra Metzi [2 ]
Lestari, Dewi Putrie [1 ]
Sari, Ilmiyati [1 ]
Murni [1 ]
机构
[1] Gunadarma Univ, Computat Math Study Ctr, Margonda Raya St 100, Depok 16424, Indonesia
[2] Metra Digital Media, Big Data Div, Wisma Aldiron Dirgantara 2nd Floor Suite 202-209, South Jakarta 12780, Indonesia
来源
4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY | 2019年 / 157卷
关键词
Heterogeneous ensemble classifier; hill-climbing; ensemble selection; bagging; average; credit scoring; CLASSIFICATION ALGORITHMS; REGRESSION; TREE;
D O I
10.1016/j.procs.2019.08.162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Performance of credit scoring model is a main concern for financial institutions in determining the credit risk of credit applicants. Credit score will be one of basis for the lender to make a decision, approved or rejected, for any credit applications. There are many methods and approaches that have been modeled for this problem. This study tries to explore further the Hill-Climbing Bagged Ensemble Selection (HCES-Bag) algorithm which has the best performance for credit scoring model as has been analyzed comprehensively in the research conducted by Lessmann et al.. We modify some average formulas for the base-level models to find out the opportunity for improving the performance of credit scoring model as measured by several performance indicators. Experiment with German Credit Data from the UCI Machine Learning Repository by first using Multivariate Adaptive Regression Splines (MARS) model for features selection demonstrates that the modification average does not affect credit scoring model performance significantly. However, some of them make the credit scoring model become more efficient because we can obtained same level of credit scoring model performances by using only smaller number of base-level models. (C) 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/) Peer-review under responsibility of the scientific committee of the 4th International Conference on Computer Science and Computational Intelligence 2019.
引用
收藏
页码:229 / 237
页数:9
相关论文
共 25 条
[1]   A comparative study on base classifiers in ensemble methods for credit scoring [J].
Abelian, Joaquin ;
Castellano, Javier G. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 73 :1-10
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   An experimental comparison of classification algorithms for imbalanced credit scoring data sets [J].
Brown, Iain ;
Mues, Christophe .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) :3446-3453
[4]  
Caruana R., 2004, P 21 INT C MACHINE L
[5]   Constructing a reassigning credit scoring model [J].
Chuang, Chun-Ling ;
Lin, Rong-Ho .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :1685-1694
[6]   Credit scorecard based on logistic regression with random coefficients [J].
Dong, Gang ;
Lai, Kin Keung ;
Yen, Jerome .
ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01) :2457-2462
[7]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67
[8]   Extremely randomized trees [J].
Geurts, P ;
Ernst, D ;
Wehenkel, L .
MACHINE LEARNING, 2006, 63 (01) :3-42
[9]   Mining the customer credit using classification and regression tree and multivariate adaptive regression splines [J].
Lee, TS ;
Chiu, CC ;
Chou, YC ;
Lu, CJ .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (04) :1113-1130
[10]   A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines [J].
Lee, TS ;
Chen, IF .
EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (04) :743-752