Tree-based heterogeneous cascade ensemble model for credit scoring

被引:17
作者
Liu, Wanan [1 ]
Fan, Hong [1 ]
Xia, Meng [2 ]
机构
[1] Donghua Univ, Glorious Sun Sch Business & Management, Shanghai 200051, Peoples R China
[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Credit scoring; Ensemble algorithm; Heterogeneous deep forest; Weighted voting mechanism; Interpretability; ART CLASSIFICATION ALGORITHMS; BANKRUPTCY PREDICTION; FEATURE-SELECTION; IMPACT; PERFORMANCE; MACHINES;
D O I
10.1016/j.ijforecast.2022.07.007
中图分类号
F [经济];
学科分类号
02 ;
摘要
Credit scoring is an important tool to guard against commercial risks for banks and lending companies and provides good conditions for the construction of individual personal credit. Ensemble algorithms have shown appealing progress for the improvement of credit scoring. In this study, to meet the challenge of large-scale credit scoring, we propose a heterogeneous deep forest model (Heter-DF), which is established based on considerations ranging from base learner selection, encouragement of the diversity of base learners, and ensemble strategies, for credit scoring. Heter-DF is designed as a scalable cascading framework that can increase its complexity with the scale of the credit dataset. Moreover, each level of Heter-DF is built by multiple heterogeneous tree-based ensembled base learners, avoiding the homogeneous prediction of the ensemble framework. In addition, a weighted voting mechanism is introduced to highlight important information and suppress irrelevant features, making Heter-DF a robust model for credit scoring. Experimental results on four credit scoring datasets and six evaluation metrics show that the cascading framework a good choice for the ensemble of tree-based base learners. A comparison among homogeneous ensembles and heterogeneous ensembles further demonstrates the effectiveness of Heter-DF. Experiments on different training sets indicate that Heter-DF is a scalable framework which not only deals with large-scale credit scoring but also satisfies the condition where small-scale credit scoring is desirable. Finally, based on the good interpretability of a tree-based structure, the global interpretation of Heter-DF is preliminarily explored. (c) 2022 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1593 / 1614
页数:22
相关论文
共 50 条
[41]   INTELLIGENT TREE-BASED ENSEMBLE APPROACHES FOR PHISHING WEBSITE DETECTION [J].
Alsariera, Yazan A. ;
Balogun, Abdullateef O. ;
Adeyemo, Victor E. ;
Tarawneh, Omar H. ;
Mojeed, Hammed A. .
JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2022, 17 (01) :563-582
[42]   A tree-based varying coefficient model [J].
Zakrisson, Henning ;
Lindholm, Mathias .
COMPUTATIONAL STATISTICS, 2025,
[43]   Credit Scoring Models Using Ensemble Learning and Classification Approaches: A Comprehensive Survey [J].
Tripathi, Diwakar ;
Shukla, Alok Kumar ;
Reddy, B. Ramachandra ;
Bopche, Ghanshyam S. ;
Chandramohan, D. .
WIRELESS PERSONAL COMMUNICATIONS, 2022, 123 (01) :785-812
[44]   Forecasting regional in-situ thermal conductivity of soil based on tree-based ensemble learning [J].
Li, Xuquan ;
Gong, Mingyu ;
Dong, Jierui ;
Zhou, Ziyi ;
Han, Bo ;
Yu, Huili .
INTERNATIONAL COMMUNICATIONS IN HEAT AND MASS TRANSFER, 2024, 159
[45]   A Credit Scoring Model Based on Integrated Mixed Sampling and Ensemble Feature Selection: RBR XGB _ [J].
Lin, Xiaobing ;
Wu, Zhe ;
Chen, Jianfa ;
Huang, Lianfen ;
Shi, Zhiyuan .
JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (05) :1061-1068
[46]   Credit scoring based on tree-enhanced gradient boosting decision trees [J].
Liu, Wanan ;
Fan, Hong ;
Xia, Meng .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189
[47]   A novel deep ensemble model for imbalanced credit scoring in internet finance [J].
Xiao, Jin ;
Zhong, Yu ;
Jia, Yanlin ;
Wang, Yadong ;
Li, Ruoyi ;
Jiang, Xiaoyi ;
Wang, Shouyang .
INTERNATIONAL JOURNAL OF FORECASTING, 2024, 40 (01) :348-372
[48]   A Novel Method for Credit Scoring Based on Cost-Sensitive Neural Network Ensemble [J].
Yotsawat, Wirot ;
Wattuya, Pakaket ;
Srivihok, Anongnart .
IEEE ACCESS, 2021, 9 :78521-78537
[49]   Decision Tree-Based Ensemble Model for Predicting National Greenhouse Gas Emissions in Saudi Arabia [J].
Rahman, Muhammad Muhitur ;
Shafiullah, Md ;
Alam, Md Shafiul ;
Rahman, Mohammad Shahedur ;
Alsanad, Mohammed Ahmed ;
Islam, Mohammed Monirul ;
Islam, Md Kamrul ;
Rahman, Syed Masiur .
APPLIED SCIENCES-BASEL, 2023, 13 (06)
[50]   An application of locally linear model tree algorithm with combination of feature selection in credit scoring [J].
Siami, Mohammad ;
Gholamian, Mohammad Reza ;
Basiri, Javad .
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2014, 45 (10) :2213-2222