Tree-based heterogeneous cascade ensemble model for credit scoring

被引:12
|
作者
Liu, Wanan [1 ]
Fan, Hong [1 ]
Xia, Meng [2 ]
机构
[1] Donghua Univ, Glorious Sun Sch Business & Management, Shanghai 200051, Peoples R China
[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Credit scoring; Ensemble algorithm; Heterogeneous deep forest; Weighted voting mechanism; Interpretability; ART CLASSIFICATION ALGORITHMS; BANKRUPTCY PREDICTION; FEATURE-SELECTION; IMPACT; PERFORMANCE; MACHINES;
D O I
10.1016/j.ijforecast.2022.07.007
中图分类号
F [经济];
学科分类号
02 ;
摘要
Credit scoring is an important tool to guard against commercial risks for banks and lending companies and provides good conditions for the construction of individual personal credit. Ensemble algorithms have shown appealing progress for the improvement of credit scoring. In this study, to meet the challenge of large-scale credit scoring, we propose a heterogeneous deep forest model (Heter-DF), which is established based on considerations ranging from base learner selection, encouragement of the diversity of base learners, and ensemble strategies, for credit scoring. Heter-DF is designed as a scalable cascading framework that can increase its complexity with the scale of the credit dataset. Moreover, each level of Heter-DF is built by multiple heterogeneous tree-based ensembled base learners, avoiding the homogeneous prediction of the ensemble framework. In addition, a weighted voting mechanism is introduced to highlight important information and suppress irrelevant features, making Heter-DF a robust model for credit scoring. Experimental results on four credit scoring datasets and six evaluation metrics show that the cascading framework a good choice for the ensemble of tree-based base learners. A comparison among homogeneous ensembles and heterogeneous ensembles further demonstrates the effectiveness of Heter-DF. Experiments on different training sets indicate that Heter-DF is a scalable framework which not only deals with large-scale credit scoring but also satisfies the condition where small-scale credit scoring is desirable. Finally, based on the good interpretability of a tree-based structure, the global interpretation of Heter-DF is preliminarily explored. (c) 2022 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1593 / 1614
页数:22
相关论文
共 50 条
  • [1] A novel tree-based dynamic heterogeneous ensemble method for credit scoring
    Xia, Yufei
    Zhao, Junhao
    He, Lingyun
    Li, Yinguo
    Niu, Mengyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
  • [2] A novel heterogeneous ensemble credit scoring model based on bstacking approach
    Xia, Yufei
    Liu, Chuanzhe
    Da, Bowen
    Xie, Fangming
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 93 : 182 - 199
  • [3] An interpretable decision tree ensemble model for imbalanced credit scoring datasets
    My, Bui T. T.
    Ta, Bao Q.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 10853 - 10864
  • [4] A new hybrid ensemble credit scoring model based on classifiers consensus system approach
    Ala'raj, Maher
    Abbod, Maysam F.
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 64 : 36 - 55
  • [5] A novel hybrid credit scoring model based on ensemble feature selection and multilayer ensemble classification
    Tripathi, Diwakar
    Edla, Damodar Reddy
    Cheruku, Ramalingaswamy
    Kuppili, Venkatanareshbabu
    COMPUTATIONAL INTELLIGENCE, 2019, 35 (02) : 371 - 394
  • [6] A systematic credit scoring model based on heterogeneous classifier ensembles
    Ala'raj, Maher
    Abbod, Maysam
    2015 INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA) PROCEEDINGS, 2015, : 119 - 125
  • [7] A heterogeneous ensemble credit scoring model based on adaptive classifier selection: An application on imbalanced data
    Zhang, Tong
    Chi, Guotai
    INTERNATIONAL JOURNAL OF FINANCE & ECONOMICS, 2021, 26 (03) : 4372 - 4385
  • [8] Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring
    Plawiak, Pawel
    Abdar, Moloud
    Acharya, U. Rajendra
    APPLIED SOFT COMPUTING, 2019, 84
  • [9] A New Dynamic Credit Scoring Model Based on Clustering Ensemble
    Gao Wei
    Cheng Mingshu
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 421 - 425
  • [10] Enhancing credit risk prediction based on ensemble tree-based feature transformation and logistic regression
    Liu, Jiaming
    Liu, Jiajia
    Wu, Chong
    Wang, Shouyang
    JOURNAL OF FORECASTING, 2024, 43 (02) : 429 - 455