Multi-grained and multi-layered gradient boosting decision tree for credit scoring

被引:21
作者
Liu, Wan'an [1 ]
Fan, Hong [1 ]
Xia, Min [2 ]
机构
[1] Donghua Univ, Glorious Sun Sch Business & Management, Shanghai 200051, Peoples R China
[2] Nanjing Univ Sci Informat & Technol, Sch Automat, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Credit scoring; Representation learning; Multi-layered; GBDT; CLASSIFICATION ALGORITHMS; DEFAULT PREDICTION; SMOTE;
D O I
10.1007/s10489-021-02715-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit scoring is an important process for banks and financial institutions to manage credit risk. Tree-based ensemble algorithms have made promising progress in credit scoring. However, tree-based ensemble algorithms lack representation learning, making them cannot well express the potential distribution of loan data. In this study, we propose a multi-grained and multi-layered gradient boosting decision tree (GBDT) for credit scoring. Multi-layered GBDT considers the advantages of the explicit learning process of tree-based model and the representation learning ability to discriminate good/bad applicants; multi-grained scanning augments original credit features while enhancing the representation learning ability of multi-layered GBDT. The experimental results on 6 credit scoring datasets show that the hierarchical structure can effectively reduce the intra-class distance and increase the inter-class distance of the credit scoring dataset. In addition, Multi-grained feature augmentation effectively increases the diversity of prediction and further improves the performance of credit scoring, providing more precise credit scoring results.
引用
收藏
页码:5325 / 5341
页数:17
相关论文
共 45 条
[1]   Deep convolutional neural network for the automated diagnosis of congestive heart failure using ECG signals [J].
Acharya, U. Rajendra ;
Fujita, Hamido ;
Oh, Shu Lih ;
Hagiwara, Yuki ;
Tan, Jen Hong ;
Adam, Muhammad ;
Tan, Ru San .
APPLIED INTELLIGENCE, 2019, 49 (01) :16-27
[2]   Classifiers consensus system approach for credit scoring [J].
Ala'raj, Maher ;
Abbod, Maysam F. .
KNOWLEDGE-BASED SYSTEMS, 2016, 104 :89-105
[3]   Example-Dependent Cost-Sensitive Logistic Regression for Credit Scoring [J].
Bahnsen, Alejandro Correa ;
Aouada, Djamila ;
Ottersten, Bjorn .
2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, :263-269
[4]   Wide and deep learning for peer-to-peer lending [J].
Bastani, Kaveh ;
Asgari, Elham ;
Namavari, Hamed .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 134 :209-224
[5]  
Bengio Y., 2007, P ADV NEUR INF PROC, P153
[6]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[7]   Extreme learning machines for credit scoring: An empirical evaluation [J].
Beque, Artem ;
Lessmann, Stefan .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 86 :42-53
[8]  
Feng Ji, 2018, ADV NEUR IN, V31
[9]   SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary [J].
Fernandez, Alberto ;
Garcia, Salvador ;
Herrera, Francisco ;
Chawla, Nitesh V. .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 :863-905
[10]   An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market [J].
Fitzpatrick, Trevor ;
Mues, Christophe .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 249 (02) :427-439