Performance Evaluation of Hybrid Machine Learning Algorithms for Online Lending Credit Risk Prediction

被引:1
作者
Berhane, Tesfahun [1 ]
Melese, Tamiru [1 ]
Seid, Abdu Mohammed [1 ]
机构
[1] Bahir Dar Univ, Dept Math, POB 79, Bahir Dar, Ethiopia
关键词
Convolutional neural networks - Decision trees - E-learning - Finance - Learning algorithms - Logistic regression - Losses - Machine learning - Peer to peer networks - Risk assessment;
D O I
10.1080/08839514.2024.2358661
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Peer-to-Peer systems are still in the early stages of development when it comes to the processing of credit and the appraisal of the risk associated with it. In this study, we used a hybrid convolutional neural network with logistic regression, a gradient-boosting decision tree, and a k-nearest neighbor to predict the credit risk in a P2P lending club. The lending clubs publicly available P2P loan data was used to train the model. In order to address the issue of data imbalance within the dataset, specifically between the non-defaulter and defaulter classes, the synthetic minority oversampling technique sampling approach is utilized. We developed the architecture of our hybrid model by removing the fully connected layer with the soft-max, which is the final layer of the fully connected CNN model and replaced by LR, GBDT, and k-NN algorithms. The experimental results show that the hybrid CNN-kNN model outperforms the CNN-GBDT and CNN-LR models based on the performance metrics accuracy, recall, F1-score, and area under the curve for both all input and important features. This shows that hybrid machine learning models effectively identify and categorize credit risk in peer-to-peer lending clubs, hence assisting in financial loss prevention.
引用
收藏
页数:26
相关论文
共 46 条
[1]   A Pyramid-CNN Based Deep Learning Model for Power Load Forecasting of Similar-Profile Energy Customers Based on Clustering [J].
Aurangzeb, Khursheed ;
Alhussein, Musaed ;
Javaid, Kumail ;
Haider, Syed Irtaza .
IEEE ACCESS, 2021, 9 :14992-15003
[2]   A comparative analysis of gradient boosting algorithms [J].
Bentejac, Candice ;
Csorgo, Anna ;
Martinez-Munoz, Gonzalo .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (03) :1937-1967
[3]  
Berhane T., 2023, Math. Probl. Eng, V2023, P1, DOI [10.1155/2023/8134627, DOI 10.1155/2023/8134627]
[4]   Explainable Machine Learning in Credit Risk Management [J].
Bussmann, Niklas ;
Giudici, Paolo ;
Marinelli, Dimitri ;
Papenbrock, Jochen .
COMPUTATIONAL ECONOMICS, 2021, 57 (01) :203-216
[5]  
Caie PD., 2021, Artificial Intelligence and Deep Learning in Pathology, DOI [10.1016/b978-0-323-67538-3.00008-7, DOI 10.1016/B978-0-323-67538-3.00008-7]
[6]   Machine learning and artificial neural networks to construct P2P lending credit-scoring model: A case using Lending Club data [J].
Chang, An-Hsing ;
Yang, Li-Kai ;
Tsaih, Rua-Huan ;
Lin, Shih-Kuei .
QUANTITATIVE FINANCE AND ECONOMICS, 2022, 6 (02) :303-325
[7]   Machine learning-based classification of rock discontinuity trace: SMOTE oversampling integrated with GBT ensemble learning [J].
Chen, Jiayao ;
Huang, Hongwei ;
Cohn, Anthony G. ;
Zhang, Dongming ;
Zhou, Mingliang .
INTERNATIONAL JOURNAL OF MINING SCIENCE AND TECHNOLOGY, 2022, 32 (02) :309-322
[8]   Predicting corporate financial distress based on integration of decision tree classification and logistic regression [J].
Chen, Mu-Yen .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) :11261-11272
[9]   Predicting Default Risk on Peer-to-Peer Lending Imbalanced Datasets [J].
Chen, Yen-Ru ;
Leu, Jenq-Shiou ;
Huang, Sheng-An ;
Wang, Jui-Tang ;
Takada, Jun-Ichi .
IEEE ACCESS, 2021, 9 :73103-73109
[10]   Ensemble learning for the early prediction of neonatal jaundice with genetic features [J].
Deng, Haowen ;
Zhou, Youyou ;
Wang, Lin ;
Zhang, Cheng .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)