Machine Learning and Deep Learning for Loan Prediction in Banking: Exploring Ensemble Methods and Data Balancing

Authors
Sayed, Eslam Hussein [1 ,2 ]
Alabrah, Amerah [3 ]
Rahouma, Kamel Hussein [4 ]
Zohaib, Muhammad [5 ]
Badry, Rasha M. [1 ]
Affiliations
[1] Fayoum Univ, Fac Comp & Informat, Informat Syst Dept, Faiyum, Egypt
[2] Nahda Univ, Fac Comp Sci, Informat Syst Dept, Bani Suwayf 62764, Egypt
[3] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
[4] Minia Univ, Fac Engn, Elect Engn Dept, Al Minya, Egypt
[5] Lappeenranta Lahti Univ Technol, Software Engn Dept, Informat Syst Dept, Lappeenranta 53851, Finland
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Accuracy; Random forests; Predictive models; Classification algorithms; Prediction algorithms; Machine learning algorithms; Logistic regression; Support vector machines; Ensemble learning; Deep learning; Customer loan prediction; artificial intelligence; data preprocessing; model optimization; machine learning; deep learning; classification models; CLASSIFICATION;
DOI
10.1109/ACCESS.2024.3509774
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Predicting loan defaults is crucial for banks and financial institutions because defaults affect earnings, and the prediction also plays a significant role in shaping credit scores. The task is challenging, and the number of applications grows with the demand for loans. Traditional eligibility checks are time-consuming and laborious, and they may not always identify suitable loan recipients accurately. As a result, some applicants may default on their loans, causing financial losses for banks. Artificial Intelligence, using Machine Learning and Deep Learning techniques, can provide a more efficient solution. These techniques can use various classification algorithms to predict which applicants are likely to be eligible for loans. This study uses seven Machine Learning classification algorithms (Gaussian Naive Bayes, AdaBoost, Gradient Boosting, K Neighbors Classifier, Decision Trees, Random Forest, and Logistic Regression) and eight Deep Learning algorithms (MLP, CNN, LSTM, Transformer, GRU, Autoencoder, ResNet, and DenseNet). The use of Ensemble Methods, together with the SMOTE and SMOTE-TOMEK data balancing techniques, also has a positive impact on the results. Four metrics are used to evaluate the effectiveness of these algorithms: accuracy, precision, recall, and F1-measure. The study found that DenseNet and ResNet were the most accurate predictive models. These findings highlight the potential of predictive modeling in identifying credit disapproval among vulnerable consumers in a sea of loan applications.
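As an illustration of the four evaluation metrics named in the abstract, the following is a minimal sketch in plain Python. It is not the authors' code: the function name `binary_metrics`, the `BinaryMetrics` container, and the toy labels are all hypothetical, and the convention that label 1 means "loan approved" is an assumption made only for this example.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class BinaryMetrics:
    accuracy: float
    precision: float
    recall: float
    f1: float


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int],
                   positive: int = 1) -> BinaryMetrics:
    """Compute accuracy, precision, recall, and F1 for a binary classifier.

    `positive` is the label treated as the positive class (assumed here
    to mean "loan approved"; this convention is illustrative only).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("inputs must not be empty")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    tn = len(y_true) - tp - fp - fn

    accuracy = (tp + tn) / len(y_true)
    # Fall back to 0.0 when a denominator is zero (no predicted or no actual positives).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return BinaryMetrics(accuracy, precision, recall, f1)


if __name__ == "__main__":
    # Toy labels invented for illustration; not data from the study.
    y_true = [1, 0, 1, 1, 0, 0, 1, 0]
    y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
    print(binary_metrics(y_true, y_pred))
```

On an imbalanced loan dataset, accuracy alone can look high even when the minority class is rarely predicted correctly, which is one reason precision, recall, and F1 are commonly reported alongside it.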
Pages: 193997 - 194019
Page count: 23