Advancing credit risk modelling with Machine Learning: A comprehensive review of the state-of-the-art

被引:2
作者
Montevechi, Andre Aoun [1 ]
Miranda, Rafael de Carvalho [1 ]
Medeiros, Andre Luiz [1 ]
Montevechi, Jose Arnaldo Barra [1 ]
机构
[1] Fed Univ Itajuba UNIFEI, Ave BPS 1303,Caixa Postal 50, BR-37500903 Itajuba, MG, Brazil
关键词
Credit risk management; Consumer credit scoring; Machine learning in financial risk management; SUPPORT VECTOR MACHINE; CLASSIFICATION ALGORITHMS; FEATURE-SELECTION; NEURAL-NETWORK; SCORING MODEL; LOGISTIC-REGRESSION; PREDICTION; PEER; CLASSIFIERS; SYSTEM;
D O I
10.1016/j.engappai.2024.109082
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensuring financial stability necessitates responsible credit granting, so lending institutions maintain sufficient regulatory capital to withstand losses from defaults. Classification methods like Logistic Regression (LR) have long been the standard for estimating default likelihood, but they struggle with increasing operational demands and regulatory standards. Advances in Machine Learning (ML) now allow models to handle larger datasets with greater predictive power. Algorithms such as Decision Trees (DT), Support Vector Machines (SVM), and Neural Networks (NN), though not readily transferable to practical applications, have the potential to enhance the credit risk modelling process. While much of the literature on ML in credit risk focuses on sophisticated, experimental algorithms not yet ready for commercial use, few address the steps and nuances of model development. Our article fills this gap by systematically reviewing the literature on ML in consumer credit risk modelling, highlighting the current state of scientific knowledge. We analyze the frequent steps involved and demonstrate how ML's strengths can be leveraged throughout them. From pre-processing steps like data cleaning, transformation, and reduction, to variable selection and parameter optimization, our findings indicate that ML is becoming increasingly important in financial risk management. However, standardized modelling procedures are needed before complex ML classifiers can be commercially deployed. Meanwhile, although relatively interpretable ML models built around DTs and Random Forests (RF) do showcase increased performance in popular credit risk datasets, it is clear that the potential to benefit from ML spans further, with complex models enabling increased performance by aiding simpler, explainable classification algorithms.
引用
收藏
页数:18
相关论文
共 137 条
  • [1] Improve credit scoring using transfer of learned knowledge from self-organizing map
    AghaeiRad, Ali
    Chen, Ning
    Ribeiro, Bernardete
    [J]. NEURAL COMPUTING & APPLICATIONS, 2017, 28 (06) : 1329 - 1342
  • [2] Classifiers consensus system approach for credit scoring
    Ala'raj, Maher
    Abbod, Maysam F.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 104 : 89 - 105
  • [3] Comparing Performance of Machine Learning Algorithms for Default Risk Prediction in Peer to Peer Lending
    Aleksandrova, Yanka
    [J]. TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2021, 10 (01): : 133 - 143
  • [4] A Machine Learning Approach for Micro-Credit Scoring
    Ampountolas, Apostolos
    Nyarko Nde, Titus
    Date, Paresh
    Constantinescu, Corina
    [J]. RISKS, 2021, 9 (03)
  • [5] [Anonymous], 2017, Capco Institute Journal of Financial Transformation
  • [6] Assessing naive Bayes as a method for screening credit applicants
    Antonakis, A. C.
    Sfakianakis, M. E.
    [J]. JOURNAL OF APPLIED STATISTICS, 2009, 36 (05) : 537 - 545
  • [7] Armaki AG, 2017, ENG TECHNOL APPL SCI, V7, P2073
  • [8] Benchmarking state-of-the-art classification algorithms for credit scoring
    Baesens, B
    Van Gestel, T
    Viaene, S
    Stepanova, M
    Suykens, J
    Vanthienen, J
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) : 627 - 635
  • [9] Gradient boosting survival tree with applications in credit scoring
    Bai, Miaojun
    Zheng, Yan
    Shen, Yun
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2022, 73 (01) : 39 - 55
  • [10] Reject inference, augmentation, and sample selection
    Banasik, John
    Crook, Jonathan
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 183 (03) : 1582 - 1594