Evolving Transparent Credit Risk Models: A Symbolic Regression Approach Using Genetic Programming

Cited by: 1
Authors
Sotiropoulos, Dionisios N. [1]
Koronakos, Gregory [1]
Solanakis, Spyridon V. [1]
Affiliations
[1] Univ Piraeus, Dept Informat, 80 Karaoli & Dimitriou Str, Piraeus 18534, Greece
Keywords
credit risk assessment; neural networks; support vector machines; genetic programming; radial basis functions networks; LOGISTIC-REGRESSION; NETWORK
DOI
10.3390/electronics13214324
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Credit scoring is a cornerstone of financial risk management, enabling financial institutions to assess the likelihood of loan default. However, widely recognized contemporary credit risk metrics, like FICO (Fair Isaac Corporation) or Vantage scores, remain proprietary and inaccessible to the public. This study aims to devise an alternative credit scoring metric that mirrors the FICO score, using an extensive dataset from Lending Club. The challenge lies in the limited available insights into both the precise analytical formula and the comprehensive suite of credit-specific attributes integral to the FICO score's calculation. Our proposed metric leverages basic information provided by potential borrowers, eliminating the need for extensive historical credit data. We aim to articulate this credit risk metric in a closed analytical form with variable complexity. To achieve this, we employ a symbolic regression method anchored in genetic programming (GP). Here, the Occam's razor principle guides evolutionary bias toward simpler, more interpretable models. To ascertain our method's efficacy, we juxtapose the approximation capabilities of GP-based symbolic regression with established machine learning regression models, such as Gaussian Support Vector Machines (GSVMs), Multilayer Perceptrons (MLPs), Regression Trees, and Radial Basis Function Networks (RBFNs). Our experiments indicate that GP-based symbolic regression offers accuracy comparable to these benchmark methodologies. Moreover, the resultant analytical model offers invaluable insights into credit risk evaluation mechanisms, enabling stakeholders to make informed credit risk assessments. This study contributes to the growing demand for transparent machine learning models by demonstrating the value of interpretable, data-driven credit scoring models.
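The abstract's idea of letting Occam's razor bias evolution toward simpler models is commonly realized as parsimony pressure in the fitness function. The following is a minimal sketch of that general technique, not the authors' implementation: the tuple-based tree encoding, the function names, and the penalty weight `alpha` are all assumptions made for illustration.

```python
# Illustrative sketch only: parsimony-penalized fitness for symbolic regression.
# Trees are nested tuples (hypothetical encoding):
#   ("x", i)       -> i-th input feature
#   ("const", c)   -> numeric constant
#   (op, left, right) with op in {"add", "sub", "mul"}

def evaluate(tree, row):
    """Evaluate an expression tree on one input row."""
    kind = tree[0]
    if kind == "x":
        return row[tree[1]]
    if kind == "const":
        return tree[1]
    a = evaluate(tree[1], row)
    b = evaluate(tree[2], row)
    if kind == "add":
        return a + b
    if kind == "sub":
        return a - b
    if kind == "mul":
        return a * b
    raise ValueError(f"unknown node type: {kind}")

def size(tree):
    """Count the nodes in a tree; used as the complexity measure."""
    if tree[0] in ("x", "const"):
        return 1
    return 1 + size(tree[1]) + size(tree[2])

def penalized_fitness(tree, X, y, alpha=0.01):
    """Mean squared error plus a size penalty (lower is better).

    alpha is an assumed penalty weight, not a value from the paper.
    """
    mse = sum((evaluate(tree, r) - t) ** 2 for r, t in zip(X, y)) / len(y)
    return mse + alpha * size(tree)

# Toy data generated by y = 2*x0 + x1 (made up for this example).
X = [(1.0, 2.0), (2.0, 0.0), (3.0, 5.0)]
y = [2 * a + b for a, b in X]

# Two candidates with identical predictions but different complexity.
simple = ("add", ("mul", ("const", 2.0), ("x", 0)), ("x", 1))
bloated = ("add", simple, ("mul", ("const", 0.0), ("x", 0)))

fit_simple = penalized_fitness(simple, X, y)
fit_bloated = penalized_fitness(bloated, X, y)
```

Both candidates fit the toy data exactly, so only the size term separates them, and selection on this fitness would favor the shorter expression.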
Pages: 37
Related papers
50 in total
  • [1] A new genetic programming approach in symbolic regression
    Xiong, SW
    Wang, WW
    Li, F
    15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 161 - 165
  • [2] Sequential Symbolic Regression with Genetic Programming
    Oliveira, Luiz Otavio V. B.
    Otero, Fernando E. B.
    Pappa, Gisele L.
    Albinati, Julio
    GENETIC PROGRAMMING THEORY AND PRACTICE XII, 2015, : 73 - 90
  • [3] Compositional Genetic Programming for Symbolic Regression
    Krawiec, Krzysztof
    Kossinski, Dominik
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 570 - 573
  • [4] Symbolic regression via genetic programming
    Augusto, DA
    Barbosa, HJC
    SIXTH BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, VOL 1, PROCEEDINGS, 2000, : 173 - 178
  • [5] Statistical genetic programming for symbolic regression
    Haeri, Maryam Amir
    Ebadzadeh, Mohammad Mehdi
    Folino, Gianluigi
    APPLIED SOFT COMPUTING, 2017, 60 : 447 - 469
  • [6] The Inefficiency of Genetic Programming for Symbolic Regression
    Kronberger, Gabriel
    de Franca, Fabricio Olivetti
    Desmond, Harry
    Bartlett, Deaglan J.
    Kammerer, Lukas
    PARALLEL PROBLEM SOLVING FROM NATURE-PPSN XVIII, PPSN 2024, PT I, 2024, 15148 : 273 - 289
  • [7] On improving genetic programming for symbolic regression
    Gustafson, S
    Burke, EK
    Krasnogor, N
    2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 912 - 919
  • [8] Taylor Genetic Programming for Symbolic Regression
    He, Baihe
    Lu, Qiang
    Yang, Qingyun
    Luo, Jake
    Wang, Zhiguang
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 946 - 954
  • [9] Is Human Walking a Network Medicine Problem? An Analysis Using Symbolic Regression Models with Genetic Programming
    Dasgupta, Pritika
    Hughes, James Alexander
    Daley, Mark
    Sejdić, Ervin
    Computer Methods and Programs in Biomedicine, 2021, 206