An interpretable credit risk assessment model with boundary sample identification

被引：0

作者：

Zhang, Runchi ^{[1
]}

Li, Iris ^{[2
]}

Ding, Zhiyuan ^{[3
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Sch Econ, Nanjing, Jiangsu, Peoples R China

[2] NYU, Courant Inst Math Sci, New York, NY USA

[3] Franklin & Marshall Coll, Social Sci, Lancaster City, PA USA

来源：

PEERJ COMPUTER SCIENCE | 2025年 / 11卷

基金：

中国国家自然科学基金;

关键词：

Credit risk assessment; Interpretability; Boundary samples; Noise samples; MACHINE; FINANCE;

D O I：

10.7717/peerj-cs.2988

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Background: Interpretability is a key requirement for ensuring that credit risk assessment models are trustworthy and compliant with regulatory standards. Simultaneously, effectively distinguishing between noise samples and boundary samples is crucial for improving the accuracy of credit risk predictions. Methods: This article introduces a novel credit risk assessment model, Interpretable Credit Risk Assessment Model with Identifying Boundary Samples (IAIBS). The model begins with a logistic regression sub-model that offers strong self-interpretable features. For samples that are not correctly classified, the Attribute Recognition and Perception based on the Distribution of neighboring sample features (ARPD) algorithm is applied to filter out noisy samples and identify boundary samples. A deep learning sub-model is then trained to deeply learn the risk features of these boundary samples. Finally, representative features of all samples are extracted using agglomerative clustering, and the most suitable sub-model is selected for prediction based on the similarity between each sample and the cluster centers. Results: Experimental results on four public datasets demonstrate that the IAIBS model significantly outperforms 11 baseline models, as confirmed by the Nemenyi test. The model achieved area under the curve (AUC) scores of 89.17, 79.86, 97.48, and 66.03 on the PCL, FICO, CCF, and VL datasets, respectively. With appropriate parameter tuning, the IAIBS model maintains strong generalization ability, and each module contributes positively to overall performance. Additionally, the IAIBS model effectively interprets key predictors and prediction outcomes.

引用

页数：27

共 34 条

[1] Combining weighted SMOTE with ensemble learning for the class-imbalanced prediction of small business credit risk [J].

Abedin, Mohammad Zoynul ;

Guotai, Chi ;

Hajek, Petr ;

Zhang, Tong .

COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) :3559-3579

[2] Credit risk evaluation using clustering based fuzzy classification method [J].

Baser, Furkan ;

Koc, Oguz ;

Selcuk-Kestel, Sevtap .

EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223

[3] Prediction of bank credit worthiness through credit risk analysis: an explainable machine learning study [J].

Chang, Victor ;

Xu, Qianwen Ariel ;

Akinloye, Shola Habib ;

Benson, Vladlena ;

Hall, Karl .

ANNALS OF OPERATIONS RESEARCH, 2024,

[4] Financial credit risk assessment: a recent review [J].

Chen, Ning ;

Ribeiro, Bernardete ;

Chen, An .

ARTIFICIAL INTELLIGENCE REVIEW, 2016, 45 (01) :1-23

[5] Interpretable generalized additive neural networks [J].

Kraus, Mathias ;

Tschernutter, Daniel ;

Weinzierl, Sven ;

Zschech, Patrick .

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 317 (02) :303-316

[6] A machine learning approach combining expert knowledge with genetic algorithms in feature selection for credit risk assessment [J].

Lappas, Pantelis Z. ;

Yannacopoulos, Athanasios N. .

APPLIED SOFT COMPUTING, 2021, 107

[7] A Credit Risk Model with Small Sample Data Based on G-XGBoost [J].

Li, Jian ;

Liu, Haibin ;

Yang, Zhijun ;

Han, Lei .

APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (15) :1550-1566

[8] Credit risk assessment of automobile loans using machine learning-based SHapley Additive exPlanations approach [J].

Lin, Shuoyan ;

Song, Dandan ;

Cao, Boyi ;

Gu, Xin ;

Li, Jiazhan .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 147

[9] From local explanations to global understanding with explainable AI for trees [J].

Lundberg, Scott M. ;

Erion, Gabriel ;

Chen, Hugh ;

DeGrave, Alex ;

Prutkin, Jordan M. ;

Nair, Bala ;

Katz, Ronit ;

Himmelfarb, Jonathan ;

Bansal, Nisha ;

Lee, Su-In .

NATURE MACHINE INTELLIGENCE, 2020, 2 (01) :56-67

[10] Unsupervised quadratic surface support vector machine with application to credit risk assessment [J].

Luo, Jian ;

Yan, Xin ;

Tian, Ye .

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2020, 280 (03) :1008-1017

← 1 2 3 4 →