共 34 条
An interpretable credit risk assessment model with boundary sample identification
被引:0
作者:
Zhang, Runchi
[1
]
Li, Iris
[2
]
Ding, Zhiyuan
[3
]
机构:
[1] Nanjing Univ Posts & Telecommun, Sch Econ, Nanjing, Jiangsu, Peoples R China
[2] NYU, Courant Inst Math Sci, New York, NY USA
[3] Franklin & Marshall Coll, Social Sci, Lancaster City, PA USA
基金:
中国国家自然科学基金;
关键词:
Credit risk assessment;
Interpretability;
Boundary samples;
Noise samples;
MACHINE;
FINANCE;
D O I:
10.7717/peerj-cs.2988
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Background: Interpretability is a key requirement for ensuring that credit risk assessment models are trustworthy and compliant with regulatory standards. Simultaneously, effectively distinguishing between noise samples and boundary samples is crucial for improving the accuracy of credit risk predictions. Methods: This article introduces a novel credit risk assessment model, Interpretable Credit Risk Assessment Model with Identifying Boundary Samples (IAIBS). The model begins with a logistic regression sub-model that offers strong self-interpretable features. For samples that are not correctly classified, the Attribute Recognition and Perception based on the Distribution of neighboring sample features (ARPD) algorithm is applied to filter out noisy samples and identify boundary samples. A deep learning sub-model is then trained to deeply learn the risk features of these boundary samples. Finally, representative features of all samples are extracted using agglomerative clustering, and the most suitable sub-model is selected for prediction based on the similarity between each sample and the cluster centers. Results: Experimental results on four public datasets demonstrate that the IAIBS model significantly outperforms 11 baseline models, as confirmed by the Nemenyi test. The model achieved area under the curve (AUC) scores of 89.17, 79.86, 97.48, and 66.03 on the PCL, FICO, CCF, and VL datasets, respectively. With appropriate parameter tuning, the IAIBS model maintains strong generalization ability, and each module contributes positively to overall performance. Additionally, the IAIBS model effectively interprets key predictors and prediction outcomes.
引用
收藏
页数:27
相关论文