Improving Risk Predictions by Preprocessing Imbalanced Credit Data

被引:0
作者
Garcia, Vicente [1 ]
Isabel Marques, Ana [2 ]
Salvador Sanchez, Jose [1 ]
机构
[1] Univ Jaume 1, Inst New Imaging Technol, Dept Comp Languages & Syst, Av Vicent Sos Baynat S-N, Castellon de La Plana 12071, Spain
[2] Univ Jaume 1, Dep Business Adm & Mkt, Castellon de La Plana 12071, Spain
来源
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II | 2012年 / 7664卷
关键词
Credit scoring; Class imbalance; Classification; Resampling; Finance; CLASSIFICATION; DEFAULT; SMOTE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced credit data sets refer to databases in which the class of defaulters is heavily under-represented in comparison to the class of non-defaulters. This is a very common situation in real-life credit scoring applications, but it has still received little attention. This paper investigates whether data resampling can be used to improve the performance of learners built from imbalanced credit data sets, and whether the effectiveness of resampling is related to the type of classifier. Experimental results demonstrate that learning with the resampled sets consistently outperforms the use of the original imbalanced credit data, independently of the classifier used.
引用
收藏
页码:68 / 75
页数:8
相关论文
共 50 条
  • [31] A graph-based semi-supervised reject inference framework considering imbalanced data distribution for consumer credit scoring
    Kang, Yanzhe
    Jia, Ning
    Cui, Runbang
    Deng, Jiang
    APPLIED SOFT COMPUTING, 2021, 105
  • [32] NEW HYBRID DATA PREPROCESSING TECHNIQUE FOR HIGHLY IMBALANCED DATASET
    Malik, Esraa Faisal
    Khaw, Khai Wah
    Chew, XinYing
    COMPUTING AND INFORMATICS, 2022, 41 (04) : 981 - 1001
  • [33] Enhancing credit card fraud detection: highly imbalanced data case
    Breskuviene, Dalia
    Dzemyda, Gintautas
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [34] A-RDBOTE: an improved oversampling technique for imbalanced credit-scoring datasets
    Lenka, Sudhansu R.
    Bisoy, Sukant Kishoro
    Priyadarshini, Rojalina
    RISK MANAGEMENT-AN INTERNATIONAL JOURNAL, 2023, 25 (04):
  • [35] Sustainable Fault Diagnosis of Imbalanced Text Mining for CTCS-3 Data Preprocessing
    Shi, Lijuan
    Li, Ang
    Zhang, Lei
    SUSTAINABILITY, 2021, 13 (04) : 1 - 14
  • [36] An Empirical Study on Preprocessing High-dimensional Class-imbalanced Data for Classification
    Yin, Hua
    Gai, Keke
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1314 - 1319
  • [37] A-SMOTE: A New Preprocessing Approach for Highly Imbalanced Datasets by Improving SMOTE
    Ahmed Saad Hussein
    Tianrui Li
    Chubato Wondaferaw Yohannese
    Kamal Bashir
    International Journal of Computational Intelligence Systems, 2019, 12 : 1412 - 1422
  • [38] A hybrid evolutionary preprocessing method for imbalanced datasets
    Wong, Ginny Y.
    Leung, Frank H. F.
    Ling, Sai-Ho
    INFORMATION SCIENCES, 2018, 454 : 161 - 177
  • [39] Credit risk prediction in an imbalanced social lending environment
    Anahita Namvar
    Mohammad Siami
    Fethi Rabhi
    Mohsen Naderpour
    International Journal of Computational Intelligence Systems, 2018, 11 : 925 - 935
  • [40] Credit risk prediction in an imbalanced social lending environment
    Namvar, Anahita
    Siami, Mohammad
    Rabhi, Fethi
    Naderpour, Mohsen
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2018, 11 (01) : 925 - 935