Improving Risk Predictions by Preprocessing Imbalanced Credit Data

被引:0
作者
Garcia, Vicente [1 ]
Isabel Marques, Ana [2 ]
Salvador Sanchez, Jose [1 ]
机构
[1] Univ Jaume 1, Inst New Imaging Technol, Dept Comp Languages & Syst, Av Vicent Sos Baynat S-N, Castellon de La Plana 12071, Spain
[2] Univ Jaume 1, Dep Business Adm & Mkt, Castellon de La Plana 12071, Spain
来源
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II | 2012年 / 7664卷
关键词
Credit scoring; Class imbalance; Classification; Resampling; Finance; CLASSIFICATION; DEFAULT; SMOTE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced credit data sets refer to databases in which the class of defaulters is heavily under-represented in comparison to the class of non-defaulters. This is a very common situation in real-life credit scoring applications, but it has still received little attention. This paper investigates whether data resampling can be used to improve the performance of learners built from imbalanced credit data sets, and whether the effectiveness of resampling is related to the type of classifier. Experimental results demonstrate that learning with the resampled sets consistently outperforms the use of the original imbalanced credit data, independently of the classifier used.
引用
收藏
页码:68 / 75
页数:8
相关论文
共 50 条
  • [41] Learning to Undersampling for Class Imbalanced Credit Risk Forecasting
    Chi, Jianfeng
    Zeng, Guanxiong
    Zhong, Qiwei
    Liang, Ting
    Feng, Jinghua
    Xiang, Ao
    Tang, Jiayu
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 72 - 81
  • [42] Improving the Prediction Accuracy of Decision Tree Mining with Data Preprocessing
    Chandrasekar, Priyanga
    Qian, Kai
    Shahriar, Hossain
    Bhattacharya, Prabir
    2017 IEEE 41ST ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 2, 2017, : 481 - 484
  • [43] RMkNN and KNORA-IU: Combining Imbalanced Dynamic Selection Techniques for Credit Scoring
    Melo Junior, Leopoldo
    Macedo, Jose Fernandes
    Nardini, Franco Maria
    Renso, Chiara
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 823 - 830
  • [44] A focal-aware cost-sensitive boosted tree for imbalanced credit scoring
    Liu, Wanan
    Fan, Hong
    Xia, Min
    Xia, Meng
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208
  • [45] Influence-Balanced XGBoost: Improving XGBoost for Imbalanced Data Using Influence Functions
    Sutou, Akiyoshi
    Wang, Jinfang
    IEEE ACCESS, 2024, 12 : 193473 - 193486
  • [46] Deep reinforcement learning imbalanced credit risk of SMEs in supply chain finance
    Zhang, Wen
    Yan, Shaoshan
    Li, Jian
    Peng, Rui
    Tian, Xin
    ANNALS OF OPERATIONS RESEARCH, 2024,
  • [47] An exploration of learning when data is noisy and imbalanced
    Van Hulse, Jason
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    INTELLIGENT DATA ANALYSIS, 2011, 15 (02) : 215 - 236
  • [48] Benchmarking state-of-the-art imbalanced data learning approaches for credit scoring
    Jiang, Cuiqing
    Lu, Wang
    Wang, Zhao
    Ding, Yong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [49] Improving Credit Risk Prediction in Online Peer-to-Peer (P2P) Lending Using Imbalanced Learning Techniques
    Boiko Ferreira, Luis Eduardo
    Barddal, Jean Paul
    Enembreck, Fabricio
    Gomes, Heitor Murilo
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 175 - 181
  • [50] An efficient fraud detection framework with credit card imbalanced data in financial services
    El-Naby, Aya Abd
    Hemdan, Ezz El-Din
    El-Sayed, Ayman
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 4139 - 4160