Improving Risk Predictions by Preprocessing Imbalanced Credit Data

被引:0
|
作者
Garcia, Vicente [1 ]
Isabel Marques, Ana [2 ]
Salvador Sanchez, Jose [1 ]
机构
[1] Univ Jaume 1, Inst New Imaging Technol, Dept Comp Languages & Syst, Av Vicent Sos Baynat S-N, Castellon de La Plana 12071, Spain
[2] Univ Jaume 1, Dep Business Adm & Mkt, Castellon de La Plana 12071, Spain
来源
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II | 2012年 / 7664卷
关键词
Credit scoring; Class imbalance; Classification; Resampling; Finance; CLASSIFICATION; DEFAULT; SMOTE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced credit data sets refer to databases in which the class of defaulters is heavily under-represented in comparison to the class of non-defaulters. This is a very common situation in real-life credit scoring applications, but it has still received little attention. This paper investigates whether data resampling can be used to improve the performance of learners built from imbalanced credit data sets, and whether the effectiveness of resampling is related to the type of classifier. Experimental results demonstrate that learning with the resampled sets consistently outperforms the use of the original imbalanced credit data, independently of the classifier used.
引用
收藏
页码:68 / 75
页数:8
相关论文
共 50 条
  • [21] Adaptively Promoting Diversity in a Novel Ensemble Method for Imbalanced Credit-Risk Evaluation
    Guo, Yitong
    Mei, Jie
    Pan, Zhiting
    Liu, Haonan
    Li, Weiwei
    MATHEMATICS, 2022, 10 (11)
  • [22] Granular Computing and Parameters Tuning in Imbalanced Data Preprocessing
    Borowska, Katarzyna
    Stepaniuk, Jaroslaw
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2018, 2018, 11127 : 233 - 245
  • [23] Support vector machines for credit risk assessment with imbalanced datasets
    Khemakhem, Sihem
    Boujelbene, Younes
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2018, 10 (02) : 171 - 187
  • [24] Improving class probability estimates for imbalanced data
    Byron C. Wallace
    Issa J. Dahabreh
    Knowledge and Information Systems, 2014, 41 : 33 - 52
  • [25] Improving class probability estimates for imbalanced data
    Wallace, Byron C.
    Dahabreh, Issa J.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (01) : 33 - 52
  • [26] Hybrid sampling for imbalanced data
    Seiffert, Chris
    Khoshgoftaar, Taghi M.
    Van Hulse, Jason
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2009, 16 (03) : 193 - 210
  • [27] The study of preprocessing methods' utility in analysis of multidimensional and highly imbalanced medical data
    Werner, Aleksandra
    Bach, Malgorzata
    Pluskiewicz, Wojciech
    PROCEEDINGS OF THE 11TH SCIENTIFIC CONFERENCE INTERNET IN THE INFORMATION SOCIETY 2016, 2016, : 71 - 87
  • [28] Credit Card Fraud Detection under Extreme Imbalanced Data: A Comparative Study of Data-level Algorithms
    Singh, Amit
    Ranjan, Ranjeet Kumar
    Tiwari, Abhishek
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2022, 34 (04) : 571 - 598
  • [29] Data Preprocessing and Dynamic Ensemble Selection for Imbalanced Data Stream Classification
    Zyblewski, Pawel
    Sabourin, Robert
    Wozniak, Michal
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 367 - 379
  • [30] Local resampling for locally weighted Naive Bayes in imbalanced data
    Saglam, Fatih
    Cengiz, Mehmet Ali
    COMPUTING, 2024, 106 (01) : 185 - 200