Improving Credit Card Fraud Detection with Data Reduction Approaches

被引:0
|
作者
Wang, Huanjing [1 ]
Hancock, John [2 ]
Khoshgoftaar, Taghi M. [2 ]
机构
[1] Western Kentucky Univ, Sch Engn & Appl Sci, 1906 Coll Hts Blvd, Bowling Green, KY 42101 USA
[2] Florida Atlantic Univ, Dept Elect Engn & Comp Sci, 777 Glades Rd, Boca Raton, FL 33431 USA
关键词
Ensemble supervised feature selection; random undersampling; data reduction; credit card fraud; class imbalance; SELECTION; NETWORK; MACHINE;
D O I
10.1142/S0218539324400011
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Detecting fraudulent activities in credit card transactions can be challenging due to issues like high dimensionality and class imbalance that are often present in the datasets. To address these challenges, data reduction techniques such as data sampling and feature selection have become essential. In this study, we compare four approaches for data reduction: using data sampling alone, employing feature selection alone, applying data sampling followed by feature selection, and using feature selection followed by data sampling. Additionally, we include results using all features. We build classification models using five Decision Tree-based classifiers and Logistic Regression, and evaluate their performance using two performance metrics: the Area Under the Receiver Operating Characteristic Curve (AUC) and the Area under the Precision-Recall Curve (AUPRC). In this work, we adopt ensemble supervised feature selection (SFS) techniques and Random Undersampling (RUS) for data reduction. The experimental results demonstrate that all four data reduction techniques have the potential to improve the performance of classifiers. These results are valuable since the classifiers available are dependent upon application domains, computing environments, and licensing agreements. However, these techniques can be applied independently of all these dependencies. We recommend utilizing the ensemble SFS followed by RUS (SFS-RUS) approach as the preferred data reduction method due to its ability to run feature selection and data sampling in parallel. Additionally, we find that XGBoost and CatBoost outperform other classifiers.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Improving the Data Quality for Credit Card Fraud Detection
    Jing, Rongrong
    Tian, Hu
    Li, Yidi
    Zhang, Xingwei
    Zheng, Xiaolong
    Zhang, Zhu
    Zeng, Daniel
    2020 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2020, : 175 - 180
  • [2] Real Time Data-Driven Approaches for Credit Card Fraud Detection
    Phuong Hanh Tran
    Kim Phuc Tran
    Truong Thu Huong
    Heuchenne, Cedric
    Phuong HienTran
    Thi Minh Huong Le
    2018 INTERNATIONAL CONFERENCE ON E-BUSINESS AND APPLICATIONS (ICEBA 2018), 2018, : 6 - 9
  • [3] Neural data mining for credit card fraud detection
    Guo, Tao
    Li, Gui-Yang
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3630 - 3634
  • [4] Improving Classification Performance in Credit Card Fraud Detection by Using New Data Augmentation
    Strelcenia, Emilija
    Prakoonwit, Simant
    AI, 2023, 4 (01) : 172 - 198
  • [5] Credit Card Fraud Detection
    Tiwari, Mohit
    Sharma, Vipul
    Bala, Devashish
    Devansh
    Kaushal, Dishant
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (02) : 1778 - 1789
  • [6] Distributed data mining in credit card fraud detection
    Chan, PK
    Fan, W
    Prodromidis, AL
    Stolfo, SJ
    IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (06): : 67 - 74
  • [7] Distributed data mining in credit card fraud detection
    Chan, Philip K.
    Fan, Wei
    Prodromidis, Andreas L.
    Stolfo, Salvatore J.
    IEEE Intelligent Systems and Their Applications, 14 (06): : 67 - 74
  • [8] Exploratory Data Analysis for Credit Card Fraud Detection
    Kirar, Jyoti Singh
    Kumar, Dhiraj
    Chatterjee, Diptirtha
    Patel, Prasoon Singh
    Yadav, Shailendra Nath
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 157 - 161
  • [9] Improving Credit Card Fraud Detection by Profiling and Clustering Accounts
    Kasa, Navin
    Dahbura, Andrew
    Ravoori, Charishma
    Adams, Stephen
    2019 SYSTEMS AND INFORMATION ENGINEERING DESIGN SYMPOSIUM (SIEDS), 2019, : 346 - 351
  • [10] Improving a credit card fraud detection system using genetic algorithm
    Ozcelik, M. Hamdi
    Isik, Mine
    Duman, Ekrem
    Cevik, Tugba
    2010 INTERNATIONAL CONFERENCE ON NETWORKING AND INFORMATION TECHNOLOGY (ICNIT 2010), 2010, : 436 - 440