A Comparison of Data Sampling Techniques for Credit Card Fraud Detection

被引:0
|
作者
Muaz, Abdulla [1 ]
Jayabalan, Manoj [2 ]
Thiruchelvam, Vinesh [1 ]
机构
[1] Asia Pacific Univ Technol & Innovat, Sch Comp, Kuala Lumpur, Malaysia
[2] Liverpool John Moores Univ, Fac Engn & Technol, Liverpool, Merseyside, England
关键词
Data imbalance; credit card fraud; sampling techniques;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Credit Card fraud is a tough reality that continues to constrain the financial sector and its detrimental effects are felt across the entire financial market. Criminals are continuously on the lookout for ingenious methods for such fraudulent activities and are a real threat to security. Therefore, there is a need for early detection of fraudulent activity to preserve customer trust and safeguard their business. A major challenge faced in designing fraud detection systems is dealing with the class imbalance issue in the data since genuine transactions outnumber the fraudulent transactions typically account less than 1% of the total transactions. This is an important area of study as the positive case (fraudulent case) is hard to distinguish and becomes even harder with the inflow of data where the representation of such cases even decreases further. This study trained four predictive models, Artificial Neural Network (ANN), Gradient Boosting Machine (GBM) and Random Forest (RF) on different sampling methods. Random Under Sampling (RUS), Synthetic Minority Over-sampling Technique (SMOTE), Density-Based Synthetic Minority Over-Sampling Technique (DBSMOTE) and SMOTE combined with Edited Nearest Neighbour (SMOTEENN) was used for all models. The findings of this study indicate promising results with SMOTE based sampling techniques. The best recall score obtained was with SMOTE sampling strategy by DRF classifier at 0.81. The precision score for this classifier was observed to be 0.86. Stacked Ensemble was trained for all the sampled datasets and found to have the best average performance at 0.78. The Stacked Ensemble model has shown promise in the detection of fraudulent transactions across most of the sampling strategies.
引用
收藏
页码:477 / 485
页数:9
相关论文
共 50 条
  • [1] Survey of Credit Card Anomaly and Fraud Detection Using Sampling Techniques
    Alamri, Maram
    Ykhlef, Mourad
    ELECTRONICS, 2022, 11 (23)
  • [2] A Comparison of Data Balancing Techniques for Credit Card Fraud Detection using Neural Network
    Uttam, Atul Kumar
    Sharma, Gaurav
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 1136 - 1140
  • [3] The effect of feature extraction and data sampling on credit card fraud detection
    Zahra Salekshahrezaee
    Joffrey L. Leevy
    Taghi M. Khoshgoftaar
    Journal of Big Data, 10
  • [4] The effect of feature extraction and data sampling on credit card fraud detection
    Salekshahrezaee, Zahra
    Leevy, Joffrey L.
    Khoshgoftaar, Taghi M.
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [5] Credit Card Fraud Detection Using Anomaly Techniques
    Sharmila, V. Ceronmani
    Kumar, Kiran R.
    Sundaram, R.
    Samyuktha, D.
    Harish, R.
    PROCEEDINGS OF 2019 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION AND COMMUNICATION TECHNOLOGY (ICIICT 2019), 2019,
  • [6] A Critical review of Credit Card Fraud Detection Techniques
    Al Smadi, Baker
    Min, Manki
    2020 11TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2020, : 732 - 736
  • [7] Federated learning model for credit card fraud detection with data balancing techniques
    Abdul Salam, Mustafa
    Fouad, Khaled M.
    Elbably, Doaa L.
    Elsayed, Salah M.
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (11): : 6231 - 6256
  • [8] Federated learning model for credit card fraud detection with data balancing techniques
    Mustafa Abdul Salam
    Khaled M. Fouad
    Doaa L. Elbably
    Salah M. Elsayed
    Neural Computing and Applications, 2024, 36 : 6231 - 6256
  • [9] Comparison with Parametric Optimization in Credit Card Fraud Detection
    Gadi, Manoel Fernando Alonso
    Wang, Xidi
    do Lago, Alair Pereira
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 279 - +
  • [10] Neural data mining for credit card fraud detection
    Guo, Tao
    Li, Gui-Yang
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3630 - 3634