Enhancing Financial Fraud Detection through Addressing Class Imbalance Using Hybrid SMOTE-GAN Techniques

被引:7
作者
Cheah, Patience Chew Yee [1 ]
Yang, Yue [1 ]
Lee, Boon Giin [1 ]
机构
[1] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Peoples R China
来源
INTERNATIONAL JOURNAL OF FINANCIAL STUDIES | 2023年 / 11卷 / 03期
关键词
class imbalance; data generation; deep learning; financial fraud detection; ENSEMBLE;
D O I
10.3390/ijfs11030110
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
The class imbalance problem in finance fraud datasets often leads to biased prediction towards the nonfraud class, resulting in poor performance in the fraud class. This study explores the effects of utilizing the Synthetic Minority Oversampling TEchnique (SMOTE), a Generative Adversarial Network (GAN), and their combinations to address the class imbalance issue. Their effectiveness was evaluated using a Feed-forward Neural Network (FNN), Convolutional Neural Network (CNN), and their hybrid (FNN+CNN). This study found that regardless of the data generation techniques applied, the classifier's hyperparameters can affect classification performance. The comparisons of various data generation techniques demonstrated the effectiveness of the hybrid SMOTE and GAN, including SMOTified-GAN, SMOTE+GAN, and GANified-SMOTE, compared with SMOTE and GAN. The SMOTified-GAN and the proposed GANified-SMOTE were able to perform equally well across different amounts of generated fraud samples.
引用
收藏
页数:17
相关论文
共 35 条
  • [1] Cervical Cancer Diagnosis Using Random Forest Classifier With SMOTE and Feature Reduction Techniques
    Abdoh, Sherif F.
    Rizka, Mohamed Abo
    Maghraby, Fahima A.
    [J]. IEEE ACCESS, 2018, 6 : 59475 - 59485
  • [2] Computer-aided diagnosis of clinically significant prostate cancer from MRI images using sparse autoencoder and random forest classifier
    Abraham, Bejoy
    Nair, Madhu S.
    [J]. BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2018, 38 (03) : 733 - 744
  • [3] Credit Card Fraud Detection Using State-of-the-Art Machine Learning and Deep Learning Algorithms
    Alarfaj, Fawaz Khaled
    Malik, Iqra
    Khan, Hikmat Ullah
    Almusallam, Naif
    Ramzan, Muhammad
    Ahmed, Muzamil
    [J]. IEEE ACCESS, 2022, 10 : 39700 - 39715
  • [4] RN-SMOTE: Reduced Noise SMOTE based on DBSCAN for enhancing imbalanced data classification
    Arafa, Ahmed
    El-Fishawy, Nawal
    Badawy, Mohammed
    Radad, Marwa
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 5059 - 5074
  • [5] Interleaved Sequence RNNs for Fraud Detection
    Branco, Bernardo
    Abreu, Pedro
    Gomes, Ana Sofia
    Almeida, Mariana S. C.
    Ascensao, Joao Tiago
    Bizarro, Pedro
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3101 - 3109
  • [6] Bunkhumpornpat C, 2009, LECT NOTES ARTIF INT, V5476, P475, DOI 10.1007/978-3-642-01307-2_43
  • [7] Charitou C., 2021, arXiv
  • [8] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [9] Chen J. I.-Z., 2021, J. Artif. Intell. Capsule Netw., V3, P101
  • [10] Effective data generation for imbalanced learning using conditional generative adversarial networks
    Douzas, Georgios
    Bacao, Fernando
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 : 464 - 471