Enhanced autoencoder-based fraud detection: a novel approach with noise factor encoding and SMOTE

被引:3
作者
Cakir, Mert Yilmaz [1 ]
Sirin, Yahya [1 ]
机构
[1] Istanbul Sabahattin Zaim Univ, Comp Sci & Engn, TR-34303 Istanbul, Turkiye
关键词
Fraud detection; Noise factor encoding; Autoencoder; Variational autoencoder; Contractive autoencoder; SMOTE; CREDIT; DIMENSIONALITY;
D O I
10.1007/s10115-023-02016-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fraud detection is a critical task across various domains, requiring accurate identification of fraudulent activities within vast arrays of transactional data. The significant challenges in effectively detecting fraud stem from the inherent class imbalance between normal and fraudulent instances. To address this issue, we propose a novel approach that combines autoencoder-based noise factor encoding (NFE) with the synthetic minority oversampling technique (SMOTE). Our study evaluates the efficacy of this approach using three datasets with severe class imbalance. We compare three autoencoder variants-autoencoder (AE), variational autoencoder (VAE), and contractive autoencoder (CAE)-enhanced by the NFE technique. This technique involves training autoencoder models on real fraud data with an added noise factor during the encoding process, followed by combining this altered data with genuine fraud data. Subsequently, SMOTE is employed for oversampling. Through extensive experimentation, we assess various evaluation metrics. Our results demonstrate the superiority of the autoencoder-based NFE approach over the use of traditional oversampling methods like SMOTE alone. Specifically, the AE-NFE method outperforms other techniques in most cases, although the VAE-NFE and CAE-NFE methods also exhibit promising results in specific scenarios. This study highlights the effectiveness of leveraging autoencoder-based NFE and SMOTE for fraud detection. By addressing class imbalance and enhancing the performance of fraud detection models, our approach enables more accurate identification and prevention of fraudulent activities in real-world applications.
引用
收藏
页码:635 / 652
页数:18
相关论文
共 50 条
[31]   Telecom Fraud Detection Based on Feature Binning and Autoencoder [J].
Liang, Fei-Yao ;
Li, Fei-Peng ;
Xu, Rong-Hai ;
Cheng, Wei ;
Deng, Shi-Xian ;
Yang, Zhe-Rui ;
Wang, Chang-Dong .
23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, :368-377
[32]   A convolutional autoencoder-based approach with batch normalization for energy disaggregation [J].
Chen, Huan ;
Wang, Yue-Hsien ;
Fan, Chun-Hung .
JOURNAL OF SUPERCOMPUTING, 2021, 77 (03) :2961-2978
[33]   Minimum interpretation by autoencoder-based serial and enhanced mutual information production [J].
Ryotaro Kamimura .
Applied Intelligence, 2020, 50 :2423-2448
[34]   Minimum interpretation by autoencoder-based serial and enhanced mutual information production [J].
Kamimura, Ryotaro .
APPLIED INTELLIGENCE, 2020, 50 (08) :2423-2448
[35]   Enhanced autoencoder-based LiDAR localization in self-driving vehicles [J].
Charroud, Anas ;
El Moutaouakil, Karim ;
Palade, Vasile ;
Yahyaouy, Ali .
APPLIED SOFT COMPUTING, 2024, 152
[36]   Dynamic Texture Classification Using AutoEncoder-Based Local Features and Fisher Vector Encoding [J].
Li, Zhe ;
Zhao, Xiaochao ;
Zhang, Tianfan ;
Jing, Xiao ;
Shi, Wei ;
Chen, Qian .
IEEE ACCESS, 2024, 12 :90768-90781
[37]   A novel SMOTE-based resampling technique trough noise detection and the boosting procedure [J].
Saglam, Fatih ;
Cengiz, Mehmet Ali .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
[38]   A Semi-Supervised Autoencoder-Based Approach for Protein Function Prediction [J].
Dhanuka, Richa ;
Tripathi, Anushree ;
Singh, Jyoti P. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (10) :4957-4965
[39]   VASP: An autoencoder-based approach for multivariate anomaly detection and robust time series prediction with application in motorsport [J].
von Schleinitz, Julian ;
Graf, Michael ;
Trutschnig, Wolfgang ;
Schroeder, Andreas .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
[40]   Training Strategies for Autoencoder-based Detection of False Data Injection Attacks [J].
Wang, Chenguang ;
Pan, Kaikai ;
Tindemans, Simon ;
Palensky, Peter .
2020 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE 2020): SMART GRIDS: KEY ENABLERS OF A GREEN POWER SYSTEM, 2020, :1-5