Enhancing Credit Card Fraud Detection Through a Novel Ensemble Feature Selection Technique

被引:10
作者
Wang, Huanjing [1 ]
Liang, Qianxin [2 ]
Hancock, John T., III [2 ]
Khoshgoftaar, Taghi M. [2 ]
机构
[1] Western Kentucky Univ, Bowling Green, KY 42101 USA
[2] Florida Atlantic Univ, Boca Raton, FL USA
来源
2023 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI | 2023年
关键词
Ensemble Supervised Feature Selection; Ensemble Threshold-Based Feature Selection; Credit Card Fraud; Highly Class Imbalance; ALGORITHMS; MACHINE;
D O I
10.1109/IRI58017.2023.00028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying fraudulent activities in credit card transactions is an inherent component of financial computing. The focus of our research is on the Credit Card Fraud Detection Dataset, which is widely used due to its authentic transaction data. In numerous machine learning applications, feature selection has become a crucial step. To improve the chance of discovering the globally optimal feature set, we employ ensembles of feature ranking methods. These ensemble methods merge multiple feature ranking lists through a median approach. We conduct a comprehensive empirical study that examines two different ensembles of feature ranking techniques, including an ensemble of twelve threshold-based feature selection (TBFS) techniques and an ensemble of five supervised feature selection (SFS) techniques. Additionally, we present results where all features are used. We construct classification models using two Decision Tree-based classifiers, CatBoost and XGBoost, and evaluate them using two different performance metrics, the Area Under the Receiver Operating Characteristic Curve (AUC) and the Area under the Precision-Recall Curve (AUPRC). Since AUPRC provides a more accurate representation of the number of false positives, especially for highly imbalanced datasets, evaluating models for AUPRC is a wise choice. The experimental results demonstrate that the ensemble of SFS and all features performs similarly or better than the ensemble of TBFS. Moreover, we find that XGBoost outperforms CatBoost in terms of AUPRC.
引用
收藏
页码:121 / 126
页数:6
相关论文
共 50 条
[41]   CONDITIONAL WEIGHTED TRANSACTION AGGREGATION FOR CREDIT CARD FRAUD DETECTION [J].
Lim, Wee-Yong ;
Sachan, Amit ;
Thing, Vrizlynn .
ADVANCES IN DIGITAL FORENSICS X, 2014, 433 :3-16
[42]   Credit Card Fraud Detection Using Convolutional Neural Networks [J].
Fu, Kang ;
Cheng, Dawei ;
Tu, Yi ;
Zhang, Liqing .
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949 :483-490
[43]   Credit Card Fraud Detection with Automated Machine Learning Systems [J].
Plakandaras, Vasilios ;
Gogas, Periklis ;
Papadimitriou, Theophilos ;
Tsamardinos, Ioannis .
APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[44]   A Hybrid Machine Learning Approach for Credit Card Fraud Detection [J].
Gupta, Sonam ;
Varshney, Tushtee ;
Verma, Abhinav ;
Goel, Lipika ;
Yadav, Arun Kumar ;
Singh, Arjun .
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY PROJECT MANAGEMENT, 2022, 13 (03)
[45]   Credit card fraud detection using asexual reproduction optimization [J].
Ghahfarokhi, Anahita Farhang ;
Mansouri, Taha ;
Moghaddam, Mohammad Reza Sadeghi ;
Bahrambeik, Nila ;
Yava, Ramin ;
Sani, Mohammadreza Fani .
KYBERNETES, 2022, 51 (09) :2852-2876
[46]   BLAST-SSAHA Hybridization for Credit Card Fraud Detection [J].
Kundu, Amlan ;
Panigrahi, Suvasini ;
Sural, Shamik ;
Majumdar, Arun K. .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2009, 6 (04) :309-315
[47]   Review of Machine Learning Approach on Credit Card Fraud Detection [J].
Rejwan Bin Sulaiman ;
Vitaly Schetinin ;
Paul Sant .
Human-Centric Intelligent Systems, 2022, 2 (1-2) :55-68
[48]   A Comparison of Data Sampling Techniques for Credit Card Fraud Detection [J].
Muaz, Abdulla ;
Jayabalan, Manoj ;
Thiruchelvam, Vinesh .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (06) :477-485
[49]   Using Variational Auto Encoding in Credit Card Fraud Detection [J].
Tingfei, Huang ;
Guangquan, Cheng ;
Kuihua, Huang .
IEEE ACCESS, 2020, 8 :149841-149853
[50]   Application of support vector machines on credit card fraud detection for new card users [J].
Chen, Rong-Chang ;
Chen, Tung-Shou ;
Chen, Lin-Ti ;
Huang, Ya-Li ;
Lai, Li-June .
Proceedings of the Third International Conference on Information and Management Sciences, 2004, 3 :406-410