An integrated multistage ensemble machine learning model for fraudulent transaction detection

被引:10
作者
Talukder, Md. Alamin [1 ]
Khalid, Majdi [2 ]
Uddin, Md Ashraf [3 ]
机构
[1] Int Univ Business Agr & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Umm Al Qura Univ, Coll Comp, Dept Comp Sci & Artificial Intelligence, Mecca 21955, Saudi Arabia
[3] Deakin Univ, Sch Informat Technol, Waurn Ponds Campus, Geelong, Australia
关键词
Credit card; Fraudulent transactions; Integrated multistage model; Machine learning; Fraudulent detection;
D O I
10.1186/s40537-024-00996-5
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fraudulent transactions continue to pose a concern for financial institutions and organizations, necessitating the development of effective detection tools. Identification and prevention of fraudulent transactions depend heavily on the detection of credit card fraud. Even though instances of credit card fraud are uncommon, they can nonetheless cause significant financial losses because of the high cost of fraudulent transactions. When fraud is discovered early on, investigators can act quickly to stop additional losses. But because the investigation process takes a while, there are only so many warnings that can be looked through in detail in a given day. Thus, a fraud detection model's main goal is to minimize false alarms and missed fraud situations while producing accurate alerts. To improve fraud identification, we provide in this study an integrated multistage ensemble Machine Learning (IMEML) model that incorporates various multistage ensemble models intelligently, such as Ensemble Independent Classifier (EIC), Ensemble Bagging Classifier (EBC), and Ensemble ML Classifier (EMC). In order to overcome the problem of data imbalance, we use a number of methods-including Instant Hardness Threshold with EMC (IHT+EMC), Cluster Centroids (CC), and Randon Under Sampler (RUS)-that go beyond traditional methods. We run our studies on a 284,807-transaction credit card dataset that is made available to the public. The accuracy rates of 99.94%, 99.91%, 99.14%, 99.52%, and perfect 100% for accuracy, precision, recall, f1-score, and AUC score, respectively, are achieved by the suggested model, demonstrating remarkable performance scores. For real-world fraud detection applications, the EIBMC model sets a new benchmark for identifying fraudulent transactions in high-frequency scenarios by outperforming cutting-edge techniques.
引用
收藏
页数:25
相关论文
共 21 条
[1]   Digital payment fraud detection methods in digital ages and Industry 4.0 [J].
Chang, Victor ;
Doan, Le Minh Thao ;
Di Stefano, Alessandro ;
Sun, Zhili ;
Fortino, Giancarlo .
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
[2]  
Chatterjee P, 2024, Future Generation Computer Systems
[3]   A Neural Network Ensemble With Feature Engineering for Improved Credit Card Fraud Detection [J].
Esenogho, Ebenezer ;
Mienye, Ibomoiye Domor ;
Swart, Theo G. ;
Aruleba, Kehinde ;
Obaido, George .
IEEE ACCESS, 2022, 10 :16400-16407
[4]  
Faraji Z., 2022, SEISENSE J Manag, V5, P49, DOI [10.33215/sjom.v5i1.770, DOI 10.33215/SJOM.V5I1.770]
[5]  
Ganji VR., 2012, IJCSE, V4, P1035
[6]  
Hammed M., 2020, INT J COMPUTER SCI I, V18, P79
[7]  
kaggle, MLG-ULB: Credit Card Fraud Dataset
[8]  
Lakshmi S., 2018, Int. J. Appl. Eng. Res, V13, P16819
[9]  
Mustapha R, 2024, PAK J LIFE SOC SCI, V22, DOI DOI 10.57239/PJLSS-2024-22.1.002
[10]  
Nakitende M G., 2024, Research Anthology on Business Law, Policy, and Social Responsibility, P848