Building Resilience in Banking Against Fraud with Hyper Ensemble Machine Learning and Anomaly Detection Strategies

被引:0
作者
Vashistha A. [1 ]
Tiwari A.K. [1 ]
机构
[1] Department of Humanities and Social Sciences, National Institute of Technology, Raipur
关键词
Bank transactions; Financial; Fraud detection; Hyper-ensemble learning; Isolation forest; Neural network;
D O I
10.1007/s42979-024-02854-w
中图分类号
学科分类号
摘要
Traditional methods of fraud detection rely on rule-based systems or supervised machine learning models that require labelled data and domain knowledge. However, these methods have limitations such as high false positive rates, low scalability, and vulnerability to adversarial attacks. In this paper, a novel approach for bank fraud detection using hyper ensemble machine learning (HEML), which combines multiple unsupervised and semi-supervised models with different features and hyperparameters to achieve high accuracy and robustness, including—logistic regression (LR), decision tree (DT), support vector machine (SVM), neural network (NN), one-class SVM (OCSVM), and isolation forest (IF) are studied.The approach is evaluated on a real-world dataset of bank transactions from a large European bank and compared with several baseline methods.The accuracies of base learners and ensemble learners on the test data of LR, DT, SVM, NN, OCSVM and IF are as follows in order 0.95,0.91,0.96, 0.97, 0.93, 0.92. The results show that HEML outperforms the baselines in terms of precision, recall, F1-score, and AUC-ROC, while reducing the computational cost and human intervention. Additionally, the effectiveness of HEML in detecting new types of frauds that were not seen in the training data is demonstrated. Thus, HEML is a promising technique for bank fraud detection that can adapt to dynamic and complex fraud scenarios. By utilizing multiple models and features, HEML can provide accurate and robust fraud detection while reducing false positives and minimizing human intervention. By employing multiple models and features, HEML has the potential to improve the financial security and stability for both banks and their customers. Graphical Abstract: (Figure presented.) © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024.
引用
收藏
相关论文
共 31 条
[1]  
True cost of fraud study: Financial services and lending, Lexisnexis Risk Solutions, (2020)
[2]  
Carlini N., Wagner D., Kantchelian A., Adversarial examples are not easily detected: Bypassing ten detection methods, Proceedings of the 10Th ACM Workshop on Artificial Intelligence and Security (, pp. 3-14, (2017)
[3]  
Bhattacharyya S., Jha S., Tharakunnel K., Westland J.C., Data mining for credit card fraud: a comparative study, Decis Support Syst, 50, 3, pp. 602-613, (2011)
[4]  
Chandola V., Banerjee A., Kumar V., Anomaly detection: a survey, ACM Comput Surv (CSUR), 41, 3, pp. 1-58, (2009)
[5]  
Biggio B., Nelson B., Laskov P., Poisoning attacks against support vector machines, Proceedings of the 29Th International Conference on Machine Learning, (2012)
[6]  
Dietterich T.G., Ensemble methods in machine learning, International Workshop on Multiple Classifier Systems, pp. 1-15, (2000)
[7]  
Breiman L., Bagging predictors, Mach Learn, 24, 2, pp. 123-140, (1996)
[8]  
Freund Y., Schapire R.E., A decision-theoretic generalization of on-line learning and an application to boosting, J Comput Syst Sci, 55, 1, pp. 119-139, (1997)
[9]  
Breiman L., Random forests, Mach Learn, 45, 1, pp. 5-32, (2001)
[10]  
Wolpert D.H., Stacked generalization, Neural Netw, 5, 2, pp. 241-259, (1992)