Performance of Different Machine Learning Algorithms in Detecting Financial Fraud

被引:12
作者
Alsuwailem, Alhanouf Abdulrahman Saleh [1 ]
Salem, Emad [2 ]
Saudagar, Abdul Khader Jilani [1 ]
机构
[1] Imam Mohammad Ibn Saud Islamic Univ IMSIU, Coll Comp & Informat Sci, Informat Syst Dept, Riyadh, Saudi Arabia
[2] Inst Publ Adm, Dept Stat, Riyadh, Saudi Arabia
关键词
Money laundering; Fraud detection; Classifiers; Machine learning;
D O I
10.1007/s10614-022-10314-x
中图分类号
F [经济];
学科分类号
02 ;
摘要
This research investigates how the problem of money laundering (ML) can be detected in Saudi Arabia with supervised machine learning, specifically at two levels: the establishment-level means that each establishment in the dataset only has one unique record, while the annual level means each establishment has four main records for each year from 2016 to 2019. The main contribution of this study is to show how effective applying machine learning is in detecting ML activities in establishments. It helps to improve the detection process to be in a proactive manner. This research also considers the significance of machine learning techniques in improving the work of the Financial Intelligent Unit, lowering the risks and consequences of financial crime, and fulfilling the Financial Action Task Force's priorities. The Saudi General Organization for Social Insurance contributed the data used in this study from 2016 to 2019. The data pertains to medium and small establishments, it is classified using supervised machine learning algorithms [Random Forest (RF), Decision Tree (DT), Gradient Boosting (GB), and Nearest Neighbor (KNN)]. Each classifier's performance was assessed in terms of accuracy, precision, recall, fi-measure, and area under the curve. The main findings show that the RF classifier provided the best result with 93% accuracy for the establishment level by classifying the establishments and assigning classes for them based on risk levels. Then, the DT achieved an accuracy of 90%, GB and KNN are 74% and 87%, respectively. While at the annual level, the DT and RF are both achieved the same accuracy with 98%, then GB with 92% and 97% for KNN. This research was written due to its importance in improving the investigation process in Saudi Arabia and performing a deep analysis for the establishments that play the main role in passing illegal activities including ML under their umbrella.
引用
收藏
页码:1631 / 1667
页数:37
相关论文
共 32 条
[1]  
About-Financial Action Task Force (FATF), 2019, US
[2]  
Alarab Ismail, 2020, 2020 5 INT C MACH LE, P11, DOI DOI 10.1145/3409073.3409078
[3]  
Alexandre Claudio, 2015, 7th International Conference on Agents and Artificial Intelligence (ICAART 2015). Proceedings, P230
[4]  
Almeida, 2009, THESIS ENGENHARIA IN, P64
[5]   Anti-money laundering systems: a systematic literature review [J].
Alsuwailem, Alhanouf Abdulrahman Saleh ;
Saudagar, Abdul Khader Jilani .
JOURNAL OF MONEY LAUNDERING CONTROL, 2020, 23 (04) :833-848
[6]  
[Anonymous], 2021, ADJ R SQUAR OV IT WO
[7]  
[Anonymous], 2017, OVERFITTING MACHINE
[8]  
[Anonymous], 2019, SAUDI ARABIA FULL ME
[9]  
[Anonymous], 2020, NITAQAT MANUAL ORIGI
[10]  
[Anonymous], 2021, NATL CLASSIFICATION