Predicting tax fraud using supervised machine learning approach

被引:6
作者
Murorunkwere, Belle Fille [1 ]
Haughton, Dominique [2 ]
Nzabanita, Joseph [3 ]
Kipkogei, Francis [4 ]
Kabano, Ignace [5 ]
机构
[1] Univ Rwanda, African Ctr Excellence Data Sci, Rwanda Revenue Author, Kigali, Rwanda
[2] Univ Toulouse TSE R 1, Univ Paris 1 SAMM, Toulouse, France
[3] Univ Rwanda, Coll Sci & Technol, Sch Sci, Kigali, Rwanda
[4] Stepwise Inc, Zalda, Nairobi, Kenya
[5] Univ Rwanda, Coll Business & Econ, African Ctr Excellence Data Sci, Kigali, Rwanda
关键词
tax fraud; fraud detection; features importance; supervised machine-learning models; evaluation metrics;
D O I
10.1080/20421338.2023.2187930
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the advancement in technology, the tax base in Rwanda has become broader, and as a result, tax fraud is growing. Depending on the dataset used, fraud detection experts and researchers have used different methods to identify questionable cases. This paper aims to predict features of tax fraud using the most robust supervised machine-learning model. This research provides a context where a fraud expert can use a machine-learning model, and an implemented model offers instant feedback to the fraud expert. We evaluate supervised machine learning models such as Artificial Neural Network, Logistic Regression, Decision Tree, Random Forest, GaussianNB and XGBoost. Based on different evaluation metrics, Artificial Neural Network was the most robust model for predicting tax fraud. Findings reveal that the time of business that indicates the difference in time from when a business started and the time it was audited, the domestic businesses, taxpayers who import and export goods, those with no losses, those whose businesses are located in the eastern province, and those registered on withholding and Value Added Tax types are more susceptible to tax fraud. This study is among the few to evaluate the effectiveness of multiple supervised machine-learning models for identifying tax fraud factors on an accurate data set with numerous tax types. The evidence generated in the current study will serve as a valuable tool for both tax policymakers and auditors, as well as for enhancing awareness of more robust methods for predicting tax fraud.
引用
收藏
页码:731 / 742
页数:12
相关论文
共 50 条
[1]   A Multi-Module Machine Learning Approach to Detect Tax Fraud [J].
Alsadhan N. .
Computer Systems Science and Engineering, 2023, 46 (01) :241-253
[2]   Fraud Detection using Machine Learning and Deep Learning [J].
Raghavan, Pradheepan ;
El Gayar, Neamat .
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, :335-340
[3]   Fraud Detection Using Machine Learning and Deep Learning [J].
Gandhar A. ;
Gupta K. ;
Pandey A.K. ;
Raj D. .
SN Computer Science, 5 (5)
[4]   Fraud Detection in Blockchains using Machine Learning [J].
Kilic, Baran ;
Sen, Alper ;
Ozturan, Can .
2022 FOURTH INTERNATIONAL CONFERENCE ON BLOCKCHAIN COMPUTING AND APPLICATIONS (BCCA), 2022, :214-218
[5]   Predicting financial reports fraud by machine learning: the proxy of auditor opinions [J].
Thanh, Cuong Nguyen ;
Huy, Tam Phan .
COGENT BUSINESS & MANAGEMENT, 2025, 12 (01)
[6]   Fraud Detection in Big Data using Supervised and Semi-supervised Learning Techniques [J].
Melo-Acosta, German E. ;
Duitama-Munoz, Freddy ;
Arias-Londono, Julian D. .
2017 IEEE COLOMBIAN CONFERENCE ON COMMUNICATIONS AND COMPUTING (COLCOM), 2017,
[7]   Detecting Credit Card Fraud using Machine Learning [J].
Almuteer A.H. ;
Aloufi A.A. ;
Alrashidi W.O. ;
Alshobaili J.F. ;
Ibrahim D.M. .
International Journal of Interactive Mobile Technologies, 2021, 15 (24) :108-122
[8]   Medicare Fraud Detection using Machine Learning Methods [J].
Bauder, Richard A. ;
Khoshgoftaar, Taghi M. .
2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, :858-865
[9]   Fraud detection in capital markets: A novel machine learning approach [J].
Yi, Ziwei ;
Cao, Xinwei ;
Pu, Xujin ;
Wu, Yiding ;
Chen, Zuyan ;
Khan, Ameer Tamoor ;
Francis, Adam ;
Li, Shuai .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
[10]   Fraud Claims Detection in Insurance Using Machine Learning [J].
Kalra, Hritik ;
Singh, Ranvir ;
Kumar, T. Senthil .
JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 :327-331