Predicting tax fraud using supervised machine learning approach

被引:2
|
作者
Murorunkwere, Belle Fille [1 ]
Haughton, Dominique [2 ]
Nzabanita, Joseph [3 ]
Kipkogei, Francis [4 ]
Kabano, Ignace [5 ]
机构
[1] Univ Rwanda, African Ctr Excellence Data Sci, Rwanda Revenue Author, Kigali, Rwanda
[2] Univ Toulouse TSE R 1, Univ Paris 1 SAMM, Toulouse, France
[3] Univ Rwanda, Coll Sci & Technol, Sch Sci, Kigali, Rwanda
[4] Stepwise Inc, Zalda, Nairobi, Kenya
[5] Univ Rwanda, Coll Business & Econ, African Ctr Excellence Data Sci, Kigali, Rwanda
来源
AFRICAN JOURNAL OF SCIENCE TECHNOLOGY INNOVATION & DEVELOPMENT | 2023年 / 15卷 / 06期
关键词
tax fraud; fraud detection; features importance; supervised machine-learning models; evaluation metrics;
D O I
10.1080/20421338.2023.2187930
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the advancement in technology, the tax base in Rwanda has become broader, and as a result, tax fraud is growing. Depending on the dataset used, fraud detection experts and researchers have used different methods to identify questionable cases. This paper aims to predict features of tax fraud using the most robust supervised machine-learning model. This research provides a context where a fraud expert can use a machine-learning model, and an implemented model offers instant feedback to the fraud expert. We evaluate supervised machine learning models such as Artificial Neural Network, Logistic Regression, Decision Tree, Random Forest, GaussianNB and XGBoost. Based on different evaluation metrics, Artificial Neural Network was the most robust model for predicting tax fraud. Findings reveal that the time of business that indicates the difference in time from when a business started and the time it was audited, the domestic businesses, taxpayers who import and export goods, those with no losses, those whose businesses are located in the eastern province, and those registered on withholding and Value Added Tax types are more susceptible to tax fraud. This study is among the few to evaluate the effectiveness of multiple supervised machine-learning models for identifying tax fraud factors on an accurate data set with numerous tax types. The evidence generated in the current study will serve as a valuable tool for both tax policymakers and auditors, as well as for enhancing awareness of more robust methods for predicting tax fraud.
引用
收藏
页码:731 / 742
页数:12
相关论文
共 50 条
  • [1] Predicting Credit Card Fraud using Supervised Machine Learning Methods: Comparative Analysis
    Altan, Guener
    Zafer, Metin Recep
    JOURNAL OF ECONOMIC POLICY RESEARCHES-IKTISAT POLITIKASI ARASTIRMALARI DERGISI, 2024, 11 (02): : 242 - 262
  • [2] Detecting insurance fraud using supervised and unsupervised machine learning
    Debener, Joern
    Heinke, Volker
    Kriebel, Johannes
    JOURNAL OF RISK AND INSURANCE, 2023, 90 (03) : 743 - 768
  • [3] Predicting Fraud Victimization Using Classical Machine Learning
    Lokanan, Mark
    Liu, Susan
    ENTROPY, 2021, 23 (03) : 1 - 19
  • [4] A Multi-Module Machine Learning Approach to Detect Tax Fraud
    Alsadhan N.
    Computer Systems Science and Engineering, 2023, 46 (01): : 241 - 253
  • [5] Tax Fraud Detection for Under-Reporting Declarations Using an Unsupervised Machine Learning Approach
    de Roux, Daniel
    Perez, Boris
    Moreno, Andres
    del Pilar Villamil, Maria
    Figueroa, Cesar
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 215 - 222
  • [6] Predicting news deserts using supervised machine learning
    Paladhi, Arijit
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2025, 8 (02):
  • [7] Predicting cancer using supervised machine learning: Mesothelioma
    Choudhury, Avishek
    TECHNOLOGY AND HEALTH CARE, 2021, 29 (01) : 45 - 58
  • [8] Cyber Fraud Prediction with Supervised Machine Learning Techniques
    Li, Zhoulin
    Zhang, Hao
    Masum, Mohammad
    Shahriar, Hossain
    Haddad, Hisham
    ACMSE 2020: PROCEEDINGS OF THE 2020 ACM SOUTHEAST CONFERENCE, 2020, : 176 - 180
  • [9] How Useful Are Tax Disclosures in Predicting Effective Tax Rates? A Machine Learning Approach
    Guenther, David A.
    Peterson, Kyle
    Searcy, Jake
    Williams, Brian M.
    ACCOUNTING REVIEW, 2023, 98 (05): : 297 - 322
  • [10] Predicting survival of pancreatic cancer using supervised machine learning
    Osman, M. H.
    ANNALS OF ONCOLOGY, 2018, 29