Using data-driven methods to detect financial statement fraud in the real scenario

被引:1
作者
Zhou, Ying [1 ]
Xiao, Zhi [1 ,2 ]
Gao, Ruize [3 ,4 ]
Wang, Chang [5 ]
机构
[1] Chongqing Univ, Sch Econ & Business Adm, Chongqing 400030, Peoples R China
[2] Chongqing Univ, Chongqing Key Lab Logist, Chongqing 400030, Peoples R China
[3] Tsinghua Univ, Inst Econ, Sch Social Sci, Beijing 100084, Peoples R China
[4] Beijing Inst Math Sci & Applicat, Beijing 101408, Peoples R China
[5] Wuxi Univ, Sch Digital Econ & Management, Wuxi 214105, Peoples R China
基金
中国国家自然科学基金;
关键词
Fraud detection; Data -driven method; Class imbalance; DATA MINING TECHNIQUES; MANAGEMENT FRAUD; CORPORATE FRAUD;
D O I
10.1016/j.accinf.2024.100693
中图分类号
F [经济];
学科分类号
02 ;
摘要
This study seeks to explore the potential of data -driven methods for developing a financial statement fraud prediction model. We emphasize that building a fraud prediction model that can be used to detect fraud in real -world applications should receive attention from researchers. However, the severe class imbalance issue and the complex nature of fraudulent activities make it a rather challenging task. To address these problems, we apply the combinations of different sampling techniques and tree -based ensemble classifiers to an extensive set of raw financial statement data. The results show that the models using an extensive set of raw financial data, undersampling techniques and boosting tree classifiers are superior in fraud detection. Moreover, several features without a priori knowledge are identified to be important for fraud prediction models by feature importance evaluation. Accordingly, this study provides a methodological guide for designing fraud prediction models for real -world applications and serves as a preliminary step of the knowledge discovery process to complement fraud detection knowledge systems.
引用
收藏
页数:16
相关论文
共 59 条
  • [1] Abbasi A, 2012, MIS QUART, V36, P1293
  • [2] Using machine learning Meta-Classifiers to detect financial frauds
    Achakzai, Muhammad Atif Khan
    Juan, Peng
    [J]. FINANCE RESEARCH LETTERS, 2022, 48
  • [3] Big Data, Data Science, and Analytics: The Opportunity and Challenge for IS Research
    Agarwal, Ritu
    Dhar, Vasant
    [J]. INFORMATION SYSTEMS RESEARCH, 2014, 25 (03) : 443 - 448
  • [4] [Anonymous], 2002, Account Rev
  • [5] Detecting Accounting Fraud in Publicly Traded US Firms Using a Machine Learning Approach
    Bao, Yang
    Ke, Bin
    Li, Bin
    Yu, Y. Julia
    Zhang, Jie
    [J]. JOURNAL OF ACCOUNTING RESEARCH, 2020, 58 (01) : 199 - 235
  • [6] Using machine learning to detect misstatements
    Bertomeu, Jeremy
    Cheynel, Edwige
    Floyd, Eric
    Pan, Wenqiang
    [J]. REVIEW OF ACCOUNTING STUDIES, 2021, 26 (02) : 468 - 519
  • [7] Using Nonfinancial Measures to Assess Fraud Risk
    Brazel, Joseph F.
    Jones, Keith L.
    Zimbelman, Mark F.
    [J]. JOURNAL OF ACCOUNTING RESEARCH, 2009, 47 (05) : 1135 - 1166
  • [8] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [9] What Are You Saying? Using topic to Detect Financial Misreporting
    Brown, Nerissa C.
    Crowley, Richard M.
    Elliott, W. Brooke
    [J]. JOURNAL OF ACCOUNTING RESEARCH, 2020, 58 (01) : 237 - 291
  • [10] Detecting Management Fraud in Public Companies
    Cecchini, Mark
    Aytug, Haldun
    Koehler, Gary J.
    Pathak, Praveen
    [J]. MANAGEMENT SCIENCE, 2010, 56 (07) : 1146 - 1160