Using data-driven methods to detect financial statement fraud in the real scenario

被引:3
作者
Zhou, Ying [1 ]
Xiao, Zhi [1 ,2 ]
Gao, Ruize [3 ,4 ]
Wang, Chang [5 ]
机构
[1] Chongqing Univ, Sch Econ & Business Adm, Chongqing 400030, Peoples R China
[2] Chongqing Univ, Chongqing Key Lab Logist, Chongqing 400030, Peoples R China
[3] Tsinghua Univ, Inst Econ, Sch Social Sci, Beijing 100084, Peoples R China
[4] Beijing Inst Math Sci & Applicat, Beijing 101408, Peoples R China
[5] Wuxi Univ, Sch Digital Econ & Management, Wuxi 214105, Peoples R China
基金
中国国家自然科学基金;
关键词
Fraud detection; Data -driven method; Class imbalance; DATA MINING TECHNIQUES; MANAGEMENT FRAUD; CORPORATE FRAUD;
D O I
10.1016/j.accinf.2024.100693
中图分类号
F [经济];
学科分类号
02 ;
摘要
This study seeks to explore the potential of data -driven methods for developing a financial statement fraud prediction model. We emphasize that building a fraud prediction model that can be used to detect fraud in real -world applications should receive attention from researchers. However, the severe class imbalance issue and the complex nature of fraudulent activities make it a rather challenging task. To address these problems, we apply the combinations of different sampling techniques and tree -based ensemble classifiers to an extensive set of raw financial statement data. The results show that the models using an extensive set of raw financial data, undersampling techniques and boosting tree classifiers are superior in fraud detection. Moreover, several features without a priori knowledge are identified to be important for fraud prediction models by feature importance evaluation. Accordingly, this study provides a methodological guide for designing fraud prediction models for real -world applications and serves as a preliminary step of the knowledge discovery process to complement fraud detection knowledge systems.
引用
收藏
页数:16
相关论文
共 59 条
[1]  
Abbasi A, 2012, MIS QUART, V36, P1293
[2]   Using machine learning Meta-Classifiers to detect financial frauds [J].
Achakzai, Muhammad Atif Khan ;
Juan, Peng .
FINANCE RESEARCH LETTERS, 2022, 48
[3]   Big Data, Data Science, and Analytics: The Opportunity and Challenge for IS Research [J].
Agarwal, Ritu ;
Dhar, Vasant .
INFORMATION SYSTEMS RESEARCH, 2014, 25 (03) :443-448
[4]  
[Anonymous], 2006, GESTS International Transactions on Computer Science and Engineering
[5]   Detecting Accounting Fraud in Publicly Traded US Firms Using a Machine Learning Approach [J].
Bao, Yang ;
Ke, Bin ;
Li, Bin ;
Yu, Y. Julia ;
Zhang, Jie .
JOURNAL OF ACCOUNTING RESEARCH, 2020, 58 (01) :199-235
[6]   Using machine learning to detect misstatements [J].
Bertomeu, Jeremy ;
Cheynel, Edwige ;
Floyd, Eric ;
Pan, Wenqiang .
REVIEW OF ACCOUNTING STUDIES, 2021, 26 (02) :468-519
[7]   Using Nonfinancial Measures to Assess Fraud Risk [J].
Brazel, Joseph F. ;
Jones, Keith L. ;
Zimbelman, Mark F. .
JOURNAL OF ACCOUNTING RESEARCH, 2009, 47 (05) :1135-1166
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   What Are You Saying? Using topic to Detect Financial Misreporting [J].
Brown, Nerissa C. ;
Crowley, Richard M. ;
Elliott, W. Brooke .
JOURNAL OF ACCOUNTING RESEARCH, 2020, 58 (01) :237-291
[10]   Detecting Management Fraud in Public Companies [J].
Cecchini, Mark ;
Aytug, Haldun ;
Koehler, Gary J. ;
Pathak, Praveen .
MANAGEMENT SCIENCE, 2010, 56 (07) :1146-1160