Intelligent Fraud Detection in Financial Statements Using Machine Learning and Data Mining: A Systematic Literature Review

被引:38
作者
Ashtiani, Matin N. [1 ]
Raahemi, Bijan [1 ]
机构
[1] Univ Ottawa, Telfer Sch Management, Knowledge Discovery & Data Min Lab, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Data mining; Systematics; Libraries; Bibliographies; Machine learning; Companies; Business; Fraud detection; financial statement; machine learning; data mining; outlier detection; systematic literature review; TEXT;
D O I
10.1109/ACCESS.2021.3096799
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fraudulent financial statements (FFS) are the results of manipulating financial elements by overvaluing incomes, assets, sales, and profits while underrating expenses, debts, or losses. To identify such fraudulent statements, traditional methods, including manual auditing and inspections, are costly, imprecise, and time-consuming. Intelligent methods can significantly help auditors in analyzing a large number of financial statements. In this study, we systematically review and synthesize the existing literature on intelligent fraud detection in corporate financial statements. In particular, the focus of this review is on exploring machine learning and data mining methods, as well as the various datasets that are studied for detecting financial fraud. We adopted the Kitchenham methodology as a well-defined protocol to extract, synthesize, and report the results. Accordingly, 47 articles were selected, synthesized, and analyzed. We present the key issues, gaps, and limitations in the area of fraud detection in financial statements and suggest areas for future research. Since supervised algorithms were employed more than unsupervised approaches like clustering, the future research should focus on unsupervised, semi-supervised, as well as bio-inspired and evolutionary heuristic methods for anomaly (fraud) detection. In terms of datasets, it is envisaged that future research making use of textual and audio data. While imposing new challenges, this unstructured data deserves further study as it can show interesting results for intelligent fraud detection.
引用
收藏
页码:72504 / 72525
页数:22
相关论文
共 82 条
[31]  
Gupta R, 2012, INT J ADV COMPUT SC, V3, P150
[32]  
Hajek P, 2019, P IFIP INT C ART INT, V15, P425, DOI DOI 10.1007/978-3-030-19823-7_36
[33]   Mining corporate annual reports for intelligent detection of financial statement fraud - A comparative study of machine learning methods [J].
Hajek, Petr ;
Henriques, Roberto .
KNOWLEDGE-BASED SYSTEMS, 2017, 128 :139-152
[34]   Analyzing Speech to Detect Financial Misreporting [J].
Hobson, Jessen L. ;
Mayew, William J. ;
Venkatachalam, Mohan .
JOURNAL OF ACCOUNTING RESEARCH, 2012, 50 (02) :349-392
[35]   Topological pattern discovery and feature extraction for fraudulent financial reporting [J].
Huang, Shin-Ying ;
Tsaih, Rua-Huan ;
Yu, Fang .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (09) :4360-4372
[36]   Identification of fraudulent financial statements using linguistic credibility analysis [J].
Humpherys, Sean L. ;
Moffitt, Kevin C. ;
Burns, Mary B. ;
Burgoon, Judee K. ;
Felix, William F. .
DECISION SUPPORT SYSTEMS, 2011, 50 (03) :585-594
[37]  
Ivakhnenko A. G., 1968, SOV AUTOM CONTROL, V13, P43
[38]   An Effective Financial Statements Fraud Detection Model for the Sustainable Development of Financial Markets: Evidence from Taiwan [J].
Jan, Chyan-long .
SUSTAINABILITY, 2018, 10 (02)
[39]  
Karlen S., 2016, P THER EUR FREQ TIM, P1
[40]   Using Active Learning Methods for Predicting Fraudulent Financial Statements [J].
Karlos, Stamatis ;
Kostopoulos, Georgios ;
Kotsiantis, Sotiris ;
Tampakas, Vassilis .
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2017, 2017, 744 :351-362