Multi-Relational Graph Representation Learning for Financial Statement Fraud Detection

被引:1
作者
Wang, Chenxu [1 ,2 ]
Wang, Mengqin [1 ]
Wang, Xiaoguang [1 ]
Zhang, Luyue [1 ]
Long, Yi [3 ]
机构
[1] Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, MoE Key Lab Intelligent Networks & Network Secur, Xian 710049, Peoples R China
[3] Chinese Univ Hong Kong Shenzhen CUHK Shenzhen, Shenzhen Finance Inst, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
financial statement fraud; class imbalance; Graph Neural Networks (GNN); multi-relational graphs; CORPORATE FRAUD; MORLET WAVELET; NEURAL-NETWORK; VECTOR MACHINE; DECISION TREE; DESIGN; FOREST;
D O I
10.26599/BDMA.2024.9020013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Financial statement fraud refers to malicious manipulations of financial data in listed companies' annual statements. Traditional machine learning approaches focus on individual companies, overlooking the interactive relationships among companies that are crucial for identifying fraud patterns. Moreover, fraud detection is a typical imbalanced binary classification task with normal samples outnumbering fraud ones. In this paper, we propose a multi-relational graph convolutional network, named FraudGCN, for detecting financial statement fraud. A multi-relational graph is constructed to integrate industrial, supply chain, and accounting-sharing relationships, effectively encapsulating the multidimensional and complex interactions among companies. We then develop a multi-relational graph convolutional network to aggregate information within each relationship and employ an attention mechanism to fuse information across multiple relationships. The attention mechanism enables the model to distinguish the importance of different relationships, thereby aggregating more useful information from key relationships. To alleviate the class imbalance problem, we present a diffusion-based under-sampling strategy that strategically selects key nodes globally for model training. We also employ focal loss to assign greater weights to harder-to-classify minority samples. We build a real-world dataset from the annual financial statement of listed companies in China. The experimental results show that FraudGCN achieves an improvement of 3.15% in Macro-recall, 3.36% in Macro-F1, and 3.86% in GMean compared to the second-best method. The dataset and codes are publicly available at: https://github.com/XNetLab/MRG-for-Finance.
引用
收藏
页码:920 / 941
页数:22
相关论文
共 69 条
[1]   Financial fraud detection applying data mining techniques: A comprehensive review from 2009 to 2019 [J].
Al-Hashedi, Khaled Gubran ;
Magalingam, Pritheega .
COMPUTER SCIENCE REVIEW, 2021, 40
[2]   Collective dynamics of stock market efficiency [J].
Alves, Luiz G. A. ;
Sigaki, Higor Y. D. ;
Perc, Matjaz ;
Ribeiro, Haroldo V. .
SCIENTIFIC REPORTS, 2020, 10 (01)
[3]   Data engineering for fraud detection [J].
Baesens, Bart ;
Hoppner, Sebastiaan ;
Verdonck, Tim .
DECISION SUPPORT SYSTEMS, 2021, 150
[4]   Detecting Accounting Fraud in Publicly Traded US Firms Using a Machine Learning Approach [J].
Bao, Yang ;
Ke, Bin ;
Li, Bin ;
Yu, Y. Julia ;
Zhang, Jie .
JOURNAL OF ACCOUNTING RESEARCH, 2020, 58 (01) :199-235
[5]  
Barandela R, 2004, LECT NOTES COMPUT SC, V3138, P806
[6]  
Barman S, 2016, International Journal of Trust Management in Computing and Communications, V3, P336, DOI [10.1504/ijtmcc.2016.084561, 10.1504/IJTMCC.2016.084561, DOI 10.1504/IJTMCC.2016.084561]
[7]   FINANCIAL RATIOS AS PREDICTORS OF FAILURE [J].
BEAVER, WH .
JOURNAL OF ACCOUNTING RESEARCH, 1966, 4 :71-111
[8]   Application of relevance vector machine and logistic regression for machine degradation assessment [J].
Caesarendra, Wahyu ;
Widodo, Achmad ;
Yang, Bo-Suk .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2010, 24 (04) :1161-1171
[9]   Making words work: Using financial text as a predictor of financial events [J].
Cecchini, Mark ;
Aytug, Haldun ;
Koehler, Gary J. ;
Pathak, Praveen .
DECISION SUPPORT SYSTEMS, 2010, 50 (01) :164-175
[10]   Detecting Management Fraud in Public Companies [J].
Cecchini, Mark ;
Aytug, Haldun ;
Koehler, Gary J. ;
Pathak, Praveen .
MANAGEMENT SCIENCE, 2010, 56 (07) :1146-1160