AntiBenford Subgraphs: Unsupervised Anomaly Detection in Financial Networks

被引:33
作者
Chen, Tianyi [1 ]
Tsourakakis, Charalampos [1 ]
机构
[1] Boston Univ, Boston, MA 02215 USA
来源
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年
关键词
anomaly detection; financial networks; dense subgraph; DENSEST SUBGRAPH; ALGORITHM;
D O I
10.1145/3534678.3539100
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Benford's law describes the distribution of the first digit of numbers appearing in a wide variety of numerical data, including tax records, and election outcomes, and has been used to raise "red flags" about potential anomalies in the data such as tax evasion. In this work, we ask the following novel question: Given a large transaction or financial graph, how do we find a set of nodes that perform many transactions among each other that also deviate significantly from Benford's law? We propose the AntiBenford subgraph framework that is founded on well-established statistical principles. Furthermore, we design an efficient algorithm that finds AntiBenford subgraphs in nearlinear time on real data. We evaluate our framework on both real and synthetic data against a variety of competitors. We showempirically that our proposed framework enables the detection of anomalous subgraphs in cryptocurrency transaction networks that go undetected by state-of-the-art graph-based anomaly detection methods. Our empirical findings show that our AntiBenford framework is able to mine anomalous subgraphs, and provide novel insights into financial transaction data. The code and the datasets are available at https://github.com/tsourakakis-lab/antibenford-subgraphs.
引用
收藏
页码:2762 / 2770
页数:9
相关论文
共 80 条
[1]   ChiSeL: Graph Similarity Search using Chi-Squared Statistics in Large Probabilistic Graphs [J].
Agarwal, Shubhangi ;
Dutta, Sourav ;
Bhattacharya, Arnab .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (10) :1654-1668
[2]   Graph based anomaly detection and description: a survey [J].
Akoglu, Leman ;
Tong, Hanghang ;
Koutra, Danai .
DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (03) :626-688
[3]  
Akoglu L, 2010, LECT NOTES ARTIF INT, V6119, P410
[4]  
AL CE, 2014, ADV KNOWLEDGE DISCOV, P126
[5]   powerlaw: A Python']Python Package for Analysis of Heavy-Tailed Distributions [J].
Alstott, Jeff ;
Bullmore, Edward T. ;
Plenz, Dietmar .
PLOS ONE, 2014, 9 (01)
[6]  
Andersen R, 2009, LECT NOTES COMPUT SC, V5427, P25, DOI 10.1007/978-3-540-95995-3_3
[7]  
[Anonymous], 2014, ARXIV14051477
[8]  
[Anonymous], 2016, P 27 ANN ACM SIAM S
[9]  
[Anonymous], 2015, ARXIV150604505
[10]  
[Anonymous], COMPL LIV HIST ETH B