Big Data Analytics for Anomaly Detection in Blockchain

被引:0
作者
Ozbilen, Mahmut Lutfullah [1 ]
Ozcan, Elif [1 ]
Keles, Mustafa Berk [1 ]
Zeybel, Merve [1 ]
Dervisoglu, Havanur [1 ]
Dogan, Aslinur [1 ]
Haklidir, Mehmet [1 ]
机构
[1] TUBITAK BILGEM Baris, Dr Zeki Acar Cad 1, Gebze, Kocaeli, Turkiye
来源
2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2023年
关键词
blockchain; anomaly detection; machine learning; clustering;
D O I
10.1109/SIU59756.2023.10223963
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to make fast and automatic transactions anonymously without intermediary institutions offered by blockchain technology has made this technology widespread in the use of money transfer. With its widespread use, it has also become a preferred transfer method for illegal transactions such as tax evasion and the sale of illegal goods. This has increased the importance of anomalous transactions and anomalous user detection studies in transactions. In this study, a solution is presented that enables the detection of anomalies and anomalous users in blockchain transfers by using big data storage and data processing tools. A scalable structure has been created in order to deal with the increase in the size and frequency of blockchain transfers with the use of big data technologies. In the solution, first of all, the users making the transfers and the transfer transactions are clustered using the K-Means method. Transfer information and attributes that represent the blockchain network as a graph are used as features. After clustering, anomalies were found by measuring the distance of the elements from the cluster center. The performance of the clustering was calculated using the silhouette score. When the results are analyzed, the model that reduces the size to two with principal component analysis for clustering users showed the highest scores with a silhouette score of 0.51, and the model that reduces the size of graph-based attributes with principal component analysis for clustering transfers showed the highest scores with a silhouette score of 0.87. These models were applied to real time transactions.
引用
收藏
页数:4
相关论文
共 16 条
[1]   Data mining for detecting Bitcoin Ponzi schemes [J].
Bartoletti, Massimo ;
Pes, Barbara ;
Serusi, Sergio .
2018 CRYPTO VALLEY CONFERENCE ON BLOCKCHAIN TECHNOLOGY (CVCBT), 2018, :75-84
[2]  
Cloudera, HDP Documentation
[3]  
Giordani P., 2018, Encyclopedia of Social Network Analysis and Mining
[4]  
Graphframes.github, GraphFrames 0.8.0 Documentation
[5]  
Jin X., 2017, Encyclopedia of machine learning
[6]  
Kakavand H., 2019, SSRN Electronic Journal
[7]  
Kanemura K, 2019, 2019 IEEE INTERNATIONAL CONFERENCE ON BLOCKCHAIN AND CRYPTOCURRENCY (ICBC), P154, DOI 10.1109/BLOC.2019.8751391
[8]   The Rise in Popularity of Cryptocurrency and Associated Criminal Activity [J].
Kethineni, Sesha ;
Cao, Ying .
INTERNATIONAL CRIMINAL JUSTICE REVIEW, 2020, 30 (03) :325-344
[9]   A Survey on Blockchain Anomaly Detection Using Data Mining Techniques [J].
Li, Ji ;
Gu, Chunxiang ;
Wei, Fushan ;
Chen, Xi .
BLOCKCHAIN AND TRUSTWORTHY SYSTEMS, BLOCKSYS 2019, 2020, 1156 :491-504
[10]   Machine learning methods to detect money laundering in the Bitcoin blockchain in the presence of label scarcity [J].
Lorenz, Joana ;
Silva, Maria Ines ;
Aparicio, David ;
Ascensao, Joao Tiago ;
Bizarro, Pedro .
FIRST ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2020, 2020,