Big Data Analytics for Anomaly Detection in Blockchain

被引:0
作者
Ozbilen, Mahmut Lutfullah [1 ]
Ozcan, Elif [1 ]
Keles, Mustafa Berk [1 ]
Zeybel, Merve [1 ]
Dervisoglu, Havanur [1 ]
Dogan, Aslinur [1 ]
Haklidir, Mehmet [1 ]
机构
[1] TUBITAK BILGEM Baris, Dr Zeki Acar Cad 1, Gebze, Kocaeli, Turkiye
来源
2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2023年
关键词
blockchain; anomaly detection; machine learning; clustering;
D O I
10.1109/SIU59756.2023.10223963
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to make fast and automatic transactions anonymously without intermediary institutions offered by blockchain technology has made this technology widespread in the use of money transfer. With its widespread use, it has also become a preferred transfer method for illegal transactions such as tax evasion and the sale of illegal goods. This has increased the importance of anomalous transactions and anomalous user detection studies in transactions. In this study, a solution is presented that enables the detection of anomalies and anomalous users in blockchain transfers by using big data storage and data processing tools. A scalable structure has been created in order to deal with the increase in the size and frequency of blockchain transfers with the use of big data technologies. In the solution, first of all, the users making the transfers and the transfer transactions are clustered using the K-Means method. Transfer information and attributes that represent the blockchain network as a graph are used as features. After clustering, anomalies were found by measuring the distance of the elements from the cluster center. The performance of the clustering was calculated using the silhouette score. When the results are analyzed, the model that reduces the size to two with principal component analysis for clustering users showed the highest scores with a silhouette score of 0.51, and the model that reduces the size of graph-based attributes with principal component analysis for clustering transfers showed the highest scores with a silhouette score of 0.87. These models were applied to real time transactions.
引用
收藏
页数:4
相关论文
共 16 条
[11]  
Monamo P, 2016, INFO SECUR S AFR, P129, DOI 10.1109/ISSA.2016.7802939
[12]  
Pham T., 2016, arXiv
[13]  
Sayadi S, 2019, INT WIREL COMMUN, P895, DOI 10.1109/IWCMC.2019.8766765
[14]   Cluster Quality Analysis Using Silhouette Score [J].
Shahapure, Ketan Rajshekhar ;
Nicholas, Charles .
2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, :747-748
[15]  
Spark.apache, PySpark 3.4.0 documentation
[16]  
Vujicic D, 2018, INFOTEH JAHORINA