Machine learning in classifying bitcoin addresses

被引:2
作者
Garin, Leonid [1 ]
Gisin, Vladimir [2 ]
机构
[1] PJSC Rosbank, Mashi Poryvayevoy St 34, Moscow, Russia
[2] Financial Univ Govt Russian Federat, Leningradsky Prospect 49-2, Moscow 125167, Russia
来源
JOURNAL OF FINANCE AND DATA SCIENCE | 2023年 / 9卷
关键词
Blockchain; Transaction; Scoring; Machine learning; Cryptocurrency; Suspicious activity;
D O I
10.1016/j.jfds.2023.100109
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
The emergence of the Bitcoin cryptocurrency marked a new era of illegal transactions. Cryptocurrency provides some level of anonymity allowing its users to create an unlimited number of wallets with alias addresses, which makes it challenging to identify the actual user. This is used by criminals for the purpose of making illegal transactions. At the same time, Bitcoin stores and provides information about all committed transactions, which opens up opportunities for identifying suspicious behavior patterns in this network using data mining. The problem of detecting suspicious activity in the Bitcoin network can be solved with sufficiently high accuracy using machine learning methods. The paper provides a comparative study of various machine learning methods to solve the mentioned problem: logistic regression, decision tree, random forest, gradient boosting. Selecting hyper parameters, rebalancing the dataset, and active learning are particularly important. The most important hyperparameters of the algorithms are described. Metrics show that the gradient boosting looks the most promising. In total 38 features of bitcoin addresses were identified. The top features are presented in the paper.
引用
收藏
页数:10
相关论文
共 33 条
  • [1] Akhtar N., 2021, Research Anthology on Digital Transformation, Organizational Change, and the Impact of Remote Work, P485
  • [2] Alkadri S., 2018, Duke Law Technology Review, V17, P71
  • [3] Balaskas A, 2018, 2018 INTERNATIONAL CONFERENCE ON CYBER SECURITY AND PROTECTION OF DIGITAL SERVICES (CYBER SECURITY)
  • [4] A comparative analysis of gradient boosting algorithms
    Bentejac, Candice
    Csorgo, Anna
    Martinez-Munoz, Gonzalo
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (03) : 1937 - 1967
  • [5] The Identity, Fungibility and Anonymity of Money
    Berg, Alastair
    [J]. ECONOMIC PAPERS, 2020, 39 (02): : 104 - 117
  • [6] Graph-based visual analytics for cyber threat intelligence
    Boehm, Fabian
    Menges, Florian
    Pernul, Guenther
    [J]. CYBERSECURITY, 2018, 1 (01):
  • [7] Bolton RJ, 2002, STAT SCI, V17, P235
  • [8] Federal Bureau of Investigation, 2012, Bitcoin virtual currency: unique features present distinct challenges for deterring illicit Intelligence Assessment, V1, P4
  • [9] Feldman E.V., 2021, Chelyabinsk Journal of Physics and Mathematics, V1, P119
  • [10] Financial Action Task Force, 2021, Public Statement on Virtual Assets and Related Providers. Updated Guidance for a Risk-Based Approach to Virtual Assets and Virtual Asset Service Providers