Mapping user behaviors to identify professional accounts in Ethereum using semi-supervised learning

被引:1
|
作者
Valadares, Julia Almeida [1 ]
Villela, Saulo Moraes [1 ]
Bernardino, Heder Soares [1 ]
Goncalves, Glauber Dias [2 ]
Vieira, Alex Borges [1 ]
机构
[1] Univ Fed Juiz de Fora, Comp Sci Dept, Juiz De Fora, MG, Brazil
[2] Univ Fed Piaui, Informat Syst, Picos, Piaui, Brazil
关键词
Cryptocurrency; Blockchain; Ethereum; Transactions; Machine learning; Semi -supervised learning;
D O I
10.1016/j.eswa.2023.120438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ethereum is one of the largest blockchain platforms currently that has become a digital business environment. This platform allows for decentralized transactions between anonymous users. Thus, the development of methods to identify users' behaviors and keep them anonymous can potentially leverage business on this platform. In this work, we aim to combine different categories of machine learning approaches, namely, unsupervised and semi-supervised, to map the behaviors of users' owned accounts and identify users with professional activities in Ethereum. In addition, we provide here data to the community and analyze different machine learning techniques to characterize the users of Ethereum. These are challenging tasks due to the small fraction of publicly labeled data referring to users' accounts that provide services on this platform, such as exchange, payment, and entertainment, among most casual behavior users. Initially, we use unsupervised learning techniques to cluster the unlabeled users' accounts and to identify a set of them with casual behavior. As an outcome, a dataset containing labeled (casual or professional) and unlabeled instances is obtained. Semi-supervised learning methods are then applied (i) to generate models that classify accounts' behaviors into casual or professional ones and (ii) to discover accounts with professional behaviors among the unlabeled ones. Computational experiments were conducted, and the results obtained by the proposed procedure are compared to those achieved by supervised learning techniques from the literature. The proposal outperformed those from the literature and reached values higher than 95% for the accuracy, precision, recall, F beta-scores, MCC, and AUC-ROC.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Semi-supervised learning by disagreement
    Zhi-Hua Zhou
    Ming Li
    Knowledge and Information Systems, 2010, 24 : 415 - 439
  • [2] A survey on semi-supervised learning
    Jesper E. van Engelen
    Holger H. Hoos
    Machine Learning, 2020, 109 : 373 - 440
  • [3] Semi-supervised learning by disagreement
    Zhou, Zhi-Hua
    Li, Ming
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (03) : 415 - 439
  • [4] Human Semi-Supervised Learning
    Gibson, Bryan R.
    Rogers, Timothy T.
    Zhu, Xiaojin
    TOPICS IN COGNITIVE SCIENCE, 2013, 5 (01) : 132 - 172
  • [5] A survey on semi-supervised learning
    Van Engelen, Jesper E.
    Hoos, Holger H.
    MACHINE LEARNING, 2020, 109 (02) : 373 - 440
  • [6] Semi-supervised Learning by Spectral Mapping with Label Information
    Zhao, Zhong-Qiu
    Gao, Jun
    Wu, Xindong
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2010, 6319 : 448 - +
  • [7] SemiHD: Semi-Supervised Learning Using Hyperdimensional Computing
    Imani, Mohsen
    Bosch, Samuel
    Javaheripi, Mojan
    Rouhani, Bita
    Wu, Xinyu
    Koushanfar, Farinaz
    Rosing, Tajana
    2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,
  • [8] Using Semi-Supervised Learning for Predicting Metamorphic Relations
    Hardin, Bonnie
    Kanewala, Upulee
    2018 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2018), 2018, : 14 - 17
  • [9] Predicting buffer overflow using semi-supervised learning
    Meng Qingkun
    Wen Shameng
    Feng Chao
    Tang Chaojing
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 1959 - 1963
  • [10] Detecting Malicious Accounts on the Ethereum Blockchain with Supervised Learning
    Kumar, Nitesh
    Singh, Ajay
    Handa, Anand
    Shukla, Sandeep Kumar
    CYBER SECURITY CRYPTOGRAPHY AND MACHINE LEARNING (CSCML 2020), 2020, 12161 : 94 - 109