Mapping user behaviors to identify professional accounts in Ethereum using semi-supervised learning

被引:1
|
作者
Valadares, Julia Almeida [1 ]
Villela, Saulo Moraes [1 ]
Bernardino, Heder Soares [1 ]
Goncalves, Glauber Dias [2 ]
Vieira, Alex Borges [1 ]
机构
[1] Univ Fed Juiz de Fora, Comp Sci Dept, Juiz De Fora, MG, Brazil
[2] Univ Fed Piaui, Informat Syst, Picos, Piaui, Brazil
关键词
Cryptocurrency; Blockchain; Ethereum; Transactions; Machine learning; Semi -supervised learning;
D O I
10.1016/j.eswa.2023.120438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ethereum is one of the largest blockchain platforms currently that has become a digital business environment. This platform allows for decentralized transactions between anonymous users. Thus, the development of methods to identify users' behaviors and keep them anonymous can potentially leverage business on this platform. In this work, we aim to combine different categories of machine learning approaches, namely, unsupervised and semi-supervised, to map the behaviors of users' owned accounts and identify users with professional activities in Ethereum. In addition, we provide here data to the community and analyze different machine learning techniques to characterize the users of Ethereum. These are challenging tasks due to the small fraction of publicly labeled data referring to users' accounts that provide services on this platform, such as exchange, payment, and entertainment, among most casual behavior users. Initially, we use unsupervised learning techniques to cluster the unlabeled users' accounts and to identify a set of them with casual behavior. As an outcome, a dataset containing labeled (casual or professional) and unlabeled instances is obtained. Semi-supervised learning methods are then applied (i) to generate models that classify accounts' behaviors into casual or professional ones and (ii) to discover accounts with professional behaviors among the unlabeled ones. Computational experiments were conducted, and the results obtained by the proposed procedure are compared to those achieved by supervised learning techniques from the literature. The proposal outperformed those from the literature and reached values higher than 95% for the accuracy, precision, recall, F beta-scores, MCC, and AUC-ROC.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Identifying User Behavior Profiles in Ethereum Using Machine Learning Techniques
    Valadares, Julia Almeida
    Oliveira, Vinicius C.
    de Azevedo Sousa, Jose Eduardo
    Bernardino, Heder S.
    Villela, Saulo Moraes
    Vieira, Alex Borges
    Goncalves, Glauber
    2021 IEEE INTERNATIONAL CONFERENCE ON BLOCKCHAIN (BLOCKCHAIN 2021), 2021, : 327 - 332
  • [32] Driver Distraction Detection Using Semi-Supervised Machine Learning
    Liu, Tianchi
    Yang, Yan
    Huang, Guang-Bin
    Yeo, Yong Kiang
    Lin, Zhiping
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (04) : 1108 - 1120
  • [33] Effective Intrusion Detection System Using Semi-Supervised Learning
    Wagh, Sharmila Kishor
    Kolhe, Satish R.
    2014 INTERNATIONAL CONFERENCE ON DATA MINING AND INTELLIGENT COMPUTING (ICDMIC), 2014,
  • [34] Robust identification of molecular phenotypes using semi-supervised learning
    Heinrich Roder
    Carlos Oliveira
    Lelia Net
    Benjamin Linstid
    Maxim Tsypin
    Joanna Roder
    BMC Bioinformatics, 20
  • [35] Abnormal Transactions Detection in the Ethereum Network Using Semi-Supervised Generative Adversarial Networks
    Sanjalawe, Yousef K.
    Al-E'mari, Salam R.
    IEEE ACCESS, 2023, 11 : 98516 - 98531
  • [36] Determining Functional User Comments of Mobile Applications with Semi-Supervised Learning
    Akbunar, Duygu
    Erdogan, Batuhan
    Biricik, Goksel
    Amasyali, Mehmet Fatih
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [37] Introduction to semi-supervised learning
    Goldberg, Xiaojin
    Synthesis Lectures on Artificial Intelligence and Machine Learning, 2009, 6 : 1 - 116
  • [38] Reliable Semi-supervised Learning
    Shao, Junming
    Huang, Chen
    Yang, Qinli
    Luo, Guangchun
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1197 - 1202
  • [39] Supervised and Semi-Supervised Learning for Failure Identification in Microwave Networks
    Musumeci, Francesco
    Magni, Luca
    Ayoub, Omran
    Rubino, Roberto
    Capacchione, Massimiliano
    Rigamonti, Gabriele
    Milano, Michele
    Passera, Claudio
    Tornatore, Massimo
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (02): : 1934 - 1945
  • [40] On Semi-Supervised Learning and Sparsity
    Balinsky, Alexander
    Balinsky, Helen
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3083 - +