Do Not Rug on Me: Leveraging Machine Learning Techniques for Automated Scam Detection

被引:22
|
作者
Mazorra, Bruno [1 ]
Adan, Victor [2 ]
Daza, Vanesa [1 ]
机构
[1] Pompeu Fabra Univ, Dept Informat & Commun Technol, Tanger Bldg, Barcelona 08018, Spain
[2] Univ Barcelona, Fac Econ & Business, Barcelona 08034, Spain
关键词
ethereum; DeFi; DEX; scam detection;
D O I
10.3390/math10060949
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Uniswap, as with other DEXs, has gained much attention this year because it is a non-custodial and publicly verifiable exchange that allows users to trade digital assets without trusted third parties. However, its simplicity and lack of regulation also make it easy to execute initial coin offering scams by listing non-valuable tokens. This method of performing scams is known as rug pull, a phenomenon that already exists in traditional finance but has become more relevant in DeFi. Various projects have contributed to detecting rug pulls in EVM compatible chains. However, the first longitudinal and academic step to detecting and characterizing scam tokens on Uniswap was made. The authors collected all the transactions related to the Uniswap V2 exchange and proposed a machine learning algorithm to label tokens as scams. However, the algorithm is only valuable for detecting scams accurately after they have been executed. This paper increases their dataset by 20K tokens and proposes a new methodology to label tokens as scams. After manually analyzing the data, we devised a theoretical classification of different malicious maneuvers in the Uniswap protocol. We propose various machine-learning-based algorithms with new, relevant features related to the token propagation and smart contract heuristics to detect potential rug pulls before they occur. In general, the models proposed achieved similar results. The best model obtained accuracy of 0.9936, recall of 0.9540, and precision of 0.9838 in distinguishing non-malicious tokens from scams prior to the malicious maneuver.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] A Review of Machine Learning Techniques in Cyberbullying Detection
    Sultan, Daniyar
    Omarov, Batyrkhan
    Kozhamkulova, Zhazira
    Kazbekova, Gulnur
    Alimzhanova, Laura
    Dautbayeva, Aigul
    Zholdassov, Yernar
    Abdrakhmanov, Rustam
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 5625 - 5640
  • [42] Machine Learning Techniques for Breast Cancer Detection
    Hall, Karl
    Chang, Victor
    Mitchell, Paul
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON COMPLEXITY, FUTURE INFORMATION SYSTEMS AND RISK (COMPLEXIS), 2022, : 116 - 122
  • [43] Study on Machine Learning Techniques for Botnet Detection
    Silva, L.
    Utimura, L.
    Costa, K.
    Silva, M.
    Prado, S.
    IEEE LATIN AMERICA TRANSACTIONS, 2020, 18 (05) : 881 - 888
  • [44] Botnet Detection via Machine Learning Techniques
    Wang, Haofan
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 836 - 841
  • [45] Comparison of machine learning techniques for spam detection
    Ghosh, Argha
    Senthilrajan, A.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29227 - 29254
  • [46] An Insight into Machine Learning Techniques for Cancer Detection
    Chhillar I.
    Singh A.
    Journal of The Institution of Engineers (India): Series B, 2023, 104 (04) : 963 - 985
  • [47] Futuristic Machine Learning Techniques for Diabetes Detection
    Panakanti, Pavan Kumar
    Porika, Sammulal
    Yadav, S. K.
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (05): : 487 - 500
  • [48] Anomaly Detection using Machine Learning Techniques
    Wankhede, Sonali B.
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [49] Investigating machine learning techniques for the detection of autism
    Alkoot, Fuad M.
    Alqallaf, Abdullah K.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 16 (02) : 141 - 169
  • [50] Automated IT Service Desk Systems Using Machine Learning Techniques
    Paramesh, S. P.
    Shreedhara, K. S.
    DATA ANALYTICS AND LEARNING, 2019, 43 : 331 - 346