Near real-time twitter spam detection with machine learning techniques

被引:0
|
作者
Sun N. [1 ]
Lin G. [1 ]
Qiu J. [1 ]
Rimba P. [2 ]
机构
[1] School of Information Technology, Deakin University, Geelong
[2] Data61, CSIRO, Melbourne
关键词
classification; machine learning; Social network security; spam detection;
D O I
10.1080/1206212X.2020.1751387
中图分类号
学科分类号
摘要
The popularity of social media networks, such as Twitter, leads to an increasing number of spamming activities. Researchers employed various machine learning methods to detect Twitter spam. However, majorities of existing researches are limited to theoretically study, few of them can apply detection techniques to real-time scenario. In this paper, we bridge the gap by proposing a near real-time Twitter spam detection system, which provides near real-time tweets data acquisition, light-weight features extraction from a specific Twitter account, training detection model, and online visualizing detection results. In this system, account-based and content-based features are extracted to facilitate spam detection. The models that are applied to our Twitter spam detection system are trained based on 1.5 million public tweets and nine mainstream algorithms. In addition, in order to efficiently reduce training time spent on massive data and save the cost of model updating, a parallel computing technique is introduced to train and update the models in this system. Empirical results verify that the model can achieve satisfactory performance based on our datasets. Furthermore, we implement a near real-time Twitter spam detection system which can better protect users from combating spams. This system also acts as a tweets collection tool, allowing researchers to test the performance of trained classifiers in realistic scenarios. © 2020 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:338 / 348
页数:10
相关论文
共 50 条
  • [1] Machine and Deep Learning Algorithms for Twitter Spam Detection
    Alsaffar, Dalia
    Alfahhad, Amjad
    Alqhtani, Bashaier
    Alamri, Lama
    Alansari, Shahad
    Alqahtani, Nada
    Alboaneen, Dabiah A.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2019, 2020, 1058 : 483 - 491
  • [2] Statistical Features-Based Real-Time Detection of Drifted Twitter Spam
    Chen, Chao
    Wang, Yu
    Zhang, Jun
    Xiang, Yang
    Zhou, Wanlei
    Min, Geyong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (04) : 914 - 925
  • [3] Comparison of machine learning techniques for spam detection
    Ghosh, Argha
    Senthilrajan, A.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29227 - 29254
  • [4] Comparison of machine learning techniques for spam detection
    Argha Ghosh
    A. Senthilrajan
    Multimedia Tools and Applications, 2023, 82 : 29227 - 29254
  • [5] A Survey On Twitter Spam Drift Detection using Machine Learning
    Bhavani, G. Venkata Durga
    Lakshmi, K. Jayasri Rama
    Sirisha, K.
    Jyothi, K. Satya
    Lakshmi, M. Anantha
    Parthiban, M.
    2024 INTERNATIONAL CONFERENCE ON SOCIAL AND SUSTAINABLE INNOVATIONS IN TECHNOLOGY AND ENGINEERING, SASI-ITE 2024, 2024, : 345 - 349
  • [6] MACHINE LEARNING BASED TWITTER SPAM ACCOUNT DETECTION: A REVIEW
    Gheewala, Shivangi
    Patel, Rakesh
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 79 - 84
  • [7] Enhancing Email Security: A Real-Time Machine Learning-Based Spam Detection System
    Yadav, Dharmveer Kumar
    Raj, Abhishek
    Rajlakshmi, Neeraj
    Kumar, Neeraj
    Kumari, Ritu
    INTERNET TECHNOLOGY LETTERS, 2024,
  • [8] Near-real-time Anomaly Detection in Encrypted Traffic using Machine Learning Techniques
    Ucci, Daniele
    Sobrero, Filippo
    Bisio, Federica
    Zorzino, Matteo
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [9] Analysis of Optimized Machine Learning and Deep Learning Techniques for Spam Detection
    Hossain, Fahima
    Uddin, Mohammed Nasir
    Halder, Rajib Kumar
    2021 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2021, : 552 - 558
  • [10] Performance Evaluation of Machine Learning Algorithms for Spam Profile Detection on Twitter Using WEKA and RapidMiner
    Hanif, Mohamad Hazim Md
    Adewole, Kayode Sakariyah
    Anuar, Nor Badrul
    Kamsin, Amirrudin
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1043 - 1046