Near real-time twitter spam detection with machine learning techniques

被引:0
|
作者
Sun N. [1 ]
Lin G. [1 ]
Qiu J. [1 ]
Rimba P. [2 ]
机构
[1] School of Information Technology, Deakin University, Geelong
[2] Data61, CSIRO, Melbourne
关键词
classification; machine learning; Social network security; spam detection;
D O I
10.1080/1206212X.2020.1751387
中图分类号
学科分类号
摘要
The popularity of social media networks, such as Twitter, leads to an increasing number of spamming activities. Researchers employed various machine learning methods to detect Twitter spam. However, majorities of existing researches are limited to theoretically study, few of them can apply detection techniques to real-time scenario. In this paper, we bridge the gap by proposing a near real-time Twitter spam detection system, which provides near real-time tweets data acquisition, light-weight features extraction from a specific Twitter account, training detection model, and online visualizing detection results. In this system, account-based and content-based features are extracted to facilitate spam detection. The models that are applied to our Twitter spam detection system are trained based on 1.5 million public tweets and nine mainstream algorithms. In addition, in order to efficiently reduce training time spent on massive data and save the cost of model updating, a parallel computing technique is introduced to train and update the models in this system. Empirical results verify that the model can achieve satisfactory performance based on our datasets. Furthermore, we implement a near real-time Twitter spam detection system which can better protect users from combating spams. This system also acts as a tweets collection tool, allowing researchers to test the performance of trained classifiers in realistic scenarios. © 2020 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:338 / 348
页数:10
相关论文
共 50 条
  • [1] RETRACTED: Real-Time Twitter Spam Detection and Sentiment Analysis using Machine Learning and Deep Learning Techniques (Retracted Article)
    Rodrigues, Anisha P.
    Fernandes, Roshan
    Aakash, A.
    Abhishek, B.
    Shetty, Adarsh
    Atul, K.
    Lakshmanna, Kuruva
    Shafi, R. Mahammad
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [2] A Framework for Real-Time Spam Detection in Twitter
    Gupta, Himank
    Jamal, Mohd. Saalim
    Madisetty, Sreekanth
    Desarkar, Maunendra Sankar
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2018, : 380 - 387
  • [3] Machine Learning for the Detection of Spam in Twitter Networks
    Wang, Alex Hai
    E-BUSINESS AND TELECOMMUNICATIONS, 2012, 222 : 319 - 333
  • [4] Machine and Deep Learning Algorithms for Twitter Spam Detection
    Alsaffar, Dalia
    Alfahhad, Amjad
    Alqhtani, Bashaier
    Alamri, Lama
    Alansari, Shahad
    Alqahtani, Nada
    Alboaneen, Dabiah A.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2019, 2020, 1058 : 483 - 491
  • [5] Statistical Features-Based Real-Time Detection of Drifted Twitter Spam
    Chen, Chao
    Wang, Yu
    Zhang, Jun
    Xiang, Yang
    Zhou, Wanlei
    Min, Geyong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (04) : 914 - 925
  • [6] Enhancing Email Security: A Real-Time Machine Learning-Based Spam Detection System
    Yadav, Dharmveer Kumar
    Raj, Abhishek
    Rajlakshmi, Neeraj
    Kumar, Neeraj
    Kumari, Ritu
    INTERNET TECHNOLOGY LETTERS, 2024,
  • [7] Real-Time Twitter Trend Analysis Using Big Data Analytics and Machine Learning Techniques
    Rodrigues, Anisha P.
    Fernandes, Roshan
    Bhandary, Adarsh
    Shenoy, Asha C.
    Shetty, Ashwanth
    Anisha, M.
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [8] Comparison of machine learning techniques for spam detection
    Ghosh, Argha
    Senthilrajan, A.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29227 - 29254
  • [9] Comparison of machine learning techniques for spam detection
    Argha Ghosh
    A. Senthilrajan
    Multimedia Tools and Applications, 2023, 82 : 29227 - 29254
  • [10] MACHINE LEARNING BASED TWITTER SPAM ACCOUNT DETECTION: A REVIEW
    Gheewala, Shivangi
    Patel, Rakesh
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 79 - 84