Near real-time Twitter spam detection with machine learning techniques

Cited by: 0
Authors
Sun N. [1]
Lin G. [1]
Qiu J. [1]
Rimba P. [2]
Affiliations
[1] School of Information Technology, Deakin University, Geelong
[2] Data61, CSIRO, Melbourne
Keywords
classification; machine learning; social network security; spam detection
DOI
10.1080/1206212X.2020.1751387
Abstract
The popularity of social media networks such as Twitter has led to a growing volume of spamming activity. Researchers have employed various machine learning methods to detect Twitter spam. However, most existing research is limited to theoretical study, and few works apply detection techniques to real-time scenarios. In this paper, we bridge the gap by proposing a near real-time Twitter spam detection system, which provides near real-time tweet data acquisition, lightweight feature extraction from a specific Twitter account, detection model training, and online visualization of detection results. In this system, account-based and content-based features are extracted to facilitate spam detection. The models applied in our Twitter spam detection system are trained on 1.5 million public tweets using nine mainstream algorithms. In addition, to reduce the training time spent on massive data and lower the cost of model updating, a parallel computing technique is introduced to train and update the models in this system. Empirical results verify that the models achieve satisfactory performance on our datasets. Furthermore, we implement a near real-time Twitter spam detection system that can better protect users from spam. This system also acts as a tweet collection tool, allowing researchers to test the performance of trained classifiers in realistic scenarios. © 2020 Informa UK Limited, trading as Taylor & Francis Group.
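The abstract mentions lightweight account-based and content-based features without listing them. The following is only a minimal sketch of what such extraction might look like; the `Tweet` record, its field names, and the particular features chosen are all assumptions made for illustration, not the authors' feature set.

```python
import re
from dataclasses import dataclass


@dataclass
class Tweet:
    # Hypothetical record; these field names are illustrative only,
    # not the schema used in the paper.
    text: str
    followers: int
    following: int
    account_age_days: int


# Simple pattern for http/https links in the tweet text.
URL_RE = re.compile(r"https?://\S+")


def extract_features(tweet: Tweet) -> dict:
    """Return a flat dict of cheap-to-compute features (assumed set)."""
    text = tweet.text
    return {
        # Account-based features (assumed)
        "account_age_days": tweet.account_age_days,
        "followers": tweet.followers,
        "following": tweet.following,
        # Guard against division by zero for accounts following nobody.
        "follower_ratio": tweet.followers / max(tweet.following, 1),
        # Content-based features (assumed)
        "num_chars": len(text),
        "num_urls": len(URL_RE.findall(text)),
        "num_hashtags": text.count("#"),
        "num_mentions": text.count("@"),
        "num_digits": sum(ch.isdigit() for ch in text),
    }
```

Each feature is computed in a single pass over fields already present on the record, which is the sense in which such features could be called lightweight; the resulting dict could then be fed to any standard classifier.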
Pages: 338-348
Page count: 10