A hybrid classification method for Twitter spam detection based on differential evolution and random forest

Cited by: 29
Authors
Bazzaz Abkenar, Sepideh [1]
Mahdipour, Ebrahim [1]
Jameii, Seyed Mahdi [2]
Haghi Kashani, Mostafa [2]
Affiliations
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran
[2] Islamic Azad Univ, Shahr E Qods Branch, Dept Comp Engn, Tehran, Iran
Source
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2021, Vol. 33, Issue 21
Keywords
imbalanced dataset; machine learning; social networks; spam; Twitter
DOI
10.1002/cpe.6381
Chinese Library Classification (CLC) number
TP31 [Computer software];
Subject classification codes
081202; 0835;
Abstract
Social networking services are online platforms that are distributed across different computers over long distances. Twitter is one of the most popular microblogging sites, allowing users to share their opinions and report real-world events. Its popularity and ease of use have also attracted spammers, which makes spam detection one of the most critical problems for the platform. Providing a spam-free environment requires identifying and filtering both spam tweets and the accounts that post them. This paper presents a hybrid method, based on the Synthetic Minority Over-sampling TEchnique (SMOTE) and Differential Evolution (DE), to improve the spam detection rate on real Twitter datasets. SMOTE is applied to address the imbalanced class distribution of the datasets, while DE is used to tune the hyperparameters of a Random Forest (RF) classifier. According to the evaluation results and a comparison with related work, the presented method significantly improves classification performance on imbalanced datasets. The optimized RF achieves an F1-score of 98.97% and an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.999, demonstrating the effectiveness of the proposed method.
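The two building blocks named in the abstract can be illustrated with a minimal, standard-library-only Python sketch. It is not the authors' implementation: the function names `smote` and `differential_evolution`, the DE/rand/1/bin variant, the control parameters (`f`, `cr`, population size), and the toy data are all assumptions made for illustration. In particular, `toy_objective` is a hypothetical stand-in for a real objective such as "1 minus the validation F1-score of a Random Forest trained with the candidate hyperparameters"; its optimum is invented.

```python
import math
import random


def smote(minority, n_new, k=3, rng=None):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (SMOTE-style)."""
    rng = rng or random.Random(0)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` among the other minority samples
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        mate = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> mate
        synthetic.append([b + gap * (m - b) for b, m in zip(base, mate)])
    return synthetic


def differential_evolution(objective, bounds, pop_size=15, f=0.8, cr=0.9,
                           generations=60, rng=None):
    """Minimise `objective` inside box `bounds` with DE/rand/1/bin."""
    rng = rng or random.Random(0)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(ind) for ind in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # three distinct individuals, all different from the target i
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            forced = rng.randrange(dim)  # at least one gene comes from the mutant
            trial = []
            for d in range(dim):
                if d == forced or rng.random() < cr:
                    value = pop[a][d] + f * (pop[b][d] - pop[c][d])
                    lo, hi = bounds[d]
                    value = min(max(value, lo), hi)  # clip to the search box
                else:
                    value = pop[i][d]
                trial.append(value)
            trial_score = objective(trial)
            if trial_score <= scores[i]:  # greedy one-to-one selection
                pop[i], scores[i] = trial, trial_score
    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]


def toy_objective(params):
    # Hypothetical stand-in for "1 - validation F1 of a Random Forest";
    # the optimum (200 trees, depth 12) is invented purely for illustration.
    n_trees, depth = params
    return ((n_trees - 200) / 100) ** 2 + ((depth - 12) / 10) ** 2


# Toy minority ("spam") class with two features per sample.
spam = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
extra = smote(spam, n_new=6)

# Search box: number of trees in [10, 500], maximum depth in [2, 30].
bounds = [(10, 500), (2, 30)]
best, score = differential_evolution(toy_objective, bounds)
```

In a realistic pipeline, oversampling would be applied to the training split only, and the objective would round the continuous DE vector to integer hyperparameters before training and scoring the classifier.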
Pages: 20