Ensemble based spam detection in social loT using probabilistic data structures

被引:24
|
作者
Singh, Amritpal [1 ]
Batra, Shalini [1 ]
机构
[1] Thapar Univ, Patiala 147001, Punjab, India
关键词
Spam detection; Tweet classification; Ensemble model; Quotient Filter; Locality Sensitive Hashing; INTERNET;
D O I
10.1016/j.future.2017.09.072
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A social approach can be used for the Internet of Things (IoT) to connect large number of objects in social networks like Twitter, Facebook, Instagram, etc. Social networks within the loT domain have simplified the task of dynamic discovery of services and information. Detecting spam in social media, especially when massive data flows continuously and large number of attributes are associated with it, is a daunting task which requires lot of technical insight. This paper proposes a semi-supervised technique for spam detection in Twitter by employing ensemble based framework comprising of four classifiers. The framework is based on usage of Probabilistic Data Structures (PDS) like Quotient Filter (QF) to query the URL database, spam users, spam words databases and Locality Sensitive Hashing (LSH) for similarity search, as classifiers in various stages which provide fast results with less computational effort. Performance of the framework has been evaluated by comparative analysis of PDS with the similar data structures and through the standard evaluation parameters which include precision, recall and F-score. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:359 / 371
页数:13
相关论文
共 50 条
  • [1] Spam Detection Using Ensemble Learning
    Gupta, Vashu
    Mehta, Aman
    Goel, Akshay
    Dixit, Utkarsh
    Pandey, Avinash Chandra
    HARMONY SEARCH AND NATURE INSPIRED OPTIMIZATION ALGORITHMS, 2019, 741 : 661 - 668
  • [2] A Heterogeneous Ensemble Learning Framework for Spam Detection in Social Networks with Imbalanced Data
    Zhao, Chensu
    Xin, Yang
    Li, Xuefeng
    Yang, Yixian
    Chen, Yuling
    APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [3] Consensus based Ensemble model for Spam detection
    Pantola, Paritosh
    Bala, Anju
    Rana, Prashant Singh
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1724 - 1727
  • [4] Hybrid ensemble framework with self-attention mechanism for social spam detection on imbalanced data
    Rao, Sanjeev
    Verma, Anil Kumar
    Bhatia, Tarunpreet
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 217
  • [5] Ensemble-Based Text Classification for Spam Detection
    Zhang X.
    Liu G.
    Zhang M.
    Informatica (Slovenia), 2024, 48 (06): : 71 - 80
  • [6] Enhancing Detection of Arabic Social Spam Using Data Augmentation and Machine Learning
    Alkadri, Abdullah M.
    Elkorany, Abeer
    Ahmed, Cherry
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [7] Using Social Network Analysis for Spam Detection
    DeBarr, Dave
    Wechsler, Harry
    ADVANCES IN SOCIAL COMPUTING, PROCEEDINGS, 2010, 6007 : 62 - 69
  • [8] Social SpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks
    Jin, Xin
    Lin, Cindy Xide
    Luo, Jiebo
    Han, Jiawei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (12): : 1458 - 1461
  • [9] Ensemble-Based Spam Detection in Smart Home IoT Devices Time Series Data Using Machine Learning Techniques
    Zainab, Ameema
    S. Refaat, Shady
    Bouhali, Othmane
    INFORMATION, 2020, 11 (07)
  • [10] SPAM DETECTION USING DATA COMPRESSION AND SIGNATURES
    Prilepok, Michal
    Berek, Petr
    Platos, Jan
    Snasel, Vaclav
    CYBERNETICS AND SYSTEMS, 2013, 44 (6-7) : 533 - 549