Platform-Oblivious Anti-Spam Gateway

被引:0
|
作者
Zhang, Yihe [1 ]
Yuan, Xu [1 ]
Tzeng, Nian-Feng [1 ]
机构
[1] Univ Louisiana Lafayette, Lafayette, LA 70504 USA
来源
37TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2021 | 2021年
关键词
Anti-Spam; Unsupervised; Outlier Detection;
D O I
10.1145/3485832.3488024
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper addresses a novel anti-spam gateway targeting multiple linguistic-based social platforms to expose the outlier property of their spam messages uniformly for effective detection. Instead of labeling ground truth datasets and extracting key features, which are labor-intensive and time-consuming, we start with coarsely mining seed corpora of spams and hams from the target data (aiming for spam classification), before reconstructing them as the reference. To catch each word's rich information in the semantic and syntactic perspectives, we then leverage the natural language processing (NLP) model to embed each word into the high-dimensional vector space and use a neural network to train a spam word model. After that, each message is encoded by using the predicted spam scores from this model for all included stem words. The encoded messages are processed by the prominent outlier techniques to produce their respective scores, allowing us to rank them for making the outlier visible. Our solution is unsupervised, without relying on specifics of any platform or dataset, to be platform-oblivious. Through extensive experiments, our solution is demonstrated to expose spammers' outlier characteristics effectively, outperform all examined unsupervised methods in almost all metrics, and may even better supervised counterparts.
引用
收藏
页码:1064 / 1077
页数:14
相关论文
共 22 条
  • [1] Flow-based anti-spam
    Qiu, XF
    Hao, JH
    Chen, M
    2004 IEEE Workshop on IP Operations and Management Proceedings (IPOM 2004): SELF-MEASUREMENT & SELF-MANAGEMENT OF IP NETWORKS & SERVICES, 2004, : 99 - 103
  • [2] A new anti-spam protocol using CAPTCHA
    Shirali-Shahreza, Sajad
    Movaghar, Ali
    2007 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING, AND CONTROL, VOLS 1 AND 2, 2007, : 234 - +
  • [3] An Anti-Spam System Based on Service Grid
    Ye, Liang
    Zhong, Weiming
    Liu, Peng
    PROGRESS IN MEASUREMENT AND TESTING, PTS 1 AND 2, 2010, 108-111 : 1421 - +
  • [4] Toward Spam 2.0: An Evaluation of Web 2.0 Anti-Spam Methods
    Hayati, Pedram
    Potdar, Vidyasagar
    2009 7TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1 AND 2, 2009, : 875 - 880
  • [5] An anti-spam scheme using pre-challenges
    Roman, Rodrigo
    Zhou, Hanying
    Lopez, Javier
    COMPUTER COMMUNICATIONS, 2006, 29 (15) : 2739 - 2749
  • [6] Optimization of Anti-Spam Systems with Multiobjective Evolutionary Algorithms
    Basto-Fernandes, Vitor
    Yevseyeva, Iryna
    Mendez, Jose R.
    INFORMATION RESOURCES MANAGEMENT JOURNAL, 2013, 26 (01) : 54 - 67
  • [7] ADAPTABLE ANTI-SPAM TECHNIQUE FOR THE INTERNET WEB BBS
    Hong, Joonmo
    Kang, Boo Joong
    Im, Eul Gyu
    2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 614 - 617
  • [8] Anti-spam Filters Based on Support Vector Machines
    Xie, Chengwang
    Ding, Lixin
    Du, Xin
    ADVANCES IN COMPUTATION AND INTELLIGENCE, PROCEEDINGS, 2009, 5821 : 349 - 357
  • [9] Study on ASP-based Anti-spam Management System
    Wang Yingjie
    Chen Xiaoyu
    Wang Lin
    Liang Xiaoqiang
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 581 - 584
  • [10] AUTOSCALABILE DISTRIBUTED ANTI-SPAM SMTP SYSTEM BASED ON KUBERNETES
    Gavrilovic, Nadja
    Ciric, Vladimir
    FACTA UNIVERSITATIS-SERIES ELECTRONICS AND ENERGETICS, 2021, 34 (04) : 525 - 546