Platform-Oblivious Anti-Spam Gateway

被引：0

作者：

Zhang, Yihe ^{[1
]}

Yuan, Xu ^{[1
]}

Tzeng, Nian-Feng ^{[1
]}

机构：

[1] Univ Louisiana Lafayette, Lafayette, LA 70504 USA

来源：

37TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2021 | 2021年

关键词：

Anti-Spam; Unsupervised; Outlier Detection;

D O I：

10.1145/3485832.3488024

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

This paper addresses a novel anti-spam gateway targeting multiple linguistic-based social platforms to expose the outlier property of their spam messages uniformly for effective detection. Instead of labeling ground truth datasets and extracting key features, which are labor-intensive and time-consuming, we start with coarsely mining seed corpora of spams and hams from the target data (aiming for spam classification), before reconstructing them as the reference. To catch each word's rich information in the semantic and syntactic perspectives, we then leverage the natural language processing (NLP) model to embed each word into the high-dimensional vector space and use a neural network to train a spam word model. After that, each message is encoded by using the predicted spam scores from this model for all included stem words. The encoded messages are processed by the prominent outlier techniques to produce their respective scores, allowing us to rank them for making the outlier visible. Our solution is unsupervised, without relying on specifics of any platform or dataset, to be platform-oblivious. Through extensive experiments, our solution is demonstrated to expose spammers' outlier characteristics effectively, outperform all examined unsupervised methods in almost all metrics, and may even better supervised counterparts.

引用

页码：1064 / 1077

页数：14

共 22 条

[1] Flow-based anti-spam
Qiu, XF
Hao, JH
Chen, M
2004 IEEE Workshop on IP Operations and Management Proceedings (IPOM 2004): SELF-MEASUREMENT & SELF-MANAGEMENT OF IP NETWORKS & SERVICES, 2004, : 99 - 103
[2] A new anti-spam protocol using CAPTCHA
Shirali-Shahreza, Sajad
Movaghar, Ali
2007 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING, AND CONTROL, VOLS 1 AND 2, 2007, : 234 - +
[3] An Anti-Spam System Based on Service Grid
Ye, Liang
Zhong, Weiming
Liu, Peng
PROGRESS IN MEASUREMENT AND TESTING, PTS 1 AND 2, 2010, 108-111 : 1421 - +
[4] Toward Spam 2.0: An Evaluation of Web 2.0 Anti-Spam Methods
Hayati, Pedram
Potdar, Vidyasagar
2009 7TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1 AND 2, 2009, : 875 - 880
[5] An anti-spam scheme using pre-challenges
Roman, Rodrigo
Zhou, Hanying
Lopez, Javier
COMPUTER COMMUNICATIONS, 2006, 29 (15) : 2739 - 2749
[6] Optimization of Anti-Spam Systems with Multiobjective Evolutionary Algorithms
Basto-Fernandes, Vitor
Yevseyeva, Iryna
Mendez, Jose R.
INFORMATION RESOURCES MANAGEMENT JOURNAL, 2013, 26 (01) : 54 - 67
[7] ADAPTABLE ANTI-SPAM TECHNIQUE FOR THE INTERNET WEB BBS
Hong, Joonmo
Kang, Boo Joong
Im, Eul Gyu
2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 614 - 617
[8] Anti-spam Filters Based on Support Vector Machines
Xie, Chengwang
Ding, Lixin
Du, Xin
ADVANCES IN COMPUTATION AND INTELLIGENCE, PROCEEDINGS, 2009, 5821 : 349 - 357
[9] Study on ASP-based Anti-spam Management System
Wang Yingjie
Chen Xiaoyu
Wang Lin
Liang Xiaoqiang
INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 581 - 584
[10] AUTOSCALABILE DISTRIBUTED ANTI-SPAM SMTP SYSTEM BASED ON KUBERNETES
Gavrilovic, Nadja
Ciric, Vladimir
FACTA UNIVERSITATIS-SERIES ELECTRONICS AND ENERGETICS, 2021, 34 (04) : 525 - 546

← 1 2 3 →