Advances in spam detection for email spam, web spam, social network spam, and review spam: ML-based and nature-inspired-based techniques

被引:9
作者
Akinyelu, Andronicus A. [1 ]
机构
[1] Univ Free State, Dept Comp Sci & Informat, ZA-9301 Bloemfontein, South Africa
关键词
Spam detection; nature-inspired algorithm; machine learning; spam email; web spam; social network spam; review spam; PARTICLE SWARM OPTIMIZATION; FEATURE-SELECTION METHOD; INSTANCE SELECTION; HYBRID APPROACH; CLASSIFICATION; ALGORITHM; ANALYTICS; SEARCH;
D O I
10.3233/JCS-210022
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the great advances in spam detection, spam remains a major problem that has affected the global economy enormously. Spam attacks are popularly perpetrated through different digital platforms with a large electronic audience, such as emails, microblogging websites (e.g. Twitter), social networks (e.g. Facebook), and review sites (e.g. Amazon). Different spam detection solutions have been proposed in the literature, however, Machine Learning (ML) based solutions are one of the most effective. Nevertheless, most ML algorithms have computational complexity problem, thus some studies introduced Nature Inspired (NI) algorithms to further improve the speed and generalization performance of ML algorithms. This study presents a survey of recent ML-based and NI-based spam detection techniques to empower the research community with information that is suitable for designing effective spam filtering systems for emails, social networks, microblogging, and review websites. The recent success and prevalence of deep learning show that it can be used to solve spam detection problems. Moreover, the availability of large-scale spam datasets makes deep learning and big data solutions (such as Mahout) very suitable for spam detection. Few studies explored deep learning algorithms and big data solutions for spam detection. Besides, most of the datasets used in the literature are either small or synthetically created. Therefore, future studies can consider exploring big data solutions, big datasets, and deep learning algorithms for building efficient spam detection techniques.
引用
收藏
页码:473 / 529
页数:57
相关论文
共 196 条
[1]  
Abdulhamid Shafi'i Muhammad, 2018, International Journal of Computer Network and Information Security, V10, P60, DOI 10.5815/ijcnis.2018.01.07
[2]  
Abi-Haidar A., 2008, Artificial Life XI: Eleventh International Conference on the Simulation and Synthesis of Living Systems, P1
[3]  
Abu Hammad A.S., 2013, THESIS ISLAMIC U GAZ
[4]   Genetic optimized artificial immune system in spam detection: a review and a model [J].
Abu Zitar, Raed ;
Hamdan, Adel .
ARTIFICIAL INTELLIGENCE REVIEW, 2013, 40 (03) :305-377
[5]  
ACCC, 2019, SCAM STAT
[6]   SMSAD: a framework for spam message and spam account detection [J].
Adewole, Kayode Sakariyah ;
Anuar, Nor Badrul ;
Kamsin, Amirrudin ;
Sangaiah, Arun Kumar .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (04) :3925-3960
[7]  
Agarwal K, 2018, PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), P685, DOI 10.1109/ICCONS.2018.8662957
[8]   Detecting Deceptive Reviews using Generative Adversarial Networks [J].
Aghakhani, Hojjat ;
Machiry, Aravind ;
Nilizadeh, Shirin ;
Kruegel, Christopher ;
Vigna, Giovanni .
2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2018), 2018, :89-95
[9]  
Agrawal R., 1994, VLDB 94, P487
[10]   On the Performance of Cuckoo Search and Bat Algorithms Based Instance Selection Techniques for SVM Speed Optimization with Application to e-Fraud Detection [J].
Akinyelu, Andronicus Ayobami ;
Adewumi, Aderemi Oluyinka .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (03) :1348-1375