Classification of spammer and nonspammer content in online social network using genetic algorithm-based feature selection

被引:28
作者
Sahoo, Somya Ranjan [1 ]
Gupta, B. B. [1 ]
机构
[1] Natl Inst Technol Kurukshetra, Dept Comp Engn, Kurukshetra, Haryana, India
关键词
Online social networking; genetic algorithm; spammer; machine learning; HYBRID APPROACH;
D O I
10.1080/17517575.2020.1712742
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of online social network invokes social actors to share their personal information digitally. Moreover, it provides the facility to maintain their links with people of same interest globally. Take advantage of these services; it has become a fascinating testbed to invite various threats like a spammer. Detection of spammer in OSN is one of the most critical tasks. Spammer not only spreads unwanted or bad advertisement but does certain malicious activity in others' profiles. By clearly understanding the activities of different threats, some incremental and accurate approaches are needed for detecting spammer content and profiles involved in these activities by using social network services. Therefore, the focus of this article is to detect spammer content and account, specifically on the leading microblogging platform called Twitter. We propose a hybrid approach which leverages the capabilities of various machine learning algorithms to separate spammer and nonspammer contents and account. Initially, the optimisation algorithm called genetic algorithm analyses the various features and selects the best suitable features that influence the behaviour of user account, and these features are then used to train classifiers. Our framework achieved to severalise spammer and nonspammer content in an effective way. Finally, to prove the efficiency of our proposed framework, a comparative analysis is conducted with some existing state-of-art techniques. The experimental analysis shows that our approach achieves a high detection rate of 99.6%, which is better than other state-of-art techniques.
引用
收藏
页码:710 / 736
页数:27
相关论文
共 46 条
[1]   A Hybrid Approach for Detecting Spammers in Online Social Networks [J].
Alghamdi, Bandar ;
Xu, Yue ;
Watson, Jason .
WEB INFORMATION SYSTEMS ENGINEERING, WISE 2018, PT I, 2018, 11233 :189-198
[2]   DNS rule-based schema to botnet detection [J].
Alieyan, Kamal ;
Almomani, Ammar ;
Anbar, Mohammed ;
Alauthman, Mohammad ;
Abdullah, Rosni ;
Gupta, B. B. .
ENTERPRISE INFORMATION SYSTEMS, 2021, 15 (04) :545-564
[3]  
[Anonymous], 2012, PRESENTED PART 9 USE
[4]  
ASLAN CB, 2018, P 9 INT C SOC MED SO
[5]   Integro: Leveraging Victim Prediction for Robust Fake Account Detection in OSNs [J].
Boshmaf, Yazan ;
Logothetis, Dionysios ;
Siganos, Georgos ;
Leria, Jorge ;
Lorenzo, Jose ;
Ripeanu, Matei ;
Beznosov, Konstantin .
22ND ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2015), 2015,
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Investigating the deceptive information in Twitter spam [J].
Chen, Chao ;
Wen, Sheng ;
Zhang, Jun ;
Xiang, Yang ;
Oliver, Jonathan ;
Alelaiwi, Abdulhameed ;
Hassan, Mohammad Mehedi .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 72 :319-326
[8]  
Cheng Cao, 2015, Advances in Information Retrieval. 37th European Conference on IR Research (ECIR 2015). Proceedings: LNCS 9022, P703, DOI 10.1007/978-3-319-16354-3_77
[9]  
Chhabra M., 2013, SciRes Journal of Information Security, V4, P165
[10]   A PBNM and economic incentive-based defensive mechanism against DDoS attacks [J].
Dahiya, Amrita ;
Gupta, B. B. .
ENTERPRISE INFORMATION SYSTEMS, 2022, 16 (03) :406-426