Performance Evaluation of Machine Learning Algorithms for Spam Profile Detection on Twitter Using WEKA and RapidMiner

被引:6
作者
Hanif, Mohamad Hazim Md [1 ]
Adewole, Kayode Sakariyah [1 ]
Anuar, Nor Badrul [1 ]
Kamsin, Amirrudin [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
关键词
Online Social Network; Spam Detection; WEKA; RapidMiner; Classification; Machine Learning; ACCOUNTS;
D O I
10.1166/asl.2018.10683
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Twitter social network is growing on a daily basis and as a result, attackers have developed interest in distributing malicious contents on this platform. Numerous studies have investigated the possibility of reducing spamming activities on Twitter with each study focusing on introducing a new set of features for countermeasure. This paper adopts the set of features for identifying spammers on Twitter and introduces additional features to improve classifier performance. The performance of four machine learning algorithms: Random forest (RF), Support vector machine (SVM), K nearest neighbor (KNN), and Multilayer perceptron (MLP) across two popular machine learning tools-WEKA and RapidMiner were evaluated. Results from the experiment show that SVM, KNN, and MLP on WEKA outperformed those algorithms on RapidMiner. However, in the case of RF, RapidMiner achieved higher accuracy compare to RF on WEKA. Based on the 32 features in the dataset, MLP and RF on both WEKA and RapidMiner outperformed other classifiers with accuracy of 95.42% and 95.44% respectively. These findings would be useful for researchers willing to develop a machine learning model to detect malicious activities on social network.
引用
收藏
页码:1043 / 1046
页数:4
相关论文
共 19 条
[1]  
Adikari S., 2014, IDENTIFYING FAKE PRO
[2]  
Aggarwal A, 2012, ECRIM RES SUM
[3]  
Ahmed F., 2012, 2012 IEEE 11 INT C T
[4]   A generic statistical approach for spam detection in Online Social Networks [J].
Ahmed, Faraz ;
Abulaish, Muhammad .
COMPUTER COMMUNICATIONS, 2013, 36 (10-11) :1120-1129
[5]  
Alsaleh M, 2014, 2014 13 IEEE INT C M
[6]  
[Anonymous], LNCS
[7]  
[Anonymous], NUMB 150 AM TWITT ST
[8]  
Benevenuto F., 2010, Detecting Spammers on Twitter
[9]   Detecting Automation of Twitter Accounts: Are You a Human, Bot, or Cyborg? [J].
Chu, Zi ;
Gianvecchio, Steven ;
Wang, Haining ;
Jajodia, Sushil .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2012, 9 (06) :811-824
[10]   Towards Detecting Compromised Accounts on Social Networks [J].
Egele, Manuel ;
Stringhini, Gianluca ;
Kruegel, Christopher ;
Vigna, Giovanni .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2017, 14 (04) :447-460