Spam detection on Twitter using a support vector machine and users’ features by identifying their interactions

被引:1
作者
Saleh Beyt Sheikh Ahmad
Mahnaz Rafie
Seyed Mojtaba Ghorabie
机构
[1] Arvandan Nonprofit Higher Education Institute,Department of Computer Engineering
[2] Islamic Azad University,Department of Computer Engineering, Ramhormoz Branch
[3] Islamic Azad University,Department of Computer Engineering, International Branch
来源
Multimedia Tools and Applications | 2021年 / 80卷
关键词
Tweet; Twitter; Spam; Support vector machine;
D O I
暂无
中图分类号
学科分类号
摘要
Spam tweets might cause numerous problems for users. An automatic method is introduced as a proposed method to detect spam tweets. This method is based on pre-processing and feature extraction steps. The pre-processing step is significant for our problem due to the specific structure of tweets. The pre-processing step is performed in such a way that after which only the words remain in each tweet that can play a key role in determining whether the tweet is spam or non-spam. In the proposed method, the features are classified into five classes of user profile features, account information features, user activity based features, user interaction based features, and tweet content-based features including 28 different features. In the feature selection step, an optimal subset of these features is selected for the learning process. However, a support vector classifier is used for the learning process by two Gaussian and polynomial kernels. Finally, the proposed method is compared with multi-layer perceptron (MLP), Naive Bayes (NB), random forest (RF), and k-nearest neighbors (KNN) methods in terms of standard criteria. The obtained results show the superiority of the proposed method using support vector machine (SVM) algorithm and polynomial kernel with 0.988 precision, 0.953 efficiency, 0.96 accuracy, F-0.969, and 0.985 ROC area under the curve compared to the other methods, indicating that the proposed method has better performance overall.
引用
收藏
页码:11583 / 11605
页数:22
相关论文
共 50 条
[21]   Visual and textual features based email spam classification using S-Cuckoo search and hybrid kernel support vector machine [J].
Kumaresan, T. ;
Saravanakumar, S. ;
Balamurugan, R. .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1) :33-46
[22]   Visual and textual features based email spam classification using S-Cuckoo search and hybrid kernel support vector machine [J].
T. Kumaresan ;
S. Saravanakumar ;
R. Balamurugan .
Cluster Computing, 2019, 22 :33-46
[23]   Machine Learning based Optimization Scheme for Detection of Spam and Malware Propagation in Twitter [J].
Sheoran, Savita Kumari ;
Yadav, Partibha .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) :495-503
[24]   A Model for Spam Filtering Using Support Vector Machine and Artificial Immune System [J].
Jiang, Yaping ;
Guo, Hao ;
Guo, Peigen .
PROCEEDINGS OF THE 2017 2ND INTERNATIONAL SYMPOSIUM ON ADVANCES IN ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (ISAEECE 2017), 2017, 124 :334-337
[25]   Enhancement of spam detection mechanism based on hybrid -mean clustering and support vector machine [J].
Elssied, Nadir Omer Fadl ;
Ibrahim, Othman ;
Osman, Ahmed Hamza .
SOFT COMPUTING, 2015, 19 (11) :3237-3248
[26]   Masquerade Detection Using Support Vector Machine [J].
YANG Min WANG Lina ZHANG Huanguo CHEN Wei School of Computer Wuhan University Wuhan Hubei China .
WuhanUniversityJournalofNaturalSciences, 2005, (01) :103-106
[27]   Thermography Based Breast Cancer Detection Using Texture Features and Support Vector Machine [J].
Acharya, U. Rajendra ;
Ng, E. Y. K. ;
Tan, Jen-Hong ;
Sree, S. Vinitha .
JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (03) :1503-1510
[28]   Vehicle and Pedestrian Detection Using Support Vector Machine and Histogram of Oriented Gradients Features [J].
Chen, Zhiqian ;
Chen, Kai ;
Chen, James .
2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND APPLICATIONS (CSA), 2013, :365-368
[29]   Thermography Based Breast Cancer Detection Using Texture Features and Support Vector Machine [J].
U. Rajendra Acharya ;
E. Y. K. Ng ;
Jen-Hong Tan ;
S. Vinitha Sree .
Journal of Medical Systems, 2012, 36 :1503-1510
[30]   An innovative spam filtering model based on support vector machine [J].
Islam, Md. Rafiqul ;
Chowdhury, Morshed U. ;
Zhou, Wanlei .
INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006, :348-+