Spam detection on Twitter using a support vector machine and users’ features by identifying their interactions

Cited by: 1
Authors
Saleh Beyt Sheikh Ahmad
Mahnaz Rafie
Seyed Mojtaba Ghorabie
Affiliations
[1] Arvandan Nonprofit Higher Education Institute, Department of Computer Engineering
[2] Islamic Azad University, Department of Computer Engineering, Ramhormoz Branch
[3] Islamic Azad University, Department of Computer Engineering, International Branch
Source
Multimedia Tools and Applications | 2021 / Vol. 80
Keywords
Tweet; Twitter; Spam; Support vector machine
DOI
Not available
Abstract
Spam tweets can cause numerous problems for users. An automatic method is proposed to detect spam tweets, based on pre-processing and feature extraction steps. Pre-processing is significant for this problem because of the specific structure of tweets: it is performed so that only the words that can play a key role in determining whether a tweet is spam or non-spam remain in each tweet. The proposed method groups 28 different features into five classes: user profile features, account information features, user activity-based features, user interaction-based features, and tweet content-based features. In the feature selection step, an optimal subset of these features is selected for the learning process. A support vector classifier is then trained with two kernels, Gaussian and polynomial. Finally, the proposed method is compared with multi-layer perceptron (MLP), Naive Bayes (NB), random forest (RF), and k-nearest neighbors (KNN) methods in terms of standard criteria. The results show that the proposed method, using the support vector machine (SVM) algorithm with a polynomial kernel, outperforms the other methods overall, with 0.988 precision, 0.953 efficiency, 0.96 accuracy, an F-measure of 0.969, and an area under the ROC curve of 0.985.
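The abstract names the two kernels used with the support vector classifier but not their exact form or parameters. As a minimal sketch, the commonly used definitions of the Gaussian (RBF) and polynomial kernels can be written as follows. The parameter names and values (`gamma`, `degree`, `coef0`) and the two feature vectors are assumptions chosen for illustration; they are not taken from the paper.

```python
import math


def gaussian_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel: exp(-gamma * ||x - y||^2).

    gamma is an assumed illustrative value, not a setting from the paper.
    """
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def polynomial_kernel(x, y, degree=3, gamma=1.0, coef0=1.0):
    """Polynomial kernel: (gamma * <x, y> + coef0) ** degree.

    degree, gamma, and coef0 are assumed illustrative values.
    """
    dot = sum(a * b for a, b in zip(x, y))
    return (gamma * dot + coef0) ** degree


# Hypothetical feature vectors standing in for two tweets' selected features.
u = [1.0, 0.0, 2.0]
v = [0.0, 1.0, 2.0]

print(gaussian_kernel(u, v))    # similarity under the Gaussian kernel
print(polynomial_kernel(u, v))  # similarity under the polynomial kernel
```

Both functions return a similarity score for a pair of feature vectors; a support vector classifier uses such scores in place of plain dot products, which is what allows it to learn non-linear decision boundaries.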
Pages: 11583-11605
Page count: 22