Spam detection on Twitter using a support vector machine and users’ features by identifying their interactions

被引:1
作者
Saleh Beyt Sheikh Ahmad
Mahnaz Rafie
Seyed Mojtaba Ghorabie
机构
[1] Arvandan Nonprofit Higher Education Institute,Department of Computer Engineering
[2] Islamic Azad University,Department of Computer Engineering, Ramhormoz Branch
[3] Islamic Azad University,Department of Computer Engineering, International Branch
来源
Multimedia Tools and Applications | 2021年 / 80卷
关键词
Tweet; Twitter; Spam; Support vector machine;
D O I
暂无
中图分类号
学科分类号
摘要
Spam tweets might cause numerous problems for users. An automatic method is introduced as a proposed method to detect spam tweets. This method is based on pre-processing and feature extraction steps. The pre-processing step is significant for our problem due to the specific structure of tweets. The pre-processing step is performed in such a way that after which only the words remain in each tweet that can play a key role in determining whether the tweet is spam or non-spam. In the proposed method, the features are classified into five classes of user profile features, account information features, user activity based features, user interaction based features, and tweet content-based features including 28 different features. In the feature selection step, an optimal subset of these features is selected for the learning process. However, a support vector classifier is used for the learning process by two Gaussian and polynomial kernels. Finally, the proposed method is compared with multi-layer perceptron (MLP), Naive Bayes (NB), random forest (RF), and k-nearest neighbors (KNN) methods in terms of standard criteria. The obtained results show the superiority of the proposed method using support vector machine (SVM) algorithm and polynomial kernel with 0.988 precision, 0.953 efficiency, 0.96 accuracy, F-0.969, and 0.985 ROC area under the curve compared to the other methods, indicating that the proposed method has better performance overall.
引用
收藏
页码:11583 / 11605
页数:22
相关论文
共 50 条
[41]   Twitter Feature Selection and Classification Using Support Vector Machine for Aspect-Based Sentiment Analysis [J].
Zainuddin, Nurulhuda ;
Selamat, Ali ;
Ibrahim, Roliana .
TRENDS IN APPLIED KNOWLEDGE-BASED SYSTEMS AND DATA SCIENCE, 2016, 9799 :269-279
[42]   Identifying translation initiation sites in prokaryotes using support vector machine [J].
Gao, Tingting ;
Yang, Zhixia ;
Wang, Yong ;
Jing, Ling .
JOURNAL OF THEORETICAL BIOLOGY, 2010, 262 (04) :644-649
[43]   A Support Vector Machine based Naive Bayes Algorithm for Spam Filtering [J].
Feng, Weimiao ;
Sun, Jianguo ;
Zhang, Liguo ;
Cao, Cuiling ;
Yang, Qing .
2016 IEEE 35TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2016,
[44]   Improvement of detection algorithm of extrasystoles based on support vector machine using multiple features in surface electrocardiogram [J].
Amishiki K. ;
Abe M. .
IEEJ Transactions on Electronics, Information and Systems, 2020, 140 (12) :1380-1385
[45]   Analysis of Features Selection for P2P Traffic Detection Using Support Vector Machine [J].
Jamil, Haitham A. ;
Zarei, Roozbeh ;
Fadlelssied, Nadir O. ;
Aliyu, M. ;
Nor, Sulaiman M. ;
Marsono, Muhammad N. .
2013 INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2013, :116-121
[46]   Disulfide connectivity prediction using support vector machine and novel features [J].
Tsai, CH ;
Tsai, HK ;
Chen, SC ;
Kao, CY .
METMBS '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2004, :391-395
[47]   Reputation Based Malware Detection Using Support Vector Machine [J].
Kalshetti, Urmila ;
Singh, Prashant ;
Bhapkar, Vaibhav ;
Gaikwad, Manish ;
Bhat, Arvind .
INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 :1338-1344
[48]   Support vector machine based arrhythmia classification using reduced features [J].
Song, MH ;
Lee, J ;
Cho, SP ;
Lee, KJ ;
Yoo, SK .
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2005, 3 (04) :571-579
[49]   Respiratory Sound Classification using Cepstral Features and Support Vector Machine [J].
Palaniappan, Rajkumar ;
Sundaraj, K. .
2013 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2013, :132-136
[50]   Automatic detection of voltage notches using support vector machine [J].
Qi R. ;
Zyabkina O. ;
Martinez D.A. ;
Meyer J. .
Renewable Energy and Power Quality Journal, 2021, 19 :528-533