Spam detection on Twitter using a support vector machine and users’ features by identifying their interactions

Cited by: 1
Authors
Saleh Beyt Sheikh Ahmad
Mahnaz Rafie
Seyed Mojtaba Ghorabie
Affiliations
[1] Arvandan Nonprofit Higher Education Institute, Department of Computer Engineering
[2] Islamic Azad University, Department of Computer Engineering, Ramhormoz Branch
[3] Islamic Azad University, Department of Computer Engineering, International Branch
Source
Multimedia Tools and Applications | 2021 / Vol. 80
Keywords
Tweet; Twitter; Spam; Support vector machine
DOI
Not available
Abstract
Spam tweets can cause numerous problems for users. This paper proposes an automatic method for detecting spam tweets, built on pre-processing and feature extraction steps. Pre-processing is significant for this problem because of the specific structure of tweets: it leaves in each tweet only the words that can play a key role in determining whether the tweet is spam or non-spam. In the proposed method, the features are grouped into five classes (user profile features, account information features, user activity-based features, user interaction-based features, and tweet content-based features) comprising 28 different features. In the feature selection step, an optimal subset of these features is selected for the learning process. A support vector classifier with two kernels, Gaussian and polynomial, is then used for learning. Finally, the proposed method is compared with multi-layer perceptron (MLP), Naive Bayes (NB), random forest (RF), and k-nearest neighbors (KNN) classifiers in terms of standard criteria. The results show that the proposed method, using the support vector machine (SVM) algorithm with a polynomial kernel, outperforms the other methods overall, achieving 0.988 precision, 0.953 efficiency, 0.96 accuracy, an F-measure of 0.969, and an area under the ROC curve of 0.985.
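The abstract does not spell out the pre-processing rules, so the following is only a minimal sketch of what such a step might look like, not the authors' procedure. It assumes that URLs, @mentions, the retweet marker, punctuation, and common stop words are discarded and that hashtag words are kept. The function name `preprocess_tweet`, the regular expressions, and the tiny `STOP_WORDS` set are all hypothetical choices made for illustration.

```python
import re

# Hypothetical stop-word list; the abstract does not say which words are dropped.
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "of", "and", "in", "for", "on", "this", "at"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")  # links
MENTION_RE = re.compile(r"@\w+")               # @user mentions
TOKEN_RE = re.compile(r"[a-z]+")               # alphabetic runs only


def preprocess_tweet(text):
    """Return the lowercase word tokens left after stripping tweet-specific noise."""
    text = text.lower()
    text = URL_RE.sub(" ", text)      # drop links
    text = MENTION_RE.sub(" ", text)  # drop @mentions
    text = text.replace("#", " ")     # keep the hashtag word, drop the marker
    tokens = TOKEN_RE.findall(text)   # discards digits and punctuation
    # Assumption: "rt" (retweet marker) and stop words carry no spam signal.
    return [t for t in tokens if t not in STOP_WORDS and t != "rt"]


print(preprocess_tweet("RT @shop: WIN a FREE phone!!! Click http://spam.example/x #giveaway"))
# prints ['win', 'free', 'phone', 'click', 'giveaway']
```

In a pipeline like the one described, the remaining tokens would feed the tweet content-based features, while the other four feature classes would come from account and interaction data rather than from the tweet text.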
Pages: 11583-11605
Page count: 22