共 14 条
Tweet Spam Detection Using Machine Learning and Swarm Optimization Techniques
被引:5
作者:
Manasa, Pinnapureddy
[1
]
Malik, Arun
[1
]
Alqahtani, Khaled N.
[2
]
Alomar, Madani Abdu
[3
]
Basingab, Mohammed Salem
[3
]
Soni, Mukesh
[4
]
Rizwan, Ali
[3
]
Batra, Isha
[1
]
机构:
[1] Lovely Profess Univ, Dept Comp Sci & Engn, Phagwara 144402, India
[2] Taibah Univ, Coll Engn, Dept Ind Engn, Medina 41411, Saudi Arabia
[3] King Abdulaziz Univ, Fac Engn, Dept Ind Engn, Jeddah 21589, Saudi Arabia
[4] Chandigarh Univ, Univ Ctr Res & Dev, Dept Comp Sci & Engn, Mohali 140413, India
关键词:
Social networking (online);
Blogs;
Feature extraction;
Random forests;
Unsolicited e-mail;
Particle swarm optimization;
Metaheuristics;
Adaboost (AB);
metaheuristic features;
stochastic gradient descent (SGD);
tweet spam;
Whale swam optimization algorithm (WOA);
ACCOUNTS;
D O I:
10.1109/TCSS.2022.3230823
中图分类号:
TP3 [计算技术、计算机技术];
学科分类号:
0812 ;
摘要:
Social media networking platforms connect people living in every corner of the world. Twitter has now become a popular microblogging service, allowing users to express themselves and keep up with current events. Twitter has attracted spammers because to its popularity and ease of use. As a result, spam identification (ID) has become one of the most pressing issues. It is vital to detect and filter spam tweets as well as their owners in order to provide a spam-free environment. In this article, a spam detection method is proposed using a swarm optimization approach on a tweet-by-tweet basis. A spam tweet detection dataset is used to train the machine learning (ML) model. Metaheuristic features are created based on the input features in the dataset. Whale swam optimization algorithm (WOA) is used to select the important features before classification. The conventional objective function of WOA is modified into stochastic gradient descent (SGD) to perform feature selection. The selected subset of features is used to train the Adaboost (AB) classifier to detect the spam in the tweets. The AB classifier produced the best results in combination with WOA and SGD. The obtained accuracy is 99.85% in testing with a minimum subset of seven features and in the least possible minimum time of 17.9 s.
引用
收藏
页码:4870 / 4877
页数:8
相关论文