Twitter Spam Detection Using Naive Bayes Classifier

被引:4
作者
Santoshi, K. Ushasree [1 ]
Bhavya, S. Sree [1 ]
Sri, Y. Bhavya [1 ]
Venkateswarlu, B. [2 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept CSE, Vaddeswaram, Andhra Pradesh, India
[2] Koneru Lakshmaiah Educ Educ, Dept CSE, Vaddeswaram, Andhra Pradesh, India
来源
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021) | 2021年
关键词
Malicious Tweets; filtering; ham and spam; deep learning; SVM; naive Bayes classifier;
D O I
10.1109/ICICT50816.2021.9358579
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Twitter is the well liked social media platform that has over 300 million monthly users which post 500 million tweets per day. This is the main reason why spammers use Twitter for their spiteful doings such as spreading malignant software that steals the user personal information and tweets containing fake or faulty URLs, assertively follow or un-follow users and trending fake tweets to get users attention, spread pornography advertisements. In recent years twitter has reportedly collected the data of active users and analyzed their actions, the report clearly shows that over 32 million users have interacted with server for casual information in daily basis. Hence, identifying and filtering the malicious tweets or trends that are harmful or unwanted for users is very important in current social world. This paper discusses about the ways to analyze the tweets and classify them into spam and ham based on the words involved in tweets. Although there are various machine learning and deep learning methods to classify and detect spam tweets like SVM, clustering methods and binary detection models that are used Naive Bayes classifier. Recently, twitter users are experiencing data stealing malware by accessing or visiting unnecessary spam messages or tweets. It has to be considered seriously since many people are losing money or personal information. Besides data stealing malware, fake trends also been a threat. It has to be controlled. Spammers are likely to interact with more people because of the auto-follow option.
引用
收藏
页码:773 / 777
页数:5
相关论文
共 50 条
[31]   Hotspot diagnosis for solar photovoltaic modules using a Naive Bayes classifier [J].
Niazi, Kamran Ali Khan ;
Akhtar, Wajahat ;
Khan, Hassan A. ;
Yang, Yongheng ;
Athar, Shahrukh .
SOLAR ENERGY, 2019, 190 :34-43
[32]   Self-learning knowledge base using Naive Bayes classifier [J].
Lodhi, Pooja ;
Mishra, Omji ;
Jain, Shikha .
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2018, 12 (04) :491-498
[33]   An Integration of K-Means Clustering and Naive Bayes Classifier for Intrusion Detection [J].
Varuna, S. ;
Natesan, P. .
2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
[34]   A Model for Accurate Prediction in GeoRSS Data Using Naive Bayes Classifier [J].
Netti, K. ;
Radhika, Y. .
JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2017, 76 (08) :473-476
[35]   A FRAMEWORK FOR STUDENTS' ACADEMIC PERFORMANCE ANALYSIS USING NAIVE BAYES CLASSIFIER [J].
Aziz, Azwa Abdul ;
Ismail, Nur Hafieza ;
Ahmad, Fadhilah ;
Hassan, Hasni .
JURNAL TEKNOLOGI, 2015, 75 (03) :13-19
[36]   Applying Naive Bayes Classifier to Document Clustering [J].
Ji, Jie ;
Zhao, Qiangfu .
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2010, 14 (06) :624-630
[37]   A sequential naive Bayes classifier for DNA barcodes [J].
Anderson, Michael P. ;
Dubnicka, Suzanne R. .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2014, 13 (04) :423-434
[38]   LEARNING THE NAIVE BAYES CLASSIFIER WITH OPTIMIZATION MODELS [J].
Taheri, Sona ;
Mammadov, Musa .
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2013, 23 (04) :787-795
[39]   DECOMPOSABLE NAIVE BAYES CLASSIFIER FOR PARTITIONED DATA [J].
Khedr, Ahmed M. .
COMPUTING AND INFORMATICS, 2012, 31 (06) :1511-1531
[40]   Extended Naive Bayes classifier for mixed data [J].
Hsu, Chung-Chian ;
Huang, Yan-Ping ;
Chang, Keng-Wei .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) :1080-1083