Clickbait in Social Media : Detection and Analysis of the Bait

被引:9
作者
Jain, Mini [1 ]
Mowar, Peya [1 ]
Goel, Ruchika [1 ]
Vishwakarma, Dinesh K. [1 ]
机构
[1] Delhi Technol Univ, Dept Informat Technol, Delhi, India
来源
2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS) | 2021年
关键词
clickbait detection; ensemble learning; stacking classifier; visual; instagram; twitter;
D O I
10.1109/CISS50987.2021.9400293
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Taking advantage of visual-centric social media's rising popularity, content creators have started using enticing images to lure users into clicking on bothersome clickbaits, in place of previously used text-based baits. In addition, the development of a single model to detect clickbait on multiple image-centric social media platforms is largely an unexplored problem. Therefore, we introduce a novel model that can detect visual clickbaits on both Instagram and Twitter posts. The proposed model consists of a stacking classifier framework composed of six base models (K-Nearest Neighbors, Support Vector Machine, XGBoost, Naive Bayes, Logistic Regression, and Multilayer Perceptron) and a meta-classifier (Random Forest). The developed classifier achieved an accuracy of 88.5% for Instagram posts and 85% for Twitter posts, which is an improvement over previous separate state-of-the-art models for both platforms. Additionally, the stated classifier does not use meta-features (e.g., the number of likes or followers) for classification, which helps to detect potential clickbaits right away, enhancing its applicability in real-time clickbait detection use cases. Furthermore, based on our analysis, we have drawn essential conclusions about the telling characteristics of clickbaits.
引用
收藏
页数:6
相关论文
共 14 条
[1]  
Agrawal A, 2016, PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), P268, DOI 10.1109/NGCT.2016.7877426
[2]  
Biyani P, 2016, AAAI CONF ARTIF INTE, P94
[3]  
Chakraborty A, 2016, PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, P9, DOI 10.1109/ASONAM.2016.7752207
[4]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[5]  
Chesney S., 2017, P EMNLP WORKSH NAT L, P56
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Ha Yui, 2018, ICWSM
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
Potthast Martin, 2016, Advances in Information Retrieval. 38th European Conference on IR Research, ECIR 2016. Proceedings