A deep learning framework for clickbait detection on social area network using natural language cues

被引:24
作者
Naeem, Bilal [1 ]
Khan, Aymen [1 ]
Beg, Mirza Omer [1 ]
Mujtaba, Hasan [1 ]
机构
[1] Natl Univ Comp & Emerging Sci, AK Brohi Rd,Sect H-11-4, Islamabad, Pakistan
来源
JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE | 2020年 / 3卷 / 01期
关键词
Natural language processing; Artificial intelligence; Clickbait detection; Deep learning; Recurrent neural network; LSTM;
D O I
10.1007/s42001-020-00063-y
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Social networks are generating huge amounts of complex textual data which is becoming increasingly difficult to process intelligently. Misinformation on social media networks, in the form of fake news, has the power to influence people, sway opinions and even have a decisive impact on elections. To shield ourselves against manipulative misinformation, we need to develop a reliable mechanism to detect fake news. Yellow journalism along with sensationalism has done a lot of damage by misrepresenting facts and manipulating readers into believing false narratives through hyperbole. Clickbait does exactly this by using characteristics of natural language to entice users into clicking a link and can hence be classified as fake news. In this paper, we present a deep learning framework for clickbait detection. The framework is trained to model the intrinsic characteristics of clickbait for knowledge discovery and then used for decision making by classifying headlines as either clickbait or legitimate news. We focus our attention on the linguistic analysis during the knowledge discovery phase as we investigate the underlying structure of clickbait headlines using our Part of Speech Analysis Module. The decision-making task of classification is carried out using long short-term memory. We believe that it is our framework's architecture that has played a pivotal role to outperform the current state of the art with a classification accuracy of 97%.
引用
收藏
页码:231 / 243
页数:13
相关论文
共 21 条
[1]  
Agrawal A, 2016, PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), P268, DOI 10.1109/NGCT.2016.7877426
[2]  
[Anonymous], 2016, ABC NEWS
[3]  
Bourgonje P., 2017, P 2017 EMNLP WORKSH, P84
[4]   Mining urban events from the tweet stream through a probabilistic mixture model [J].
Capdevila, Joan ;
Cerquides, Jesus ;
Torres, Jordi .
DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (03) :764-786
[5]  
Conroy N. J., 2015, INFORM SCI IMPACT RE
[6]  
Conroy N.J., 2015, P ASS INF SCI TECHN
[7]  
Conroy Niall J, 2015, Proc. Assoc. Inf. Sci. Technol, P82, DOI DOI 10.1002/PRA2.2015.145052010082
[8]  
Freid J, 2018, MARKETING LAND
[9]  
Hackett R., 2017, FORTUNE 0822
[10]  
Kai Shu, 2017, ACM SIGKDD Explorations Newsletter, V19, P22, DOI 10.1145/3137597.3137600