Cross-SEAN: A cross-stitch semi-supervised neural attention model for COVID-19 fake news detection

Cited by: 91
Authors
Paka, William Scott [1]
Bansal, Rachit [2]
Kaushik, Abhay [3]
Sengupta, Shubhashis [4]
Chakraborty, Tanmoy [1]
Affiliations
[1] IIIT Delhi, Delhi, India
[2] DTU Delhi, Delhi, India
[3] IIT Kanpur, Kanpur, Uttar Pradesh, India
[4] Accenture Labs, Bangalore, Karnataka, India
Keywords
Fake news detection; Social media; COVID-19; Classification
DOI
10.1016/j.asoc.2021.107393
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As the COVID-19 pandemic sweeps across the world, it has been accompanied by a tsunami of fake news and misinformation on social media. At a time when reliable information is vital for public health and safety, COVID-19 related fake news has been spreading even faster than the facts. During times such as the COVID-19 pandemic, fake news can not only cause intellectual confusion but can also place people's lives at risk. This calls for an immediate need to contain the spread of such misinformation on social media. We introduce CTF, a large-scale COVID-19 Twitter dataset with labelled genuine and fake tweets. Additionally, we propose Cross-SEAN, a cross-stitch based semi-supervised end-to-end neural attention model which leverages the large amount of unlabelled data. Cross-SEAN partially generalises to emerging fake news as it learns from relevant external knowledge. We compare Cross-SEAN with seven state-of-the-art fake news detection methods. We observe that it achieves 0.95 F1 Score on CTF, outperforming the best baseline by 9%. We also develop Chrome-SEAN, a Cross-SEAN based Chrome extension for real-time detection of fake tweets. © 2021 Elsevier B.V. All rights reserved.
Pages: 13
Related papers
74 entries in total
[1]   Detecting opinion spams and fake news using text classification [J].
Ahmed, Hadeer ;
Traore, Issa ;
Saad, Sherif .
SECURITY AND PRIVACY, 2018, 1 (01)
[2]   How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble [J].
Akhtar, Md Shad ;
Ekbal, Asif ;
Cambria, Erik .
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2020, 15 (01) :64-75
[3]   Social Media and Fake News in the 2016 Election [J].
Allcott, Hunt ;
Gentzkow, Matthew .
JOURNAL OF ECONOMIC PERSPECTIVES, 2017, 31 (02) :211-235
[4]  
Angeli G., 2015, P 2015 C EMP METH NA, DOI 10.18653/V1/D15-1075
[5]   Multitask Learning for Blackmarket Tweet Detection [J].
Arora, Udit ;
Paka, William Scott ;
Chakraborty, Tanmoy .
PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2019), 2019, :127-130
[6]   ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis [J].
Basiri, Mohammad Ehsan ;
Nemati, Shahla ;
Abdar, Moloud ;
Cambria, Erik ;
Acharya, U. Rajendra .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 :279-294
[7]  
Bhardwaj M., 2020, ARXIV PREPRINT ARXIV
[8]   A survey on fake news and rumour detection techniques [J].
Bondielli, Alessandro ;
Marcelloni, Francesco .
INFORMATION SCIENCES, 2019, 497 :38-55
[9]  
Cambria E, 2017, SOCIO AFFECT COMPUT, V5, P1, DOI 10.1007/978-3-319-55394-8_1
[10]  
Carlson, 2020, COR TWEETS TWEETS JS