Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society

被引:0
作者
Alam, Firoj [1 ]
Shaar, Shaden [1 ]
Dalvi, Fahim [1 ]
Sajjad, Hassan [1 ]
Nikolov, Alex [2 ]
Mubarak, Hamdy [1 ]
Martino, Giovanni Da San [3 ]
Abdelali, Ahmed [1 ]
Durrani, Nadir [1 ]
Darwish, Kareem [1 ]
Al-Homaid, Abdulaziz [1 ]
Zaghouani, Wajdi [4 ]
Caselli, Tommaso [5 ]
Danoe, Gijs [5 ]
Stolk, Friso [5 ]
Bruntink, Britt [5 ]
Nakov, Preslav [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Ar Rayyan, Qatar
[2] Sofia Univ, Sofia, Bulgaria
[3] Univ Padua, Padua, Italy
[4] Hamad Bin Khalifa Univ, Ar Rayyan, Qatar
[5] Univ Groningen, Groningen, Netherlands
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021 | 2021年
基金
美国国家科学基金会;
关键词
PROPPY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic. Fighting this infodemic has been declared one of the most important focus areas of the World Health Organization, with dangers ranging from promoting fake cures, rumors, and conspiracy theories to spreading xenophobia and panic. Addressing the issue requires solving a number of challenging problems such as identifying messages containing claims, determining their check-worthiness and factuality, and their potential to do harm as well as the nature of that harm, to mention just a few. To address this gap, we release a large dataset of 16K manually annotated tweets for fine-grained disinformation analysis that (i) focuses on COVID19, (ii) combines the perspectives and the interests of journalists, fact-checkers, social media platforms, policy makers, and society, and (iii) covers Arabic, Bulgarian, Dutch, and English. Finally, we show strong evaluation results using pretrained Transformers, thus confirming the practical utility of the dataset in monolingual vs. multilingual, and single task vs. multitask settings.
引用
收藏
页码:611 / 649
页数:39
相关论文
共 72 条
[1]  
Abdul-Mageed M, 2021, 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), P3402
[2]  
Alam F., 2021, P INT AAAI C WEB SOC, V15, P923
[3]  
Alam F., 2021, P INT AAAI C WEB SOC, P913, DOI DOI 10.1609/ICWSM.V15I1.18114
[4]  
Albaum G, 1997, J MARKET RES SOC, V39, P331
[5]  
Allport GW, 1947, The Psychology of Rumor
[6]  
Alshehri A., 2021, Proceedings of the Fourth Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda, P57, DOI [DOI 10.18653/V1/2021.NLP4IF-1.9, DOI 10.18653/V1/2021.NLP4IF-1]
[7]  
Antoun Wissam, 2020, P 4 WORKSH OP SOURC, P9
[8]  
Augenstein I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P4685
[9]  
Baly R, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2109
[10]  
Baly R, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P3364