ArCovidVac: Analyzing Arabic Tweets About COVID-19 Vaccination

被引:0
作者
Mubarak, Hamdy [1 ]
Hassan, Sabit [2 ]
Chowdhury, Shammur Absar [1 ]
Alam, Firoj [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
[2] Univ Pittsburgh, Pittsburgh, PA USA
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
关键词
COVID-19; Vaccination; Stance Detection; Arabic Tweets; Tweet Classification;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The emergence of the COVID-19 pandemic and the first global infodemic have changed our lives in many different ways. We relied on social media to get the latest information about COVID-19 pandemic and at the same time to disseminate information. The content in social media consisted not only health related advise, plans, and informative news from policymakers, but also contains conspiracies and rumors. It became important to identify such information as soon as they are posted to make an actionable decision (e.g., debunking rumors, or taking certain measures for traveling). To address this challenge, we developed and publicly released the first largest manually annotated Arabic tweet dataset, ArCovidVac, for the COVID-19 vaccination campaign, covering many countries in the Arab region. The dataset is enriched with different layers of annotation, including, (i) Informativeness (more vs. less important tweets); (ii) fine-grained tweet content types (e.g., advice, rumors, restriction, authenticate news/information); and (iii) stance towards vaccination (pro-vaccination, neutral, anti-vaccination). Further, we performed in-depth analysis of the data, exploring the popularity of different vaccines, trending hashtags, topics and presence of offensiveness in the tweets. We studied the data for individual types of tweets and temporal changes in stance towards vaccine. We benchmarked the ArCovidVac dataset using transformer models for informativeness, content types, and stance detection.
引用
收藏
页码:3220 / 3230
页数:11
相关论文
共 46 条
[1]  
Abdelali A., 2020, ABS200506557 ARXIV
[2]  
Abdelali A., 2021, ABS210210684 ARXIV
[3]  
Abdul-Mageed M., 2020, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
[4]  
Alam F, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, P611
[5]  
Alam F, 2021, Proceedings of the International AAAI Conference on Web and Social Media, V15, P913, DOI [10.1609/icwsm.v15i1.18114, DOI 10.1609/ICWSM.V15I1.18114]
[6]  
Alanazi E, 2020, ARXIV200404315
[7]   The growing amplification of social media: measuring temporal and social contagion dynamics for over 150 languages on Twitter for 2009-2020 [J].
Alshaabi, Thayer ;
Dewhurst, David Rushing ;
Minot, Joshua R. ;
Arnold, Michael V. ;
Adams, Jane L. ;
Danforth, Christopher M. ;
Dodds, Peter Sheridan .
EPJ DATA SCIENCE, 2021, 10 (01)
[8]  
Alsudias L., 2020, P 1 WORKSH NLP COVID
[9]  
Antoun W., 2020, Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection, P9
[10]  
Baly Ramy., 2018, P 2018 C N AM CHAPTE, V2, P21, DOI DOI 10.18653/V1/N18-2004