ASAVACT: Arabic sentiment analysis for vaccine-related COVID-19 tweets using deep learning

被引:5
作者
Alhumoud, Sarah [1 ]
Al Wazrah, Asma [1 ]
Alhussain, Laila [1 ]
Alrushud, Lama [1 ]
Aldosari, Atheer [1 ]
Altammami, Reema Nasser [1 ]
Almukirsh, Njood [1 ]
Alharbi, Hind [1 ]
Alshahrani, Wejdan [1 ]
机构
[1] Al Imam Mohamed Ibn Saud Islamic Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh, Saudi Arabia
关键词
Deep learning; Machine learning; Text mining; Natural language processing; Sentiment analysis; COVID-19; vaccine; Twitter;
D O I
10.7717/peerj-cs.1507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
COVID-19 has become a global pandemic that has affected not only the health sector but also economic, social, and psychological well-being. Individuals are using social media platforms to communicate their feelings and sentiments about the pandemic. One of the most debated topics in that regard is the vaccine. People are divided mainly into two groups, pro-vaccine and anti-vaccine. This article aims to explore Arabic Sentiment Analysis for Vaccine-Related COVID-19 Tweets (ASAVACT) to quantify sentiment polarity shared publicly, and it is considered the first and the largest human-annotated dataset in Arabic. The analysis is done using state-of-theart deep learning models that proved superiority in the field of language processing and analysis. The models are the stacked gated recurrent unit (SGRU), the stacked bidirectional gated recurrent unit (SBi-GRU), and the ensemble architecture of SGRU, SBi-GRU, and AraBERT. Additionally, this article presents the largest Arabic Twitter corpus on COVID-19 vaccination, with 32,476 annotated Tweets. The results show that the ensemble model outperformed other singular models with at least 7% accuracy enhancement.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 34 条
[1]   Top Concerns of Tweeters During the COVID-19 Pandemic: Infoveillance Study [J].
Abd-Alrazaq, Alaa ;
Alhuwail, Dari ;
Househ, Mowafa ;
Hamdi, Mounir ;
Shah, Zubair .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (04)
[2]  
Addawood A, 2020, P 1 WORKSH NLP COV 2, DOI [10.18653/v1/2020.nlpcovid19-2.24, DOI 10.18653/V1/2020.NLPCOVID19-2.24]
[3]  
Al Twairesh N., 2016, Arabic spam detection in Twitter, P38
[4]   Sentiment Analysis Using Stacked Gated Recurrent Unit for Arabic Tweets [J].
Al Wazrah, Asma ;
Alhumoud, Sarah .
IEEE ACCESS, 2021, 9 :137176-137187
[5]  
Alanezi M A., 2020 International Conference on Data Analytics for Business and Industry: Way Towards a Sustainable Economy (ICDABI), V2020, P1, DOI DOI 10.1109/ICDABI51230.2020.9325679
[6]   Arabic Sentiment Analysis using Deep Learning for COVID-19 Twitter Data [J].
Alhumoud, Sarah .
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (09) :132-138
[7]   Twitter Analysis for Intelligent Transportation [J].
Alhumoud, Sarah .
COMPUTER JOURNAL, 2019, 62 (11) :1547-1556
[8]  
Alhumoud S, 2015, 2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), P417
[9]  
Alhumoud SO., 2015, International Science Index, V9, P364
[10]  
Alsudias L., 2020, P 1 WORKSH NLP COVID