Adapting recurrent neural networks for classifying public discourse on COVID-19 symptoms in Twitter content

Cited by: 7
Authors
Amin, Samina [1]
Alharbi, Abdullah [2]
Uddin, M. Irfan [1]
Alyami, Hashem [3]
Affiliations
[1] Kohat Univ Sci & Technol, Inst Comp, Kohat 2600, Pakistan
[2] Taif Univ, Coll Comp & Informat Technol, Dept Informat Technol, POB 11099, Taif 21944, Saudi Arabia
[3] Taif Univ, Coll Comp & Informat Technol, Dept Comp Sci, POB 11099, Taif 21944, Saudi Arabia
Keywords
Deep learning; Coronavirus; Pandemic; COVID-19; Classification; Recurrent neural networks; Twitter; Tweets
DOI
10.1007/s00500-022-07405-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
COVID-19, which emerged in December 2019, has claimed many lives and affected all aspects of human life. The World Health Organization (WHO) subsequently declared it a pandemic, and it has put massive pressure on global health. During this ongoing pandemic, the rapid growth of social media platforms has provided valuable resources for distributing information, as well as a source of self-reported disease symptoms in public discourse. There is therefore an urgent need for effective approaches to detect self-reported symptoms or cases in social media content. In this study, we scraped public discourse on COVID-19 symptoms from Twitter. We built a large dataset of self-reported COVID-19 symptoms and gold-annotated the tweets into four categories: confirmed, death, suspected, and recovered. We then applied machine learning and deep learning models, each with its own feature representation. Experiments were conducted with recurrent neural network (RNN) variants, and their performance was compared with that of traditional machine learning algorithms. The experimental results indicate that optimizing the area under the curve (AUC) improves model performance and that long short-term memory (LSTM) achieves the highest accuracy in detecting COVID-19 symptoms in real-time public messaging. The LSTM classifier in the proposed pipeline achieves a classification accuracy of 90.7%, outperforming existing state-of-the-art algorithms for multi-class classification.
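The abstract reports accuracy and AUC over four tweet categories but does not say how these were computed. As a minimal sketch, assuming a one-vs-rest, macro-averaged AUC (an assumption, not something the abstract states), the two metrics could be computed as below. The function names and the toy predictions are hypothetical and are not taken from the paper or its dataset.

```python
from typing import Dict, List, Sequence

# The four annotation categories named in the abstract.
LABELS = ["confirmed", "death", "suspected", "recovered"]


def accuracy(y_true: Sequence[str], y_pred: Sequence[str]) -> float:
    """Fraction of tweets whose predicted label matches the gold label."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def binary_auc(is_positive: Sequence[bool], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    Ties count as half a win (the Mann-Whitney U formulation).
    """
    pos = [s for flag, s in zip(is_positive, scores) if flag]
    neg = [s for flag, s in zip(is_positive, scores) if not flag]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def macro_ovr_auc(y_true: Sequence[str], probs: Sequence[Dict[str, float]]) -> float:
    """Unweighted mean of the one-vs-rest AUCs over all four labels (assumed scheme)."""
    aucs: List[float] = []
    for label in LABELS:
        flags = [t == label for t in y_true]
        scores = [p[label] for p in probs]
        aucs.append(binary_auc(flags, scores))
    return sum(aucs) / len(aucs)


# Toy gold labels and class probabilities, invented purely for illustration.
y_true = ["confirmed", "death", "suspected", "recovered", "confirmed"]
probs = [
    {"confirmed": 0.7, "death": 0.1, "suspected": 0.1, "recovered": 0.1},
    {"confirmed": 0.2, "death": 0.6, "suspected": 0.1, "recovered": 0.1},
    {"confirmed": 0.4, "death": 0.1, "suspected": 0.2, "recovered": 0.3},
    {"confirmed": 0.1, "death": 0.1, "suspected": 0.2, "recovered": 0.6},
    {"confirmed": 0.5, "death": 0.2, "suspected": 0.2, "recovered": 0.1},
]
# Predicted label = class with the highest probability.
y_pred = [max(p, key=p.get) for p in probs]

print(accuracy(y_true, y_pred))      # 0.8
print(macro_ovr_auc(y_true, probs))  # 0.9375
```

In this toy example the third tweet is misclassified, which lowers accuracy to 0.8, while the tied scores for the "suspected" class lower its one-vs-rest AUC to 0.75. The example shows that the two metrics can move independently on the same predictions.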
Pages: 11077-11089
Number of pages: 13