ADAPTATION OF DOMAIN-SPECIFIC TRANSFORMER MODELS WITH TEXT OVERSAMPLING FOR SENTIMENT ANALYSIS OF SOCIAL MEDIA POSTS ON COVID-19 VACCINE

Cited by: 1
Authors
Bansal, Anmol [1]
Choudhry, Arjun [1]
Sharma, Anubhav [1]
Susan, Seba [1]
Affiliations
[1] Delhi Technol Univ, New Delhi, India
Source
COMPUTER SCIENCE-AGH | 2023 / Vol. 24 / Issue 02
Keywords
Covid-19; vaccine; transformer; Twitter; BERTweet; CT-BERT; BERT; XLNet; RoBERTa; text oversampling; LMOTE; class imbalance; small sample data set
DOI
10.7494/csci.2023.24.2.4761
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
Covid-19 has spread across the world, and several vaccines have been developed to counter its surge. To identify the correct sentiments that are associated with the vaccines from social media posts, we fine-tune various state-of-the-art pre-trained transformer models on tweets that are associated with Covid-19 vaccines. Specifically, we use the recently introduced state-of-the-art RoBERTa, XLNet, and BERT pre-trained transformer models, and the domain-specific CT-BERT and BERTweet transformer models that have been pre-trained on Covid-19 tweets. We further explore the option of text augmentation by oversampling using the language model-based oversampling technique (LMOTE) to improve the accuracies of these models, specifically for small-sample data sets where there is an imbalanced class distribution among the positive, negative, and neutral sentiment classes. Our results summarize our findings on the suitability of text oversampling for imbalanced small-sample data sets that are used to fine-tune state-of-the-art pre-trained transformer models as well as the utility of domain-specific transformer models for the classification task.
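
As a rough illustration of the class-imbalance problem the abstract refers to, the sketch below balances a three-class tweet data set by randomly duplicating minority-class examples. This is plain random oversampling, swapped in for simplicity; it is not LMOTE, which the abstract describes as language model-based, and it is not the authors' pipeline. The function name `random_oversample` and the toy tweets are hypothetical and do not come from the paper.

```python
import random
from collections import Counter, defaultdict


def random_oversample(texts, labels, seed=0):
    """Duplicate minority-class texts until every class matches the largest one.

    Assumption: simple random duplication, NOT the LMOTE technique from the paper.
    """
    rng = random.Random(seed)

    # Group the texts by their sentiment label.
    by_class = defaultdict(list)
    for text, label in zip(texts, labels):
        by_class[label].append(text)

    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_class.values())

    out_texts, out_labels = [], []
    for label, items in by_class.items():
        # Draw the missing examples (with replacement) from the same class.
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for text in items + extra:
            out_texts.append(text)
            out_labels.append(label)
    return out_texts, out_labels


# Hypothetical toy data: 3 neutral, 2 positive, 1 negative.
texts = [
    "Vaccine appointments open on Monday.",
    "The clinic is two blocks from here.",
    "Second dose is scheduled for next month.",
    "Got my shot today, feeling great!",
    "So relieved my parents are vaccinated.",
    "The side effects were awful.",
]
labels = ["neutral", "neutral", "neutral", "positive", "positive", "negative"]

bal_texts, bal_labels = random_oversample(texts, labels)
print(Counter(bal_labels))
```

In a fine-tuning setup, a step like this would normally be applied to the training split only, so that duplicated examples do not leak into validation or test data.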
Pages: 167-186
Number of pages: 20