They May Not Work! An evaluation of eleven sentiment analysis tools on seven social media datasets

被引:9
作者
He, Lu [1 ]
Yin, Tingjue [1 ]
Zheng, Kai [1 ,2 ]
机构
[1] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Dept Informat, Irvine, CA USA
[2] Univ Calif Irvine, Sch Med, Dept Emergency Med, Irvine, CA USA
关键词
Social media; Sentiment analysis; Natural language processing; Consumer health information;
D O I
10.1016/j.jbi.2022.104142
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: Sentiment analysis is an important method for understanding emotions and opinions expressed through social media exchanges. Little work has been done to evaluate the performance of existing sentiment analysis tools on social media datasets, particularly those related to health, healthcare, or public health. This study aims to address the gap. Material and methods: We evaluated 11 commonly used sentiment analysis tools on five health-related social media datasets curated in previously published studies. These datasets include Human Papillomavirus Vaccine, Health Care Reform, COVID-19 Masking, Vitals.com Physician Reviews, and the Breast Cancer Forum from MedHelp.org. For comparison, we also analyzed two non-health datasets based on movie reviews and generic tweets. We conducted a qualitative error analysis on the social media posts that were incorrectly classified by all tools. Results: The existing sentiment analysis tools performed poorly with an average weighted F1 score below 0.6. The inter-tool agreement was also low; the average Fleiss Kappa score is 0.066. The qualitative error analysis identified two major causes for misclassification: (1) correct sentiment but on wrong subject(s) and (2) failure to properly interpret inexplicit/indirect sentiment expressions. Discussion and conclusion: The performance of the existing sentiment analysis tools is insufficient to generate accurate sentiment classification results. The low inter-tool agreement suggests that the conclusion of a study could be entirely driven by the idiosyncrasies of the tool selected, rather than by the data. This is very concerning especially if the results may be used to inform important policy decisions such as mask or vaccination mandates.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Sentiment Analysis on Social Media Using Morphological Sentence Pattern Model
    Han, Youngsub
    Kim, Kwangmi Ko
    2017 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS (SERA), 2017, : 79 - 84
  • [22] Improving Sentiment Analysis of Social Media Captions through Advancements in NLP
    Harsath, Abdul M.
    Arshad, A.
    Prasath, Hari R.
    Srikanth, R.
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 732 - 736
  • [23] Seeker Optimization with Deep Learning Enabled Sentiment Analysis on Social Media
    Alghamdi, Hanan M.
    Hamza, Saadia H. A.
    Mashraqi, Aisha M.
    Abdel-Khalek, Sayed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 5985 - 5999
  • [24] Sentiment Analysis Using Word Polarity of Social Media
    Kigon Lyu
    Hyeoncheol Kim
    Wireless Personal Communications, 2016, 89 : 941 - 958
  • [25] A Survey of Sentiment Analysis from Social Media Data
    Chakraborty, Koyel
    Bhattacharyya, Siddhartha
    Bag, Rajib
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (02): : 450 - 464
  • [26] Exploring Semantic Relations for Social Media Sentiment Analysis
    Zeng, Jiandian
    Zhou, Jiantao
    Huang, Caishi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2382 - 2394
  • [27] Sentiment Analysis on Educational Posts from Social Media
    Relucio, Floradel S.
    Palaoag, Thelma D.
    2018 9TH INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS, E-MANAGEMENT AND E-LEARNING (IC4E 2018), 2018, : 99 - 102
  • [28] Sentiment Analysis for Arabic Social Media News Polarity
    Hnaif, Adnan A.
    Kanan, Emran
    Kanan, Tarek
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 28 (01) : 107 - 119
  • [29] Deep Learning for Automated Sentiment Analysis of Social Media
    Cheng, Li-Chen
    Tsai, Song-Lin
    PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2019), 2019, : 1001 - 1004
  • [30] Sentiment Analysis of National Tourism Organizations on Social Media
    Hruska, Jan
    HRADEC ECONOMIC DAYS 2020, VOL 10, PT 1, 2020, 10 : 250 - 256