They May Not Work! An evaluation of eleven sentiment analysis tools on seven social media datasets

被引:9
|
作者
He, Lu [1 ]
Yin, Tingjue [1 ]
Zheng, Kai [1 ,2 ]
机构
[1] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Dept Informat, Irvine, CA USA
[2] Univ Calif Irvine, Sch Med, Dept Emergency Med, Irvine, CA USA
关键词
Social media; Sentiment analysis; Natural language processing; Consumer health information;
D O I
10.1016/j.jbi.2022.104142
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: Sentiment analysis is an important method for understanding emotions and opinions expressed through social media exchanges. Little work has been done to evaluate the performance of existing sentiment analysis tools on social media datasets, particularly those related to health, healthcare, or public health. This study aims to address the gap. Material and methods: We evaluated 11 commonly used sentiment analysis tools on five health-related social media datasets curated in previously published studies. These datasets include Human Papillomavirus Vaccine, Health Care Reform, COVID-19 Masking, Vitals.com Physician Reviews, and the Breast Cancer Forum from MedHelp.org. For comparison, we also analyzed two non-health datasets based on movie reviews and generic tweets. We conducted a qualitative error analysis on the social media posts that were incorrectly classified by all tools. Results: The existing sentiment analysis tools performed poorly with an average weighted F1 score below 0.6. The inter-tool agreement was also low; the average Fleiss Kappa score is 0.066. The qualitative error analysis identified two major causes for misclassification: (1) correct sentiment but on wrong subject(s) and (2) failure to properly interpret inexplicit/indirect sentiment expressions. Discussion and conclusion: The performance of the existing sentiment analysis tools is insufficient to generate accurate sentiment classification results. The low inter-tool agreement suggests that the conclusion of a study could be entirely driven by the idiosyncrasies of the tool selected, rather than by the data. This is very concerning especially if the results may be used to inform important policy decisions such as mask or vaccination mandates.
引用
收藏
页数:9
相关论文
共 50 条
  • [11] Social media sentiment analysis based on COVID-19
    Nemes, Laszlo
    Kiss, Attila
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2021, 5 (01) : 1 - 15
  • [12] Sentiment Analysis in Social Media: A Comprehensive Bibliometric Analysis
    Tasente, Tanase
    Caratas, Maria Alina
    ADCOMUNICA-REVISTA CIENTIFICA DE ESTRATEGIAS TENDENCIAS E INNOVACION EN COMMUNICACION, 2024, (28): : 243 - 270
  • [13] Deep Learning for Social Media Sentiment Analysis
    Fithriasari, Kartika
    Jannah, Saidah Zahrotul
    Reyhana, Zakya
    MATEMATIKA, 2020, 36 (02) : 99 - 111
  • [14] Sentiment Analysis of Malay Social Media Text
    Chekima, Khalifa
    Alfred, Rayner
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 205 - 219
  • [15] Sentiment Analysis of Social Media Comments in Mauritius
    Sahib, Nuzhah Gooda
    Marianne, Marie Angele Justine
    Gobin-Rahimbux, Baby
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 860 - 865
  • [16] Sentiment Analysis on Social Media for Emotion Classification
    Tanna, Dilesh
    Dudhane, Manasi
    Sardar, Amrut
    Deshpande, Kiran
    Deshmukh, Neha
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 911 - 915
  • [17] A Study on Sentiment Analysis of Social Media Reviews
    Felciah, M. Lovelin Ponn
    Anbuselvi, R.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [18] Supervised sentiment analysis in Czech social media
    Habernal, Ivan
    Ptacek, Tomas
    Steinberger, Josef
    INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (05) : 693 - 707
  • [19] Companies Image Evaluation Using Social Media and Sentiment Analysis
    Cotfas, Livu-Adrian
    Delcea, Camelia
    Paun, Ramona-Mihaela
    EURASIAN BUSINESS PERSPECTIVES, 2020, 14 (02): : 277 - 286
  • [20] Every Post Matters: A Survey on Applications of Sentiment Analysis in Social Media
    Rathan, M.
    Hulipalled, Vishwanath R.
    Murugeshwari, P.
    Sushmitha, H. M.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), 2017, : 709 - 714