They May Not Work! An evaluation of eleven sentiment analysis tools on seven social media datasets

被引:9
|
作者
He, Lu [1 ]
Yin, Tingjue [1 ]
Zheng, Kai [1 ,2 ]
机构
[1] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Dept Informat, Irvine, CA USA
[2] Univ Calif Irvine, Sch Med, Dept Emergency Med, Irvine, CA USA
关键词
Social media; Sentiment analysis; Natural language processing; Consumer health information;
D O I
10.1016/j.jbi.2022.104142
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: Sentiment analysis is an important method for understanding emotions and opinions expressed through social media exchanges. Little work has been done to evaluate the performance of existing sentiment analysis tools on social media datasets, particularly those related to health, healthcare, or public health. This study aims to address the gap. Material and methods: We evaluated 11 commonly used sentiment analysis tools on five health-related social media datasets curated in previously published studies. These datasets include Human Papillomavirus Vaccine, Health Care Reform, COVID-19 Masking, Vitals.com Physician Reviews, and the Breast Cancer Forum from MedHelp.org. For comparison, we also analyzed two non-health datasets based on movie reviews and generic tweets. We conducted a qualitative error analysis on the social media posts that were incorrectly classified by all tools. Results: The existing sentiment analysis tools performed poorly with an average weighted F1 score below 0.6. The inter-tool agreement was also low; the average Fleiss Kappa score is 0.066. The qualitative error analysis identified two major causes for misclassification: (1) correct sentiment but on wrong subject(s) and (2) failure to properly interpret inexplicit/indirect sentiment expressions. Discussion and conclusion: The performance of the existing sentiment analysis tools is insufficient to generate accurate sentiment classification results. The low inter-tool agreement suggests that the conclusion of a study could be entirely driven by the idiosyncrasies of the tool selected, rather than by the data. This is very concerning especially if the results may be used to inform important policy decisions such as mask or vaccination mandates.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Social Media Sentiment Analysis On Twitter Datasets
    Tiwari, Shikha
    Verma, Anshika
    Garg, Peeyush
    Bansal, Deepika
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 925 - 927
  • [2] Challenges of Evaluating Sentiment Analysis Tools on Social Media
    Maynard, Diana
    Bontcheva, Kalina
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1142 - 1148
  • [3] Sentiment Analysis for Social Media
    Iglesias, Carlos A.
    Moreno, Antonio
    APPLIED SCIENCES-BASEL, 2019, 9 (23):
  • [4] Sentiment Analysis with Machine Learning Methods on Social Media
    Basarslan, Muhammet Sinan
    Kayaalp, Fatih
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2020, 9 (03): : 5 - 15
  • [5] SENTIMENT ANALYSIS OF THE INFOGRAPHICS ON SOCIAL MEDIA
    Bratic, Diana
    Palic, Mirko
    Miljkovic, Petar
    PROCEEDINGS OF FEB ZAGREB 12TH INTERNATIONAL ODYSSEY CONFERENCE ON ECONOMICS AND BUSINESS, 2021, 2021, 3 : 1072 - 1081
  • [6] A survey of sentiment analysis in social media
    Lin Yue
    Weitong Chen
    Xue Li
    Wanli Zuo
    Minghao Yin
    Knowledge and Information Systems, 2019, 60 : 617 - 663
  • [7] A survey of sentiment analysis in social media
    Yue, Lin
    Chen, Weitong
    Li, Xue
    Zuo, Wanli
    Yin, Minghao
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) : 617 - 663
  • [8] Improving Sentiment Analysis in Social Media by Handling Lengthened Words
    Kukkar, Ashima
    Mohana, Rajni
    Sharma, Aman
    Nayyar, Anand
    Shah, Mohd. Asif
    IEEE ACCESS, 2023, 11 : 9775 - 9788
  • [9] Social Media and Sentiment Analysis: The Nigeria Presidential Election 2019
    Oyebode, Oladapo
    Orji, Rita
    2019 IEEE 10TH ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2019, : 140 - 146
  • [10] Aspect based Sentiment Analysis in Social Media with Classifier Ensembles
    Perikos, Isidoros
    Hatzilygeroudis, Ioannis
    2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 273 - 278