Tracking COVID-19 Discourse on Twitter in North America: Infodemiology Study Using Topic Modeling and Aspect-Based Sentiment Analysis

被引:98
作者
Jang, Hyeju [1 ,2 ]
Rempel, Emily [2 ]
Roth, David [2 ]
Carenini, Giuseppe [1 ]
Janjua, Naveed Zafar [2 ,3 ,4 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC, Canada
[2] British Columbia Ctr Dis Control, 655 West 12th Ave, Vancouver, BC V5Z 4R4, Canada
[3] Univ British Columbia, Sch Populat & Publ Hlth, Vancouver, BC, Canada
[4] Univ British Columbia, Ctr Hlth Evaluat & Outcome Sci, Vancouver, BC, Canada
关键词
COVID-19; Twitter; topic modeling; aspect-based sentiment analysis; racism; anti-Asians; Canada; North America; sentiment analysis; social media; discourse; reaction; public health; SOCIAL MEDIA; HEALTH CRISIS; EBOLA;
D O I
10.2196/25431
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Social media is a rich source where we can learn about people's reactions to social issues. As COVID-19 has impacted people's lives, it is essential to capture how people react to public health interventions and understand their concerns. Objective: We aim to investigate people's reactions and concerns about COVID-19 in North America, especially in Canada. Methods: We analyzed COVID-19-related tweets using topic modeling and aspect-based sentiment analysis (ABSA), and interpreted the results with public health experts. To generate insights on the effectiveness of specific public health interventions for COVID-19, we compared timelines of topics discussed with the timing of implementation of interventions, synergistically including information on people's sentiment about COVID-19-related aspects in our analysis. In addition, to further investigate anti-Asian racism, we compared timelines of sentiments for Asians and Canadians. Results: Topic modeling identified 20 topics, and public health experts provided interpretations of the topics based on top-ranked words and representative tweets for each topic. The interpretation and timeline analysis showed that the discovered topics and their trend are highly related to public health promotions and interventions such as physical distancing, border restrictions, handwashing, staying home, and face coverings. After training the data using AB SA with human-in-the-loop, we obtained 545 aspect terms (eg, "vaccines," "economy," and "masks") and 60 opinion terms such as "infectious" (negative) and "professional" (positive), which were used for inference of sentiments of 20 key aspects selected by public health experts. The results showed negative sentiments related to the overall outbreak, misinformation and Asians, and positive sentiments related to physical distancing. Conclusions: Analyses using natural language processing techniques with domain expert involvement can produce useful information for public health. This study is the first to analyze COVID-19-related tweets in Canada in comparison with tweets in the United States by using topic modeling and human-in-the-loop domain-specific AB SA. This kind of information could help public health agencies to understand public concerns as well as what public health messages are resonating in our populations who use Twitter, which can be helpful for public health agencies when designing a policy for new interventions.
引用
收藏
页数:11
相关论文
共 36 条
[1]   Top Concerns of Tweeters During the COVID-19 Pandemic: Infoveillance Study [J].
Abd-Alrazaq, Alaa ;
Alhuwail, Dari ;
Househ, Mowafa ;
Hamdi, Mounir ;
Shah, Zubair .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (04)
[2]  
[Anonymous], TWEETPR 0 6 0
[3]  
[Anonymous], 2020, spacy-langdetect 0.1.2. The Python Package Index
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]   MEDICINE AND THE MEDIA How Twitter may have helped Nigeria contain Ebola [J].
Carter, Meg .
BMJ-BRITISH MEDICAL JOURNAL, 2014, 349
[6]  
Chen Emily, 2020, JMIR Public Health Surveill, V6, pe19273, DOI 10.2196/19273
[7]  
Chen L.-C., ARXIV
[8]   Prejudice Toward Asian Americans in the Covid-19 Pandemic: The Effects of Social Media Use in the United States [J].
Croucher, Stephen M. ;
Nguyen, Thao ;
Rahmani, Diyako .
FRONTIERS IN COMMUNICATION, 2020, 5
[9]   "Facts, Not Fear": Negotiating Uncertainty on Social Media During the 2014 Ebola Crisis [J].
Dalrymple, Kajsa E. ;
Young, Rachel ;
Tully, Melissa .
SCIENCE COMMUNICATION, 2016, 38 (04) :442-467
[10]  
Dong M, 2020, MEDRXIV, DOI [10.1101/2020.03.26.20044164., DOI 10.1101/2020.03.26.20044164, 10.1101/2020.03.26.20044164]