Health, Psychosocial, and Social Issues Emanating From the COVID-19 Pandemic Based on Social Media Comments: Text Mining and Thematic Analysis Approach

被引:34
作者
Oyebode, Oladapo [1 ]
Ndulue, Chinenye [1 ]
Adib, Ashfaq [1 ]
Mulchandani, Dinesh [1 ]
Suruliraj, Banuchitra [1 ]
Orji, Fidelia Anulika [2 ]
Chambers, Christine T. [3 ,4 ]
Meier, Sandra [5 ]
Orji, Rita [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, 6050 Univ Ave, Halifax, NS B3H 1W5, Canada
[2] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
[3] Dalhousie Univ, Dept Psychol & Neurosci, Halifax, NS, Canada
[4] Dalhousie Univ, Fac Med, Dept Pediat, Halifax, NS, Canada
[5] Dalhousie Univ, Fac Med, Dept Psychiat, Halifax, NS, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
social media; COVID-19; coronavirus; infodemiology; infoveillance; natural language processing; text mining; thematic analysis; interventions; health issues; psychosocial issues; social issues; CORONAVIRUS;
D O I
10.2196/22734
中图分类号
R-058 [];
学科分类号
摘要
Background: The COVID-19 pandemic has caused a global health crisis that affects many aspects of human lives. In the absence of vaccines and antivirals, several behavioral change and policy initiatives such as physical distancing have been implemented to control the spread of COVID-19. Social media data can reveal public perceptions toward how governments and health agencies worldwide are handling the pandemic, and the impact of the disease on people regardless of their geographic locations in line with various factors that hinder or facilitate the efforts to control the spread of the pandemic globally. Objective: This paper aims to investigate the impact of the COVID-19 pandemic on people worldwide using social media data. Methods: We applied natural language processing (NLP) and thematic analysis to understand public opinions, experiences, and issues with respect to the COVID-19 pandemic using social media data. First, we collected over 47 million COVID-19-related comments from Twitter, Facebook, YouTube, and three online discussion forums. Second, we performed data preprocessing, which involved applying NLP techniques to clean and prepare the data for automated key phrase extraction. Third, we applied the NLP approach to extract meaningful key phrases from over 1 million randomly selected comments and computed sentiment score for each key phrase and assigned sentiment polarity (ie, positive, negative, or neutral) based on the score using a lexicon-based technique. Fourth, we grouped related negative and positive key phrases into categories or broad themes. Results: A total of 34 negative themes emerged, out of which 15 were health-related issues, psychosocial issues, and social issues related to the COVID-19 pandemic from the public perspective. Some of the health-related issues were increased mortality, health concerns, struggling health systems, and fitness issues; while some of the psychosocial issues were frustrations due to life disruptions, panic shopping, and expression of fear. Social issues were harassment, domestic violence, and wrong societal attitude. In addition, 20 positive themes emerged from our results. Some of the positive themes were public awareness, encouragement, gratitude, cleaner environment, online learning, charity, spiritual support, and innovative research. Conclusions: We uncovered various negative and positive themes representing public perceptions toward the COVID-19 pandemic and recommended interventions that can help address the health, psychosocial, and social issues based on the positive themes and other research evidence. These interventions will help governments, health professionals and agencies, institutions, and individuals in their efforts to curb the spread of COVID-19 and minimize its impact, and in reacting to any future pandemics.
引用
收藏
页数:26
相关论文
共 118 条
[1]   Exploring the Privacy-Preserving Properties of Word Embeddings: Algorithmic Validation Study [J].
Abdalla, Mohamed ;
Abdalla, Moustafa ;
Hirst, Graeme ;
Rudzicz, Frank .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (07)
[2]   Novel insights into views towards H1N1 during the 2009 Pandemic: a thematic analysis of Twitter data [J].
Ahmed, Wasim ;
Bath, Peter A. ;
Sbaffi, Laura ;
Demartini, Gianluca .
HEALTH INFORMATION AND LIBRARIES JOURNAL, 2019, 36 (01) :60-72
[3]   COVID-19 (Coronavirus) Pandemic: Information Sources Channels for the Public Health Awareness [J].
Ali, Muhammad Yousuf ;
Bhatti, Rubina .
ASIA-PACIFIC JOURNAL OF PUBLIC HEALTH, 2020, 32 (04) :168-169
[4]  
[Anonymous], SLANG LOOK TABL
[5]  
[Anonymous], GITHUB GISTS
[6]  
[Anonymous], 2020, EB VIR DIS
[7]  
[Anonymous], 2010, P 19 ACM INT C INFOR
[8]  
[Anonymous], SLANG WORDS DICT
[9]  
[Anonymous], 2012, P WORKSH SEM AN SOC
[10]  
[Anonymous], COVID 19 DASHB CTR S