Modeling Spatiotemporal Pattern of Depressive Symptoms Caused by COVID-19 Using Social Media Data Mining

被引:57
作者
Li, Diya [1 ]
Chaudhary, Harshita [2 ]
Zhang, Zhe [1 ]
机构
[1] Texas A&M Univ, Dept Geog, 3147 TAMU, College Stn, TX 77843 USA
[2] Texas A&M Univ, Dept Comp Sci & Engn, 3112 TAMU, College Stn, TX 77843 USA
关键词
COVID-19; pandemic; social media data mining; mental health; Basilisk algorithm; Patient Health Questionnaire (PHQ); Correlation Explanation (CorEx); ACCURACY ASSESSMENT; SENTIMENT ANALYSIS; THEMATIC MAPS; INFORMATION; CLASSIFICATION; UNCERTAINTY;
D O I
10.3390/ijerph17144988
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
By 29 May 2020, the coronavirus disease (COVID-19) caused by SARS-CoV-2 had spread to 188 countries, infecting more than 5.9 million people, and causing 361,249 deaths. Governments issued travel restrictions, gatherings of institutions were cancelled, and citizens were ordered to socially distance themselves in an effort to limit the spread of the virus. Fear of being infected by the virus and panic over job losses and missed education opportunities have increased people's stress levels. Psychological studies using traditional surveys are time-consuming and contain cognitive and sampling biases, and therefore cannot be used to build large datasets for a real-time depression analysis. In this article, we propose a CorExQ9 algorithm that integrates a Correlation Explanation (CorEx) learning algorithm and clinical Patient Health Questionnaire (PHQ) lexicon to detect COVID-19 related stress symptoms at a spatiotemporal scale in the United States. The proposed algorithm overcomes the common limitations of traditional topic detection models and minimizes the ambiguity that is caused by human interventions in social media data mining. The results show a strong correlation between stress symptoms and the number of increased COVID-19 cases for major U.S. cities such as Chicago, San Francisco, Seattle, New York, and Miami. The results also show that people's risk perception is sensitive to the release of COVID-19 related public news and media messages. Between January and March, fear of infection and unpredictability of the virus caused widespread panic and people began stockpiling supplies, but later in April, concerns shifted as financial worries in western and eastern coastal areas of the U.S. left people uncertain of the long-term effects of COVID-19 on their lives.
引用
收藏
页码:1 / 23
页数:22
相关论文
共 70 条
[1]  
Abadi M., 2016, TENSORFLOW LARGE SCA, P3243, DOI DOI 10.5555/3026877.3026899
[2]  
Aizawa A, INFORM THEORETIC PER
[3]  
[Anonymous], 2005, DEPRESSION, P9
[4]  
[Anonymous], 110 WHO
[5]  
[Anonymous], 2013, P SIGCHI C HUM FACT
[6]  
[Anonymous], 2011, P 17 ACM SIGKDD INT
[7]  
[Anonymous], 2016, IJARCCE
[8]  
[Anonymous], Algorithms for Non-negative Matrix Factorization
[9]   SentiHealth: creating health-related sentiment lexicon using hybrid approach [J].
Asghar, Muhammad Zubair ;
Ahmad, Shakeel ;
Qasim, Maria ;
Zahra, Syeda Rabail ;
Kundi, Fazal Masud .
SPRINGERPLUS, 2016, 5
[10]  
Atkeson A., 2020, working paper, V26867, P1, DOI [DOI 10.3386/W26867, 10.3386/w26867]