Determining well-being during a crisis based on Twitter data

被引:0
作者
Wang, Xiao [1 ]
Janssens, Bram [1 ,2 ,3 ]
Bogaert, Matthias [1 ,2 ]
Vanderbauwhede, Liesel [1 ]
Schetgen, Lisa [1 ]
机构
[1] Univ Ghent, Dept Mkt Innovat & Org, Tweekerkenstr 2, B-9000 Ghent, Belgium
[2] FlandersMake UGent Corelab CVAMO, B-9000 Ghent, Belgium
[3] Res Fdn Flanders, Rue Louvain 38, B-1000 Brussels, Belgium
关键词
Social media analytics; Subjective well-being; NLP; Data mining; NEGATIVE AFFECT; MENTAL-HEALTH; SOCIAL MEDIA; SHORT-FORM; LANGUAGE; VALIDATION; SELECTION; TESTS;
D O I
10.1007/s10479-025-06578-9
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The objective of this paper is to determine how subjective well-being (SWB) can be predicted based on Twitter (or X) data during a crisis. Typically, a questionnaire, namely the international positive and negative affect schedule short-form (I-PANAS-SF), is employed to assess affective SWB. As natural language processing (NLP) continues to advance, two kinds of techniques (sentiment analysis tools and models based on textual representation) are employed to harness social media data for estimating affective SWB. First, we estimate affective SWB during a crisis by two popular sentiment analysis tools [i.e., valence aware dictionary for sEntiment reasoner (VADER) and TextBlob]. We evaluate the reliability of sentiment analysis tools as proxies for affective SWB and compare their performance to a manually labeled scale. Second, we conduct an exploratory data analysis to identify the hidden topics and major change points related to affective SWB. Third,we investigate five different textual representation techniques over five machine learning algorithms [i.e., elastic net regression, support vector machines, random forests, neural networks, and eXtreme gradient boosting (XGBoost)]. Finally, we identify key drivers of affective SWB predictions by analyzing feature importance. Our findings reveal that for the three target variables in affective SWB (i.e., total SWB, positive affect, and negative affect), sentiment analysis tools are suboptimal for predicting affective SWB, as evidenced by their poor performance in the evaluation metrics. The models constructed using the combination of textual representation techniques and machine learning algorithms exhibit relatively strong performance. The models with the highest predictive capability include OpenAI's Ada embedding combined with neural networks for total SWB and positive affect, and OpenAI's Ada embedding in conjunction with support vector machines for negative affect, achieving RMSE reductions of 41.60%, 41.77%, and 38.00%, respectively, compared to sentiment analysis tools. Our empirical results show that models using advanced embedding methods (RoBERTa and OpenAI's ada embedding) are promising methods for predicting affective SWB. This research addresses a gap in the literature concerning the field of predicting affective SWB in the context of a crisis by utilizing advanced NLP techniques, machine learning, and social media data.
引用
收藏
页数:38
相关论文
共 77 条
[1]   The Role of Social Media in the Advent of COVID-19 Pandemic: Crisis Management, Mental Health Challenges and Implications [J].
Abbas, Jaffar ;
Wang, Dake ;
Su, Zhaohui ;
Ziapour, Arash .
RISK MANAGEMENT AND HEALTHCARE POLICY, 2021, 14 :1917-1932
[2]   Emotions of COVID-19: Content Analysis of Self-Reported Information Using Artificial Intelligence [J].
Adikari, Achini ;
Nawaratne, Rashmika ;
De Silva, Daswin ;
Ranasinghe, Sajani ;
Alahakoon, Oshadi ;
Alahakoon, Damminda .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (04)
[3]   Combined 5 x 2 cv F test for comparing supervised classification learning algorithms [J].
Alpaydin, E .
NEURAL COMPUTATION, 1999, 11 (08) :1885-1892
[4]  
Anke L. E., 2022, arXiv, DOI 10.48550/ARXIV.2202.03829
[5]   Benchmarking state-of-the-art classification algorithms for credit scoring [J].
Baesens, B ;
Van Gestel, T ;
Viaene, S ;
Stepanova, M ;
Suykens, J ;
Vanthienen, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) :627-635
[6]   Social media optimization: Identifying an optimal strategy for increasing network size on Facebook [J].
Ballings, Michel ;
Van den Poel, Dirk ;
Bogaert, Matthias .
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2016, 59 :15-25
[7]   Natural disasters and mental health: A quantile approach [J].
Baryshnikova, Nadezhda V. ;
Pham, Ngoc T. A. .
ECONOMICS LETTERS, 2019, 180 :62-66
[8]   Declining well-being during the COVID-19 pandemic reveals US social inequities [J].
Bathina, Krishna C. ;
ten Thij, Marijn ;
Valdez, Danny ;
Rutter, Lauren A. ;
Bollen, Johan .
PLOS ONE, 2021, 16 (07)
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]   Predicting Self-declared Movie Watching Behavior Using Facebook Data and Information-Fusion Sensitivity Analysis [J].
Bogaert, Matthias ;
Ballings, Michel ;
Bergmans, Rob ;
Van den Poel, Dirk .
DECISION SCIENCES, 2021, 52 (03) :776-810