Using Content Analysis and Machine Learning to Identify COVID-19 Information Relevant to Low-income Households on Social Media

被引:0
作者
Khanal, Sarthak [1 ]
Refati, Rus [2 ]
Glandt, Kyle [3 ]
Caragea, Doina [3 ]
Xu, Sifan [4 ]
Chen, Chien-fei [2 ]
机构
[1] Kansas State Univ, Dept Comp Sci, Manhattan, KS 66506 USA
[2] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996 USA
[3] Kansas State Univ, Dept Comp Sci, Manhattan, KS 66502 USA
[4] Univ Tennessee, Sch Advertising & Publ Relat, Knoxville, TN 37996 USA
来源
19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021) | 2021年
关键词
Content analysis; Machine learning; Low-income households; Social media; Transformers; COVID-19;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00205
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
COVID-19 pandemic has caused great distress in the lives of many populations. Low-income households are among the most severely impacted groups in the United States and across the globe. Using social media, this paper aims to identify and organize the information about the impact of the pandemic on low-income households. We use content analysis to derive an annotation protocol and manually annotate a tweet dataset using this protocol. Furthermore, we use machine learning to learn models from the annotated dataset. We also employ a human-in-the-loop data augmentation procedure to improve the model's performance for the underrepresented classes. Our results show that using carefully annotated data, automated machine learning models can be trained and employed to identify information relevant to low income households, potentially in real time.
引用
收藏
页码:1522 / 1531
页数:10
相关论文
共 42 条
[1]  
[Anonymous], 1989, CEMENT SOC SURVEY SO
[2]  
Beard V., 2020, HOUSING HOMELESSNESS
[3]  
Benfer E. A., 2020, YALE J HLTH POLICY L, V19, P3
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]   CODING CHOICES FOR TEXTUAL ANALYSIS - A COMPARISON OF CONTENT-ANALYSIS AND MAP ANALYSIS [J].
CARLEY, K .
SOCIOLOGICAL METHODOLOGY 1993, VOL 23, 1993, 23 :75-126
[6]   Coronavirus comes home? Energy use, home energy management, and the social-psychological factors of COVID-19 [J].
Chen, Chien-fei ;
de Rubens, Gerardo Zarazua ;
Xu, Xiaojing ;
Li, Jiayi .
ENERGY RESEARCH & SOCIAL SCIENCE, 2020, 68
[7]   COVID-19 control in low-income settings and displaced populations: what can realistically be done? [J].
Dahab, Maysoon ;
van Zandvoort, Kevin ;
Flasche, Stefan ;
Warsame, Abdihamid ;
Ratnayake, Ruwan ;
Favas, Caroline ;
Spiegel, Paul B. ;
Waldman, Ronald J. ;
Checchi, Francesco .
CONFLICT AND HEALTH, 2020, 14 (01)
[8]   Predicting the pandemic: sentiment evaluation and predictive analysis from large-scale tweets on Covid-19 by deep convolutional neural network [J].
Das, Sourav ;
Kolya, Anup Kumar .
EVOLUTIONARY INTELLIGENCE, 2022, 15 (03) :1913-1934
[9]  
Nguyen DQ, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING: SYSTEM DEMONSTRATIONS, P9
[10]  
Deaton A., 2021, COVID 19 GLOBAL INCO