Stigmatization in social media: Documenting and analyzing hate speech for COVID-19 on Twitter

被引:14
作者
Fan L. [1 ,2 ]
Yu H. [2 ,4 ]
Yin Z. [3 ,4 ]
机构
[1] Program in Digital Humanities, University of California, Los Angeles, CA
[2] Department of Statistics, University of California, Los Angeles, CA
[3] Department of Mathematics, University of California, Los Angeles, CA
[4] Department of Economics, University of California, Los Angeles, CA
关键词
coronavirus; COVID-19; hate speech; pandemic; Twitter;
D O I
10.1002/pra2.313
中图分类号
学科分类号
摘要
As the COVID-19 pandemic has unfolded, Hate Speech on social media about China and Chinese people has encouraged social stigmatization. For the historical and humanistic purposes, this history-in-the-making needs to be archived and analyzed. Using the query “china+and+coronavirus” to scrape from the Twitter API, we have obtained 3,457,402 key tweets about China relating to COVID-19. In this archive, in which about 40% of the tweets are from the U.S., we identify 25,467 Hate Speech occurrences and analyze them according to lexicon-based emotions and demographics using machine learning and network methods. The results indicate that there are substantial associations between the amount of Hate Speech and demonstrations of sentiments, and state demographics factors. Sentiments of surprise and fear associated with poverty and unemployment rates are prominent. This digital archive and the related analyses are not simply historical, therefore. They play vital roles in raising public awareness and mitigating future crises. Consequently, we regard our research as a pilot study in methods of analysis that might be used by other researchers in various fields. 83rd Annual Meeting of the Association for Information Science & Technology October 25-29, 2020. Author(s) retain copyright, but ASIS&T receives an exclusive publication license.
引用
收藏
相关论文
共 10 条
[1]  
Chen E., Lerman K., Ferrara E., (2020)
[2]  
Chen L., Yang T., Luo J., Lyu H., Wang Y., (2020)
[3]  
Chew C., Eysenbach G., Pandemics in the age of Twitter: content analysis of Tweets during the 2009 H1N1 outbreak, PloS one, 5, 11, (2010)
[4]  
Dredze M., Paul M.J., Bergsma S., Tran H., AAAI Workshop - Technical Report, Carmen: A twitter geolocation system with applications to public health, pp. 20-24, (2013)
[5]  
Eileen J., Leamon J.B., (2017)
[6]  
Mohammad S.M., Turney P.D., pp. 26-34, (2010)
[7]  
Ravelo J.L., Jerving S., (2020)
[8]  
Taboada M., Brooke J., Tofiloski M., Voll K., Stede M., Lexicon-based methods for sentiment analysis, Computational linguistics, 37, 2, pp. 267-307, (2011)
[9]  
Xie B., He D., Mercer T., Et al., Global health crises are also information crises: A call to action, Journal of the Association for Information Science and Technology, pp. 1-5, (2020)
[10]  
WHO issues best practices for naming new human infectious diseases, (2015)