A Feasibility Study of Open-Source Sentiment Analysis and Text Classification Systems on Disaster-Specific Social Media Data

Cited by: 4
Authors
Kejriwal, Mayank [1]
Fang, Ge [1]
Zhou, Ying [1]
Affiliations
[1] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90007 USA
Source
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021
Keywords
Crisis informatics; natural language processing; social media; sentiment analysis; text classification; Twitter; design
DOI
10.1109/SSCI50451.2021.9660089
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Crisis informatics is a multi-disciplinary area of research that has taken on renewed urgency due to the COVID-19 pandemic and the runaway effects of climate change. Due to scarce resources, technology, especially augmented artificial intelligence (AI), has the potential to play a meaningful role by using information management for facilitating better crisis response. In part, this is due both to improvements in the underlying technology and to an increasing willingness by stakeholders to release data and systems as open-source. Yet, it is still not clear from published literature if such established systems are truly useful on real-world crisis datasets (such as those acquired from Twitter) that often contain noise and inconsistencies. In this paper, we explore this agenda by conducting a set of case studies, using real social media data collected during six disasters (including Hurricane Sandy and the Boston Marathon Bombings) and made publicly available on a crisis informatics platform. We apply established, independently developed AI tools, including a resource specifically designed for the crisis domain, to explore whether they yield useful insights that could be helpful to first responders. Our results reveal that, while such insights can be obtained with relatively low effort, some caveats and best practices do apply, and sentiment analysis results (in particular) are not always consistent.
Pages: 8