A System Analytics Framework for Detecting Infrastructure-Related Topics in Disasters Using Social Sensing

被引:26
作者
Fan, Chao [1 ]
Mostafavi, Ali [1 ]
Gupta, Aayush [1 ]
Zhang, Cheng [1 ]
机构
[1] Texas A&M Univ, College Stn, TX 77840 USA
来源
ADVANCED COMPUTING STRATEGIES FOR ENGINEERING, PT II | 2018年 / 10864卷
基金
美国国家科学基金会;
关键词
System analytics framework; Social sensing; Infrastructure-related topics; Disaster resilience; Text mining;
D O I
10.1007/978-3-319-91638-5_4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The objective of this paper is to propose and test a system analytics framework based on social sensing and text mining to detect topic evolution associated with the performance of infrastructure systems in disasters. Social media, like Twitter, as active channels of communication and information dissemination, provide insights into real-time information and first-hand experience from affected areas in mass emergencies. While the existing studies show the importance of social sensing in improving situational awareness and emergency response in disasters, the use of social sensing for detection and analysis of infrastructure systems and their resilience performance has been rather limited. This limitation is due to the lack of frameworks to model the events and topics ( e. g., grid interruption and road closure) evolution associated with infrastructure systems ( e. g., power, highway, airport, and oil) in times of disasters. The proposed framework detects infrastructure-related topics of the tweets posted in disasters and their evolutions by integrating searching relevant keywords, text lemmatization, Part-of-Speech ( POS) tagging, TF-IDF vectorization, topic modeling by using Latent Dirichlet Allocation ( LDA), and K-Means clustering. The application of the proposed framework was demonstrated in a study of infrastructure systems in Houston during Hurricane Harvey. In this case study, more than sixty thousand tweets were retrieved from 150-mile radius in Houston over 39 days. The analysis of topic detection and evolution from user-generated data were conducted, and the clusters of tweets pertaining to certain topics were mapped in networks over time. The results show that the proposed framework enables to summarize topics and track the movement of situations in different disaster phases. The analytics elements of the proposed framework can improve the recognition of infrastructure performance through text-based representation and provide evidence for decision-makers to take actionable measurements.
引用
收藏
页码:74 / 91
页数:18
相关论文
共 25 条
[1]  
Acar Adam, 2011, International Journal of Web Based Communities, V7, P392, DOI 10.1504/IJWBC.2011.041206
[2]  
[Anonymous], 2016, Text Analytics with Python: A Practitioner's Guide to Natural Language Processing
[3]  
[Anonymous], 2017, QUARTZ CONTRIBUTOR O
[4]  
Ashktorab Z., 2014, ISCRAM, P269, DOI DOI 10.1145/1835449.1835643
[5]  
Bala M. M., 2017, INT J CIV ENG TECHNO, V8, P20
[6]  
Bruns Axel, 2012, First Monday, V17, DOI 10.5210/fm.v17i4.3937
[7]  
Engineering News Record, 2017, ENG NEWS RECORD
[8]   "System-of-systems" approach for interdependent critical infrastructures [J].
Eusgeld, Irene ;
Nan, Cen ;
Dietz, Sven .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2011, 96 (06) :679-686
[9]   Ontology-based social media analysis for urban planning [J].
Gao, Xinxin ;
Yu, Wencheng ;
Rong, Yilong ;
Zhang, Songmao .
2017 IEEE 41ST ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2017, :888-896
[10]  
Hardeniya N., 2015, NLTK essentials