Extracting City Traffic Events from Social Streams

Cited by: 60
Authors
Anantharam, Pramod [1]
Barnaghi, Payam [2]
Thirunarayan, Krishnaprasad [1]
Sheth, Amit [1]
Affiliations
[1] Wright State Univ, Kno E Sis, Dayton, OH 45324 USA
[2] Univ Surrey, Guildford GU2 7XH, Surrey, England
Funding
National Science Foundation (USA)
Keywords
Design; Algorithms; Experimentation; Smart cities; citizen sensing; city events; tweets; event extraction; physical-cyber-social systems
DOI
10.1145/2717317
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cities are composed of complex systems with physical, cyber, and social components. Current work on extracting and understanding city events mainly relies on technology-enabled infrastructure to observe and record events. In this work, we propose an approach to leverage citizen observations of various city systems and services, such as traffic, public transport, water supply, weather, sewage, and public safety, as a source of city events. We investigate the feasibility of using such textual streams for extracting city events from annotated text. We formalize the problem of annotating social streams such as microblogs as a sequence labeling problem. We present a novel training data creation process for training sequence labeling models. Our automatic training data creation process utilizes instance-level domain knowledge (e.g., locations in a city, possible event terms). We compare this automated annotation process to a state-of-the-art tool that needs manually created training data and show that it has comparable performance in annotation tasks. An aggregation algorithm is then presented for event extraction from annotated text. We carry out a comprehensive evaluation of the event annotation and event extraction on a real-world dataset consisting of event reports and tweets collected over 4 months from the San Francisco Bay Area. The evaluation results are promising and provide insights into the utility of social streams for extracting city events.
Pages: 27