Event Detection on Twitter by Mapping Unexpected Changes in Streaming Data into a Spatiotemporal Lattice

被引:18
作者
Shah, Zubair [1 ,2 ]
Dunn, Adam G. [2 ]
机构
[1] Hamad Bin Khalifa Univ, Div ICT, Coll Sci Engn, Ar Rayyan, Qatar
[2] Macquarie Univ, Australian Inst Hlth Innovat, Ctr Hlth Informat ics, Macquarie Park, NSW 2109, Australia
基金
英国医学研究理事会;
关键词
Twitter; Event detection; Feature extraction; Spatiotemporal phenomena; Lattices; Urban areas; Data mining; Hierarchical patterns; events detection; twitter stream; SOCIAL MEDIA; SENTIMENT;
D O I
10.1109/TBDATA.2019.2948594
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many applications seek to make sense of high volume streaming data from social media by identifying spatiotemporal patterns. Events, representing topics that emerge and decay over time, are detected by monitoring for changes in the language being used, but typical approaches do not consider the localisation of events in cities and countries, and within hours, days, and weeks. This work develops and evaluates a new approach to event localisation and ranking that can be applied to Twitter data streams. The proposed approach models the use of language in tweets per city per hour to produce a model that can be used to detect the magnitude of unexpected changes in the use of the language. The approach uses a spatiotemporal lattice structure and a method for traversing between hours, days, and weeks, as well as cities, regions, and countries to identify anomalies in the language used across millions of tweets. The output is a ranked list of events comprising a list of tweets posted within a location and period of time, and characterized by language features of interest. The approach was implemented and tested by comparing events detected across five example domains (suicide, shooting, elections, sports, and sentiment) using 11.7 million tweets from users located in 100 cities and posted within the 203-day study period. Experiments demonstrate that the approach can detect events across a range of application domains.
引用
收藏
页码:508 / 522
页数:15
相关论文
共 56 条
[1]  
Alvanaki F, 2011, P 2011 ACM SIGMOD IN, P1271
[2]  
[Anonymous], 2010, HUMAN LANGUAGE TECHN
[3]  
[Anonymous], 2012, P 21 ACM INT C INF K, DOI DOI 10.1145/2396761.2396785
[4]  
[Anonymous], 2012, P IEEE VISWEEK WORKS
[5]  
[Anonymous], 2010, Proceedings of the 19th international conference on World wide web, DOI [DOI 10.1145/1772690.17727777,12, DOI 10.1145/1772690.1772777]
[6]   A SURVEY OF TECHNIQUES FOR EVENT DETECTION IN TWITTER [J].
Atefeh, Farzindar ;
Khreich, Wael .
COMPUTATIONAL INTELLIGENCE, 2015, 31 (01) :132-164
[7]  
Becker H., 2011, 5 INT AAAI C WEBL SO
[8]   Indexing Evolving Events from Tweet Streams [J].
Cai, Hongyun ;
Huang, Zi ;
Srivastava, Divesh ;
Zhang, Qing .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (11) :3001-3015
[9]   Event Detection using Twitter: A Spatio-Temporal Approach [J].
Cheng, Tao ;
Wicks, Thomas .
PLOS ONE, 2014, 9 (06)
[10]   Mapping information exposure on social media to explain differences in HPV vaccine coverage in the United States [J].
Dunn, Adam G. ;
Surian, Didi ;
Leask, Julie ;
Dey, Aditi ;
Mandl, Kenneth D. ;
Coiera, Enrico .
VACCINE, 2017, 35 (23) :3033-3040