Event Detection on Twitter by Mapping Unexpected Changes in Streaming Data into a Spatiotemporal Lattice

被引:18
作者
Shah, Zubair [1 ,2 ]
Dunn, Adam G. [2 ]
机构
[1] Hamad Bin Khalifa Univ, Div ICT, Coll Sci Engn, Ar Rayyan, Qatar
[2] Macquarie Univ, Australian Inst Hlth Innovat, Ctr Hlth Informat ics, Macquarie Park, NSW 2109, Australia
基金
英国医学研究理事会;
关键词
Twitter; Event detection; Feature extraction; Spatiotemporal phenomena; Lattices; Urban areas; Data mining; Hierarchical patterns; events detection; twitter stream; SOCIAL MEDIA; SENTIMENT;
D O I
10.1109/TBDATA.2019.2948594
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many applications seek to make sense of high volume streaming data from social media by identifying spatiotemporal patterns. Events, representing topics that emerge and decay over time, are detected by monitoring for changes in the language being used, but typical approaches do not consider the localisation of events in cities and countries, and within hours, days, and weeks. This work develops and evaluates a new approach to event localisation and ranking that can be applied to Twitter data streams. The proposed approach models the use of language in tweets per city per hour to produce a model that can be used to detect the magnitude of unexpected changes in the use of the language. The approach uses a spatiotemporal lattice structure and a method for traversing between hours, days, and weeks, as well as cities, regions, and countries to identify anomalies in the language used across millions of tweets. The output is a ranked list of events comprising a list of tweets posted within a location and period of time, and characterized by language features of interest. The approach was implemented and tested by comparing events detected across five example domains (suicide, shooting, elections, sports, and sentiment) using 11.7 million tweets from users located in 100 cities and posted within the 203-day study period. Experiments demonstrate that the approach can detect events across a range of application domains.
引用
收藏
页码:508 / 522
页数:15
相关论文
共 56 条
[31]   Real-time event detection for online behavioral analysis of big social data [J].
Nguyen, Duc T. ;
Jung, Jai E. .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 66 :137-145
[32]  
Osborne M., 2012, SIG 2012 WORKSH TIM
[33]  
Osborne M, 2014, PROCEEDINGS OF 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, P37
[34]  
Parikh R, 2013, PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), P613
[35]  
Phuvipadawat S., 2010, Proceedings of the 2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology - Workshops (WI-IAT 2010), P120, DOI 10.1109/WI-IAT.2010.205
[36]  
Popescu A.-M., 2010, P 19 ACM INT C INF K, P1873
[37]  
Rahimi A, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, P630
[38]   A survey on opinion mining and sentiment analysis: Tasks, approaches and applications [J].
Ravi, Kumar ;
Ravi, Vadlamani .
KNOWLEDGE-BASED SYSTEMS, 2015, 89 :14-46
[39]   SentiBench - a benchmark comparison of state-of-the-practice sentiment analysis methods [J].
Ribeiro, Filipe N. ;
Araujo, Matheus ;
Goncalves, Pollyanna ;
Goncalves, Marcos Andre ;
Benevenuto, Fabircio .
EPJ DATA SCIENCE, 2016, 5
[40]  
Ritter A., 2012, P 18 ACM SIGKDD INT, P1104