Twitter Mining for Traffic Events Detection

被引:0
作者
Gutierrez, Carlos [1 ]
Figuerias, Paulo [1 ]
Oliveira, Pedro [1 ]
Costa, Ruben [1 ]
Jardim-Goncalves, Ricardo [1 ]
机构
[1] Univ Nova Lisboa, UNINOVA, Fac Ciencias & Tecnol, Ctr Technol & Syst, Lisbon, Portugal
来源
2015 SCIENCE AND INFORMATION CONFERENCE (SAI) | 2015年
关键词
machine learning; geo-parsing; information retrieval; social networks; classification; traffic events; SOCIAL MEDIA;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays with the proliferation of smartphones and tablets on the market, almost everyone has access to mobile devices that offer better processing capabilities and access to new information and services. The Web is undoubtedly the best tool for sharing content, especially through social networks. One of the most useful information that can be extracted is the geographical one. Current navigation systems lack in several ways to satisfy the need to process and reason upon such volumes of data, namely, to accurately provide information about urban traffic in real-time and the possibility to personalize the information used by such systems. This paper describes an approach to integrate and fuse tweet messages from traffic agencies in UK, with the objective of detecting the geographical focus of traffic events. Tweet messages are considered in this work given its uniqueness, the real time nature of tweets which may be used to quickly detect a traffic event and its simplicity; it only cost 140 characters to generate a message (called "tweet") for any user. The approach presented here is composed by several steps: tweet classification, event type classification, name entity recognition, geolocation and event tracking. Finally, we do an experimental study on a real dataset composed by traffic related tweet messages to access the accuracy of proposed approach. We present some inaccuracies ranging from lack of geographical information, imprecise and ambiguous toponyms, overlaps and repetitions as well as visualization to our data set in UK. We finally give an outlook into potential corrections. The work presented here is still part of on-going work. Results achieved so far do not address the final conclusions but form the basis for the formalization of a domain knowledge along with the services.
引用
收藏
页码:371 / 378
页数:8
相关论文
共 19 条
  • [1] Abel Fabian, 2012, Proceedings of the 23rd ACM Conference on Hypertext and Social Media, P285, DOI [DOI 10.1145/2309996.2310043, 10.1145/2309996.2310043]
  • [2] [Anonymous], 1999, P 7 TEXT RETRIEVAL C
  • [3] [Anonymous], 2011, P ACL
  • [4] Bifet A., DISCOVERY SCI, P1
  • [5] MobiS - Personalized Mobility Services for energy efficiency and security through advanced Artificial Intelligence techniques
    Costa, Ruben
    Figueiras, Paulo
    Malo, Pedro
    Jermol, Mitja
    Kalaboukas, Kostas
    [J]. INTELLIGENT DECISION TECHNOLOGIES, 2013, 255 : 296 - 306
  • [6] Fung G.P. C., 2005, VLDB, P181
  • [7] Harnessing the Crowdsourcing Power of Social Media for Disaster Relief
    Gao, Huiji
    Barbier, Geoffrey
    Goolsby, Rebecca
    [J]. IEEE INTELLIGENT SYSTEMS, 2011, 26 (03) : 10 - 14
  • [8] Guillén R, 2008, LECT NOTES COMPUT SC, V5152, P781, DOI 10.1007/978-3-540-85760-0_98
  • [9] TEDAS: a Twitter-based Event Detection and Analysis System
    Li, Rui
    Lei, Kin Hou
    Khadiwala, Ravi
    Chang, Kevin Chen-Chuan
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 1273 - 1276
  • [10] Munro R., 2011, P 15 C COMP NAT LANG