From Twitter to detector: Real-time traffic incident detection using social media data

Cited by: 227
Authors
Gu, Yiming [1, 2]
Qian, Zhen [1, 2]
Chen, Feng [3]
Affiliations
[1] Carnegie Mellon Univ, Dept Civil & Environm Engn, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Heinz Coll, Pittsburgh, PA 15213 USA
[3] SUNY Albany, Dept Comp Sci, Albany, NY 12222 USA
Funding
Andrew W. Mellon Foundation (USA)
Keywords
Incident detection; Social media; Natural language processing; Geocoding; Data mining; Crowd-sourcing; SYSTEM
DOI
10.1016/j.trc.2016.02.011
Chinese Library Classification (CLC) number
U [Transportation]
Discipline classification code
08; 0823
Abstract
The effectiveness of traditional incident detection is often limited by sparse sensor coverage, and reporting incidents to emergency response systems is labor-intensive. We propose to mine tweet texts to extract incident information on both highways and arterials as an efficient and cost-effective alternative to existing data sources. This paper presents a methodology to crawl, process and filter tweets that are accessible by the public for free. Tweets are acquired from Twitter using the REST API in real time. The process of adaptive data acquisition establishes a dictionary of important keywords and their combinations that can imply traffic incidents (TI). A tweet is then mapped into a high-dimensional binary vector in a feature space formed by the dictionary, and classified as either TI-related or not. All the TI tweets are then geocoded to determine their locations, and further classified into one of the five incident categories. We apply the methodology in two regions, the Pittsburgh and Philadelphia Metropolitan Areas. Overall, mining tweets holds great potential to complement existing traffic incident data at very low cost. A small sample of tweets acquired from the Twitter API covers most of the incidents reported in the existing data set, and additional incidents can be identified by analyzing tweet text. Twitter also provides ample additional information with reasonable coverage on arterials. Tweets that are both TI-related and geocodable account for approximately 5% of all the acquired tweets. Of those geocodable TI tweets, 60-70% are posted by influential users (IU), namely public Twitter accounts mostly owned by public agencies and media, while the rest are contributed by individual users. There is more incident information provided by Twitter on weekends than on weekdays. Within the same day, both individuals and IUs tend to report incidents more frequently during the daytime than at night, especially during traffic peak hours. Individual tweets are more likely to report incidents near the center of a city, and the volume of information decays significantly outwards from the center. (C) 2016 Elsevier Ltd. All rights reserved.
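The feature-mapping step described in the abstract (a tweet becomes a binary vector over a dictionary of keywords and keyword combinations) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the dictionary entries, the tokenizer, and the function names below are hypothetical, chosen only for illustration, and the abstract does not specify the classifier applied to the resulting vector, so none is shown.

```python
import re

# Hypothetical dictionary: each entry is a keyword or a combination of
# keywords. The paper's actual dictionary is not given in the abstract.
DICTIONARY = [
    ("accident",),
    ("crash",),
    ("lane", "closed"),
    ("disabled", "vehicle"),
    ("traffic",),
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (an assumed, simple tokenizer)."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def to_binary_vector(text, dictionary=DICTIONARY):
    """Return one 0/1 feature per dictionary entry.

    A feature is 1 only when every word of the entry occurs in the tweet.
    """
    tokens = tokenize(text)
    return [1 if all(word in tokens for word in entry) else 0
            for entry in dictionary]


if __name__ == "__main__":
    tweet = "Accident on I-376 EB, right lane closed"
    print(to_binary_vector(tweet))
```

For the sample tweet, only the "accident" entry and the "lane" + "closed" combination are matched. A vector of this form would then be passed to whatever binary classifier separates TI-related tweets from the rest.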
Pages: 321-342
Page count: 22