Real-Time Taxi Demand Prediction using data from the web

被引:0
作者
Markou, Ioulia [1 ]
Rodrigues, Filipe [1 ]
Pereira, Francisco C. [1 ]
机构
[1] Tech Univ Denmark DTU, Dept Management Engn, Lyngby, Denmark
来源
2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC) | 2018年
关键词
time series forecasting; taxi demand; special events; textual data; topic modeling; machine learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In transportation, nature, economy, environment, and many other settings, there are multiple simultaneous phenomena happening that are of interest to model and predict. Over the last few years, the traffic data that we have at our disposal have significantly increased, and we have truly entered the era of big data for transportation. Most existing traffic flow prediction methods mainly focus on capturing recurrent mobility trends that relate to habitual/routine behaviour, and on exploiting short-term correlations with recent observation patterns. However, valuable information that is often available in the form of unstructured data is neglected when attempting to improve forecasting results. In this paper, we explore time-series data and textual information combinations using machine learning techniques in the context of creating a prediction model that is able to capture in real-time future stressful situations of the studied transportation system. Using publicly available taxi data from New York, we empirically show that the proposed models are able to significantly reduce the error in the forecasts. The final mean absolute error (MAE) of our predictions is decreased by 19.5% for a three months testing period and by 57% if we focus only on event periods.
引用
收藏
页码:1664 / 1671
页数:8
相关论文
共 28 条
  • [1] [Anonymous], 2018, TLC Trip Record Data
  • [2] [Anonymous], BEST CONCERT VENUES
  • [3] [Anonymous], 2011, P INT AAAI C WEB SOC, DOI DOI 10.1609/ICWSM.V5I1.14109
  • [4] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [5] Chan J. W., 2016, J EC BUSINESS MANAGE, V4
  • [6] Davis N, 2016, 2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), P223, DOI 10.1109/ITSC.2016.7795558
  • [7] Characterizing Urban Landscapes using Geolocated Tweets
    Frias-Martinez, Vanessa
    Soto, Victor
    Hohwald, Heath
    Frias-Martinez, Enrique
    [J]. PROCEEDINGS OF 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY, RISK AND TRUST AND 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM/PASSAT 2012), 2012, : 239 - 248
  • [8] Ide T., 2009, Proceedings of the 2009 SIAM International Conference on Data Mining (SDM), SIAM, P1185
  • [9] Temporal and weather related variation patterns of urban travel time: Considerations and caveats for value of travel time, value of variability, and mode choice studies
    Kamga, Camille
    Yazici, M. Anil
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2014, 45 : 4 - 16
  • [10] Kireyev K, 2009, NIPS WORKSH APPL TOP, V1