A MapReduce-Based Nearest Neighbor Approach for Big-Data-Driven Traffic Flow Prediction

被引:35
作者
Xia, Dawen [1 ,2 ]
Li, Huaqing [3 ]
Wang, Binfeng [1 ]
Li, Yantao [1 ]
Zhang, Zili [1 ,4 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing 400715, Peoples R China
[2] Guizhou Minzu Univ, Sch Informat Engn, Guiyang 550025, Peoples R China
[3] Southwest Univ, Sch Elect & Informat Engn, Chongqing 400715, Peoples R China
[4] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia
来源
IEEE ACCESS | 2016年 / 4卷
基金
中国国家自然科学基金;
关键词
Big data analytics; traffic flow prediction; correlation analysis; parallel classifier; Hadoop MapReduce; TRAVEL-TIME PREDICTION; TRANSPORTATION; NETWORK; FREEWAY; SYSTEMS;
D O I
10.1109/ACCESS.2016.2570021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In big-data-driven traffic flow prediction systems, the robustness of prediction performance depends on accuracy and timeliness. This paper presents a new MapReduce-based nearest neighbor (NN) approach for traffic flow prediction using correlation analysis (TFPC) on a Hadoop platform. In particular, we develop a real-time prediction system including two key modules, i.e., offline distributed training (ODT) and online parallel prediction (OPP). Moreover, we build a parallel k-nearest neighbor optimization classifier, which incorporates correlation information among traffic flows into the classification process. Finally, we propose a novel prediction calculation method, combining the current data observed in OPP and the classification results obtained from large-scale historical data in ODT, to generate traffic flow prediction in real time. The empirical study on real-world traffic flow big data using the leave-one-out cross validation method shows that TFPC significantly outperforms four state-of-the-art prediction approaches, i.e., autoregressive integrated moving average, Naive Bayes, multilayer perceptron neural networks, and NN regression, in terms of accuracy, which can be improved 90.07% in the best case, with an average mean absolute percent error of 5.53%. In addition, it displays excellent speedup, scaleup, and sizeup.
引用
收藏
页码:2920 / 2934
页数:15
相关论文
共 50 条
  • [31] A spatiotemporal indexing approach for efficient processing of big array-based climate data with MapReduce
    Li, Zhenlong
    Hu, Fei
    Schnase, John L.
    Duffy, Daniel Q.
    Lee, Tsengdar
    Bowen, Michael K.
    Yang, Chaowei
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2017, 31 (01) : 17 - 35
  • [32] Traffic flow prediction based on combination of support vector machine and data denoising schemes
    Tang, Jinjun
    Chen, Xinqiang
    Hu, Zheng
    Zong, Fang
    Han, Chunyang
    Li, Leixiao
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 534
  • [33] Short term traffic flow prediction of urban road network driven by spatiotemporal data
    Zhang, Y.
    Zhang, C.Y.
    Feng, H.
    Huang, C.X.
    Tong, Y.F.
    Advances in Transportation Studies, 2024, 2 (Special issue): : 67 - 76
  • [34] Data driven hybrid fuzzy model for short-term traffic flow prediction
    Li, Chengdong
    Yan, Bingyang
    Tang, Minjia
    Yi, Jianqiang
    Zhang, Xiqiao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (06) : 6525 - 6536
  • [35] Repeatability and Similarity of Freeway Traffic Flow and Long-Term Prediction Under Big Data
    Hou, Zhongsheng
    Li, Xingyi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (06) : 1786 - 1796
  • [36] A HYBRID SHORT-TERM TRAFFIC FLOW FORECASTING METHOD BASED ON NEURAL NETWORKS COMBINED WITH K-NEAREST NEIGHBOR
    Liu, Zhao
    Guo, Jianhua
    Cao, Jinde
    Wei, Yun
    Huang, Wei
    PROMET-TRAFFIC & TRANSPORTATION, 2018, 30 (04): : 445 - 456
  • [37] MAP-REDUCE BASED DISTANCE WEIGHTED K-NEAREST NEIGHBOR MACHINE LEARNING ALGORITHM FOR BIG DATA APPLICATIONS
    Gothai, E.
    Muthukumaran, V.
    Valarmathi, K.
    Sathishkumar, V. E.
    Thillaiarasu, N.
    Karthikeyan, P.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2022, 23 (04): : 129 - 145
  • [38] GDFormer: A Graph Diffusing Attention based approach for Traffic Flow Prediction
    Su, Jie
    Jin, Zhongfu
    Ren, Jie
    Yang, Jiandang
    Liu, Yong
    PATTERN RECOGNITION LETTERS, 2022, 156 : 126 - 132
  • [39] Traffic flow prediction based on BILSTM model and data denoising scheme
    Li, Zhong-Yu
    Ge, Hong-Xia
    Cheng, Rong-Jun
    CHINESE PHYSICS B, 2022, 31 (04)
  • [40] Toward a prediction approach based on deep learning in Big Data analytics
    Haddad, Omar
    Fkih, Fethi
    Omri, Mohamed Nazih
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (08) : 6043 - 6063