A MapReduce-Based Nearest Neighbor Approach for Big-Data-Driven Traffic Flow Prediction

被引:35
作者
Xia, Dawen [1 ,2 ]
Li, Huaqing [3 ]
Wang, Binfeng [1 ]
Li, Yantao [1 ]
Zhang, Zili [1 ,4 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing 400715, Peoples R China
[2] Guizhou Minzu Univ, Sch Informat Engn, Guiyang 550025, Peoples R China
[3] Southwest Univ, Sch Elect & Informat Engn, Chongqing 400715, Peoples R China
[4] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia
来源
IEEE ACCESS | 2016年 / 4卷
基金
中国国家自然科学基金;
关键词
Big data analytics; traffic flow prediction; correlation analysis; parallel classifier; Hadoop MapReduce; TRAVEL-TIME PREDICTION; TRANSPORTATION; NETWORK; FREEWAY; SYSTEMS;
D O I
10.1109/ACCESS.2016.2570021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In big-data-driven traffic flow prediction systems, the robustness of prediction performance depends on accuracy and timeliness. This paper presents a new MapReduce-based nearest neighbor (NN) approach for traffic flow prediction using correlation analysis (TFPC) on a Hadoop platform. In particular, we develop a real-time prediction system including two key modules, i.e., offline distributed training (ODT) and online parallel prediction (OPP). Moreover, we build a parallel k-nearest neighbor optimization classifier, which incorporates correlation information among traffic flows into the classification process. Finally, we propose a novel prediction calculation method, combining the current data observed in OPP and the classification results obtained from large-scale historical data in ODT, to generate traffic flow prediction in real time. The empirical study on real-world traffic flow big data using the leave-one-out cross validation method shows that TFPC significantly outperforms four state-of-the-art prediction approaches, i.e., autoregressive integrated moving average, Naive Bayes, multilayer perceptron neural networks, and NN regression, in terms of accuracy, which can be improved 90.07% in the best case, with an average mean absolute percent error of 5.53%. In addition, it displays excellent speedup, scaleup, and sizeup.
引用
收藏
页码:2920 / 2934
页数:15
相关论文
共 50 条
  • [1] TRAFFIC FLOW PROPHECY WITH MAPREDUCE JOB FOR BIG DATA DRIVEN
    Abirami, U.
    Sridevi, S.
    2016 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2017, : 13 - 18
  • [2] Traffic Flow Prediction With Big Data: A Deep Learning Approach
    Lv, Yisheng
    Duan, Yanjie
    Kang, Wenwen
    Li, Zhengxi
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (02) : 865 - 873
  • [3] Traffic Incident Duration Prediction Based On K- Nearest Neighbor
    Wen, Yuan
    Chen, Shuyan
    Xiong, Qinyuan
    Han, Rubi
    Chen, Shiyu
    SUSTAINABLE DEVELOPMENT OF URBAN INFRASTRUCTURE, PTS 1-3, 2013, 253-255 : 1675 - 1681
  • [4] WKNN-FDCNN method for big data driven traffic flow prediction in ITS
    Soni, Ravikant
    Roy, Partha
    Nagwanshi, Kapil Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 25261 - 25286
  • [5] Big data-driven machine learning-enabled traffic flow prediction
    Kong, Fanhui
    Li, Jian
    Jiang, Bin
    Zhang, Tianyuan
    Song, Houbing
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2019, 30 (09)
  • [6] Time-Aware Multivariate Nearest Neighbor Regression Methods for Traffic Flow Prediction
    Dell'Acqua, Pietro
    Bellotti, Francesco
    Berta, Riccardo
    De Gloria, Alessandro
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (06) : 3393 - 3402
  • [7] Gaussian relevance vector MapReduce-based annealed Glowworm optimization for big medical data scheduling
    Patan, Rizwan
    Kallam, Suresh
    Gandomi, Amir H.
    Hanne, Thomas
    Ramachandran, Manikandan
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2022, 73 (10) : 2204 - 2215
  • [8] A New Data-Driven Approach to Traffic Flow Prediction Based on TCOA-BiGRU
    Wang, Haoxu
    Li, Long
    Wang, Zhiwen
    Lu, Yanrong
    Zeng, Jingxiao
    Miao, Wei
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1897 - 1901
  • [9] WKNN-FDCNN method for big data driven traffic flow prediction in ITS
    Ravikant Soni
    Partha Roy
    Kapil Kumar Nagwanshi
    Multimedia Tools and Applications, 2024, 83 : 25261 - 25286
  • [10] A big-data-driven matching model based on deep reinforcement learning for cotton blending
    Xia, Huosong
    Wang, Yuan
    Jasimuddin, Sajjad
    Zhang, Justin Zuopeng
    Thomas, Andrew
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023, 61 (22) : 7573 - 7591