A MapReduce-Based Nearest Neighbor Approach for Big-Data-Driven Traffic Flow Prediction

被引:35
作者
Xia, Dawen [1 ,2 ]
Li, Huaqing [3 ]
Wang, Binfeng [1 ]
Li, Yantao [1 ]
Zhang, Zili [1 ,4 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing 400715, Peoples R China
[2] Guizhou Minzu Univ, Sch Informat Engn, Guiyang 550025, Peoples R China
[3] Southwest Univ, Sch Elect & Informat Engn, Chongqing 400715, Peoples R China
[4] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia
来源
IEEE ACCESS | 2016年 / 4卷
基金
中国国家自然科学基金;
关键词
Big data analytics; traffic flow prediction; correlation analysis; parallel classifier; Hadoop MapReduce; TRAVEL-TIME PREDICTION; TRANSPORTATION; NETWORK; FREEWAY; SYSTEMS;
D O I
10.1109/ACCESS.2016.2570021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In big-data-driven traffic flow prediction systems, the robustness of prediction performance depends on accuracy and timeliness. This paper presents a new MapReduce-based nearest neighbor (NN) approach for traffic flow prediction using correlation analysis (TFPC) on a Hadoop platform. In particular, we develop a real-time prediction system including two key modules, i.e., offline distributed training (ODT) and online parallel prediction (OPP). Moreover, we build a parallel k-nearest neighbor optimization classifier, which incorporates correlation information among traffic flows into the classification process. Finally, we propose a novel prediction calculation method, combining the current data observed in OPP and the classification results obtained from large-scale historical data in ODT, to generate traffic flow prediction in real time. The empirical study on real-world traffic flow big data using the leave-one-out cross validation method shows that TFPC significantly outperforms four state-of-the-art prediction approaches, i.e., autoregressive integrated moving average, Naive Bayes, multilayer perceptron neural networks, and NN regression, in terms of accuracy, which can be improved 90.07% in the best case, with an average mean absolute percent error of 5.53%. In addition, it displays excellent speedup, scaleup, and sizeup.
引用
收藏
页码:2920 / 2934
页数:15
相关论文
共 50 条
  • [21] A bidirectional-a-star-based ant colony optimization algorithm for big-data-driven taxi route recommendation
    Dawen Xia
    Bingqi Shen
    Yongling Zheng
    Wenyong Zhang
    Dewei Bai
    Yang Hu
    Huaqing Li
    Multimedia Tools and Applications, 2024, 83 : 16313 - 16335
  • [22] Segmentation of vehicle detector data for improved k-nearest neighbours-based traffic flow prediction
    Bernas, Marcin
    Placzek, Bartlomiej
    Porwik, Piotr
    Pamula, Teresa
    IET INTELLIGENT TRANSPORT SYSTEMS, 2015, 9 (03) : 264 - 274
  • [23] Data-driven techniques for temperature data prediction: big data analytics approach
    Adamson Oloyede
    Simeon Ozuomba
    Philip Asuquo
    Lanre Olatomiwa
    Omowunmi Mary Longe
    Environmental Monitoring and Assessment, 2023, 195
  • [24] Data-driven techniques for temperature data prediction: big data analytics approach
    Oloyede, Adamson
    Ozuomba, Simeon
    Asuquo, Philip
    Olatomiwa, Lanre
    Longe, Omowunmi Mary
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (02)
  • [25] TVD-MRDL: traffic violation detection system using MapReduce-based deep learning for large-scale data
    Asadianfam, Shiva
    Shamsi, Mahboubeh
    Kenari, Abdolreza Rasouli
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 2489 - 2516
  • [26] Digital Twin for Transportation Big Data: A Reinforcement Learning-Based Network Traffic Prediction Approach
    Nie, Laisen
    Wang, Xiaojie
    Zhao, Qinglin
    Shang, Zhigang
    Feng, Li
    Li, Guojun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (01) : 896 - 906
  • [27] Optimization Approach to Data-Driven Air Traffic Flow Management
    Diao, Xudong
    Lu, Shan
    TRANSPORTATION RESEARCH RECORD, 2022, 2676 (03) : 398 - 404
  • [28] LSTM-based traffic flow prediction with missing data
    Tian, Yan
    Zhang, Kaili
    Li, Jianyuan
    Lin, Xianxuan
    Yang, Bailin
    NEUROCOMPUTING, 2018, 318 : 297 - 305
  • [29] Traffic Flow Prediction Based On Expressway Operating Vehicle Data
    Ai, Yunfei
    Bai, Zhiming
    Su, Hang
    Zhong, Nan
    Sun, Yunhua
    Zhao, Jiandong
    2018 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2018), 2018, : 322 - 326
  • [30] A MapReduce Approach to Address Big Data Classification Problems Based on the Fusion of Linguistic Fuzzy Rules
    del Rio, Sara
    Lopez, Victoria
    Manuel Benitez, Jose
    Herrera, Francisco
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2015, 8 (03) : 422 - 437