A Bayesian clustering ensemble Gaussian process model for network-wide traffic flow clustering and prediction

被引:13
作者
Zhu, Zheng [1 ,2 ,8 ]
Xu, Meng [3 ]
Ke, Jintao [4 ]
Yang, Hai [3 ,5 ]
Chen, Xiqun [1 ,2 ,6 ,7 ,8 ]
机构
[1] Zhejiang Univ, Inst Intelligent Transportat Syst, Coll Civil Engn & Architecture, Hangzhou, Peoples R China
[2] Alibaba Zhejiang Univ Joint Res Inst Frontier Tech, Hangzhou, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[4] Univ Hong Kong, Dept Civil Engn, Hong Kong, Peoples R China
[5] Hong Kong Univ Sci & Technol Guangzhou, Intelligent Transportat Thrust, Guangzhou, Peoples R China
[6] Zhejiang Univ, Haining, Peoples R China
[7] Univ Illinois Urbana Champaign Inst, Champaign, IL 61820 USA
[8] Zhejiang Prov Engn Res Ctr Intelligent Transportat, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Gaussian Process; Statistical learning; Traffic flow prediction; Dirichlet process mixture model; CONVOLUTIONAL NEURAL-NETWORK; SPEED; SYSTEM; VOLUME;
D O I
10.1016/j.trc.2023.104032
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Traffic flow prediction is an essential component in intelligent transportation systems. Recently, there has been a notable trend in applying machine learning models, especially deep learning, for network-wide traffic prediction. However, existing studies have limitations on model interpretability, model generalization, and over-reliance on image data processing or fine-designed deep learning structures for extracting traffic attributes. This paper attempts to tackle these limitations by proposing a Bayesian clustering ensemble Gaussian process (BCEGP) model for network-wide traffic flow clustering and prediction. The model utilizes a subset-based Dirichlet process mixture (SDPM) model to conduct a hard clustering among input data; then, within each cluster, it adopts the Gaussian Process (GP) to learn the probability relationship between inputs and outputs. During the prediction phase, the model conducts a soft clustering of the input as weights, and makes predictions via a weighted average of GPs' outputs. The merits of the BCEGP model include: (a) data with similar spatial-temporal patterns are clustered, which helps understand traffic dynamics in a non-Euclidean and non-graphical manner that enhances information extracting for model development; (b) GPs provide analytically trackable functions/gradients of predicted traffic flows with features and reveal variances of predicted traffic flow, enhancing model applicability and interpretability to some extent; (c) the model incorporates an ensemble learning framework that achieves great generalization performance as good as deep learning models; (d) the subset-based clustering and cluster-based GP learning are conducted parallelly, and thus vastly accelerate the training efficiency compared with conventional GPs (but slower than deep learning models). We test the performance of the proposed model based on both synthesized and real-world datasets. For comparison, several widely used machine learning and deep learning models are trained under the real-world dataset. The results demonstrate that the BCEGP model performs well in predictive accuracy, computational speed, and applicability, which can be a promising method for transportation problems.
引用
收藏
页数:20
相关论文
共 78 条
[1]   Unidirectional and Bidirectional LSTM Models for Short-Term Traffic Prediction [J].
Abduljabbar, Rusul L. ;
Dia, Hussein ;
Tsai, Pei-Wei .
JOURNAL OF ADVANCED TRANSPORTATION, 2021, 2021
[2]  
Ahmed M. S., 1979, TRANSP RES REC, P1
[3]   Variable speed limits: Safety and operational impacts of a candidate control strategy for freeway applications [J].
Allaby, Peter ;
Hellinga, Bruce ;
Bullock, Mara .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2007, 8 (04) :671-680
[4]  
[Anonymous], 2016, Int. J. Big Data Intell., DOI DOI 10.1504/IJBDI.2016.073903
[5]   Eco-Routing Navigation System Based on Multisource Historical and Real-Time Traffic Information [J].
Boriboonsomsin, Kanok ;
Barth, Matthew J. ;
Zhu, Weihua ;
Vu, Alexander .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2012, 13 (04) :1694-1704
[6]   Globally Approximate Gaussian Processes for Big Data With Application to Data-Driven Metamaterials Design [J].
Bostanabad, Ramin ;
Chan, Yu-Chin ;
Wang, Liwei ;
Zhu, Ping ;
Chen, Wei .
JOURNAL OF MECHANICAL DESIGN, 2019, 141 (11)
[7]  
Chen C, 2001, TRANSPORT RES REC, P96
[8]  
Chen L, 2007, 2007 IEEE INT C SYST, P1
[9]   Surrogate-Based Optimization for Solving a Mixed Integer Network Design Problem [J].
Chen, Xiqun ;
Zhu, Zheng ;
He, Xiang ;
Zhang, Lei .
TRANSPORTATION RESEARCH RECORD, 2015, (2497) :124-134
[10]  
[陈旭梅 Chen Xumei], 2012, [交通运输系统工程与信息, Journal of Transporation Systems Engineering & Information Technology], V12, P29