A Hybrid Method for Traffic Incident Duration Prediction Using BOA-Optimized Random Forest Combined with Neighborhood Components Analysis

被引:39
作者
Shang, Qiang [1 ]
Tan, Derong [1 ]
Gao, Song [1 ]
Feng, Linlin [2 ]
机构
[1] Shandong Univ Technol, Sch Transportat & Vehicle Engn, Zibo 255049, Shandong, Peoples R China
[2] Shandong Univ Technol, Sch Marxism Studies, Zibo 255049, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
CLEARANCE TIME; MIXTURE MODEL; CLASSIFICATION; REGRESSION; TREES;
D O I
10.1155/2019/4202735
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Predicting traffic incident duration is important for effective and real-time traffic incident management (TIM), which helps to minimize traffic congestion, environmental pollution, and secondary incident related to this incident. Traffic incident duration prediction methods often use more input variables to obtain better prediction results. However, the problems that available variables are limited at the beginning of an incident and how to select significant variables are ignored to some extent. In this paper, a novel prediction method named NCA-BOA-RF is proposed using the Neighborhood Components Analysis (NCA) and the Bayesian Optimization Algorithm (BOA)-optimized Random Forest (RF) model. Firstly, the NCA is applied to select feature variables for traffic incident duration. Then, RF model is trained based on the training set constructed using feature variables, and the BOA is employed to optimize the RF parameters. Finally, confusion matrix is introduced to measure the optimized RF model performance and compare with other methods. In addition, the performance is also tested in the absence of some feature variables. The results demonstrate that the proposed method not only has high accuracy, but also exhibits excellent reliability and robustness.
引用
收藏
页数:11
相关论文
共 38 条
[1]  
[Anonymous], 2015, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
[2]   Configuring in-memory cluster computing using random forest [J].
Bei, Zhendong ;
Yu, Zhibin ;
Luo, Ni ;
Jiang, Chuntao ;
Xu, Chengzhong ;
Feng, Shengzhong .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 79 :1-15
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Brochu E., 2010, CoRR
[5]  
Chang H.-L.T.-P. Chang., 2013, Journal of the Eastern Asia Society for Transportation Studies, V10, P1964, DOI DOI 10.11175/EASTS.10.1964
[6]   Quantifying non-recurrent traffic congestion caused by freeway work zones using archived work zone and ITS traffic data [J].
Chung, Younshik ;
Kim, Hyun ;
Park, Minyoung .
TRANSPORTMETRICA, 2012, 8 (04) :307-320
[7]   Adaptive Learning in Bayesian Networks for Incident Duration Prediction [J].
Demiroluk, Sami ;
Ozbay, Kaan .
TRANSPORTATION RESEARCH RECORD, 2014, (2460) :77-85
[8]   Variable selection using random forests [J].
Genuer, Robin ;
Poggi, Jean-Michel ;
Tuleau-Malot, Christine .
PATTERN RECOGNITION LETTERS, 2010, 31 (14) :2225-2236
[9]   Variable Importance Assessment in Regression: Linear Regression versus Random Forest [J].
Groemping, Ulrike .
AMERICAN STATISTICIAN, 2009, 63 (04) :308-319
[10]   Hazard based models for freeway traffic incident duration [J].
Hojati, Ahmad Tavassoli ;
Ferreira, Luis ;
Washington, Simon ;
Charles, Phil .
ACCIDENT ANALYSIS AND PREVENTION, 2013, 52 :171-181