Accident Prediction Accuracy Assessment for Highway-Rail Grade Crossings Using Random Forest Algorithm Compared with Decision Tree

被引:136
作者
Zhou, Xiaoyi [1 ]
Lu, Pan [2 ]
Zheng, Zijian [3 ]
Tolliver, Denver [2 ]
Keramati, Amin [1 ]
机构
[1] North Dakota State Univ, Upper Great Plain Transportat Inst, NDSU Dept 2880 POB 6050, Fargo, ND 58108 USA
[2] North Dakota State Univ, Upper Great Plain Transportat Inst, Dept Transportat Logist & Finance, NDSU Dept 2880 POB 6050, Fargo, ND 58108 USA
[3] Gates Corp, Int Supply Chain & Logist Analyst, 1144 Fifteenth St Suite 1400, Denver, CO 80202 USA
关键词
Random Forest; Prediction Accuracy; Low False Alarm; Highway Rail Grade Crossing; Safety; Data Mining; MOTOR-VEHICLE CRASHES; INJURY SEVERITY; MODELS;
D O I
10.1016/j.ress.2020.106931
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Safety is a major concern of transportation planners and engineers in their design of highway rail grade crossings (HRGCs). Safety agencies rely on prediction models to allocate their crossing safety improvement resources. The prediction accuracy performance of those models is under-researched. This paper performs model forecasting accuracy comparison analysis for a proposed random forest method. Compared with the decision tree, the random forest method is capable of improving unbalanced data forecasting performance because of its bootstrap characteristic, which is a common resampling method to handle imbalanced data. Data imbalance is frequently encountered in safety analysis, where the use of inadequate performance metrics, such as accuracy, and specificity, will lead to overestimated generalization results. That is because the model/classifiers tend to predict the dominant class, non-crash class, in the area of safety analysis. The proposed random forest method is evaluated by various prediction performance measurements and compared with the decision tree. Results show that the random forest method dramatically improves the prediction accuracy without providing additional false negative predictions or false positive predictions which are known as false alarms.
引用
收藏
页数:9
相关论文
共 39 条
[1]   An empirical assessment of fixed and random parameter logit models using crash- and non-crash-specific injury data [J].
Anastasopoulos, Panagiotis Ch. ;
Mannering, Fred .
ACCIDENT ANALYSIS AND PREVENTION, 2011, 43 (03) :1140-1147
[2]  
[Anonymous], PRIOR PROB SAS ENT M
[3]  
[Anonymous], DECISION TREE CLASSI
[4]  
[Anonymous], SOCIAL BEHAV SCI
[5]  
[Anonymous], INT J EMERGING TREND
[6]  
[Anonymous], DET RAR CLASS SAN EN
[7]  
[Anonymous], HIGHW RAIL GRAD CROS
[8]  
[Anonymous], E ASIA SOC TRANSPORT
[9]  
[Anonymous], IDENTIFYING OVERCOMI
[10]  
[Anonymous], DECISION TREES