Identification of Secondary Crash Risk Factors using Penalized Logistic Regression Model

被引:22
作者
Kitali, Angela E. [1 ]
Alluri, Priyanka [1 ]
Sando, Thobias [2 ]
Wu, Wensong [3 ]
机构
[1] Florida Int Univ, Dept Civil & Environm Engn, Miami, FL 33199 USA
[2] Univ North Florida, Sch Engn, Jacksonville, FL USA
[3] Florida Int Univ, Dept Math & Stat, Miami, FL 33199 USA
关键词
REAL-TIME PREDICTION; MANAGEMENT; LIKELIHOOD;
D O I
10.1177/0361198119849053
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Secondary crashes (SCs) have increasingly been recognized as a major problem leading to reduced capacity and additional traffic delays. However, the limited knowledge on the nature and characteristics of SCs has largely impeded their mitigation strategies. There are two main issues with analyzing SCs. First, relevant variables are unknown, but, at the same time, most of the variables considered in the models are highly correlated. Second, only a small proportion of incidents results in SCs, making it an imbalanced classification problem. This study developed a reliable SC risk prediction model using the Least Absolute Shrinkage and Selection Operator (LASSO) penalized logistic regression model with Synthetic Minority Oversampling TEchnique-Nominal Continuous (SMOTE-NC). The proposed model is considered to improve the predictive accuracy of the SC risk model because it accounts for the asymmetric nature of SCs, performs variable selection, and removes highly correlated variables. The study data were collected on a 35-mi I-95 section for 3 years in Jacksonville, Florida. SCs were identified based on real-time speed data. The results indicated that real-time traffic variables and primary incident characteristics significantly affect the likelihood of SCs. The most influential variables included mean of detector occupancy, coefficient of variation of equivalent hourly volume, mean of speed, primary incident type, percentage of lanes closed, incident occurrence time, shoulder blocked, number of responding agencies, incident impact duration, incident clearance duration, and roadway alignment. The study results can be used by agencies to develop SC mitigation strategies, and therefore improve the operational and safety performance of freeways.
引用
收藏
页码:901 / 914
页数:14
相关论文
共 35 条
[1]   Modeling traffic flow interrupted by incidents [J].
Baykal-Guersoy, M. ;
Xiao, W. ;
Ozbay, K. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 195 (01) :127-138
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[4]  
Florida Department of Transportation, 2015, ROADW CHAR INV RCI D
[5]  
Friedman J, 2018, GLMNET LASSO ELASTIC
[6]   Probability of Secondary Crash Occurrence on Freeways with the Use of Private-Sector Speed Data [J].
Goodall, Noah J. .
TRANSPORTATION RESEARCH RECORD, 2017, (2635) :11-18
[7]   Short-term traffic flow prediction using fuzzy information granulation approach under different time intervals [J].
Guo, Jianhua ;
Liu, Zhao ;
Huang, Wei ;
Wei, Yun ;
Cao, Jinde .
IET INTELLIGENT TRANSPORT SYSTEMS, 2018, 12 (02) :143-150
[8]  
James G, 2013, SPRINGER TEXTS STAT, V103, P1, DOI [10.1007/978-1-4614-7138-7, 10.1007/978-1-4614-7138-7_1]
[9]  
Karlaftis MG, 1999, ITS J, V5, P39
[10]   Incident management integration tool: dynamically predicting incident durations, secondary incident occurrence and incident delays [J].
Khattak, A. ;
Wang, X. ;
Zhang, H. .
IET INTELLIGENT TRANSPORT SYSTEMS, 2012, 6 (02) :204-214