Short-Term Segment-Level Crash Risk Prediction Using Advanced Data Modeling with Proactive and Reactive Crash Data

Cited by: 12
Authors
Dimitrijevic, Branislav [1]
Khales, Sina Darban [1, 2]
Asadi, Roksana [1]
Lee, Joyoung [1]
Affiliations
[1] New Jersey Inst Technol, John A Reif Jr Dept Civil & Environm Engn, Newark, NJ 07102 USA
[2] Precis Syst Inc PSI, Washington, DC 20003 USA
Source
APPLIED SCIENCES-BASEL | 2022 / Volume 12 / Issue 02
Keywords
crash risk analysis; crash prediction; crash likelihood; crash injury severity; machine learning; LEARNING-METHODS; SEVERITY; MACHINE; LIKELIHOOD
DOI
10.3390/app12020856
Chinese Library Classification number
O6 [Chemistry]
Subject classification code
0703
Abstract
Highway crashes, along with the property damage, personal injuries, and fatalities that they cause, remain one of the most significant and critical transportation problems. At the same time, providing safe travel is one of the main goals of any transportation system. For this reason, both transportation research and practice have devoted much attention to the analysis and modeling of traffic crashes, including the development of models that can be applied to predict crash occurrence and crash severity. In general, such models assess short-term crash risks at a given highway facility, thus providing intelligence that can be used to identify and implement traffic operations strategies for crash mitigation and prevention. This paper presents several crash risk and injury severity assessment models applied at the highway segment level. The models use input data that are typically collected by, or readily available to, most transportation agencies in real time and at a regional network scale, which would render them readily applicable in practice. The input data included roadway geometry characteristics, traffic flow characteristics, and weather condition data. The paper develops, tests, and compares the performance of models that employ Random Effects Bayesian Logistic Regression, Gaussian Naive Bayes, K-Nearest Neighbor, Random Forest, and Gradient Boosting Machine methods. The paper applies the random oversampling examples (ROSE) method to deal with the data imbalance associated with the injury severity analysis. The models were trained and tested using a dataset of 10,155 crashes that occurred on two interstate highways in New Jersey over a two-year period. The paper also analyzes the potential improvement in the prediction abilities of the tested models when reactive data are added to the analysis. To that end, traffic crashes were classified into multiple classes based on driver age and vehicle age to assess the impact of these attributes on driver injury severity outcomes. The results of this analysis are promising, showing that the simultaneous use of reactive and proactive data can improve the prediction performance of the presented models.
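The class-imbalance step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it shows plain random oversampling (duplicating minority-class rows), whereas ROSE as usually described generates synthetic examples from a smoothed bootstrap. The function name `random_oversample`, the feature columns, and the toy values are all hypothetical.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until every
    class has as many rows as the largest class (hypothetical helper)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        # Rows belonging to this class in the original data.
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# Made-up toy data, not from the paper: (speed_mph, volume_vph) per crash.
rows = [(62, 1800), (55, 2100), (70, 900), (48, 2500), (66, 1200), (58, 1600)]
labels = ["no_injury", "no_injury", "injury",
          "no_injury", "no_injury", "no_injury"]

balanced_rows, balanced_labels = random_oversample(rows, labels)
print(Counter(balanced_labels))
```

In practice, oversampling would be applied only to the training split, so that duplicated rows do not leak into the test set used to compare the classifiers.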
Pages: 22