Leveraging Machine Learning Algorithms to Predict and Analyze Single-Vehicle and Multi-Vehicle Crash Occurrences on Motorways

被引:1
作者
Bin Masud, Saumik Sakib [1 ]
Mahajan, Kirti [1 ]
Kondyli, Alexandra [1 ]
Deliali, Katerina [2 ]
Yannis, George [2 ]
机构
[1] Univ Kansas, Dept Civil Environm & Architectural Engn, Lawrence, KS 66045 USA
[2] Natl Tech Univ Athens, Dept Civil Engn, Athens, Greece
关键词
traffic safety; crash analysis; crash data; crash frequency; crash prediction models; crash severity; single-vehicle and multi-vehicle crash; DRIVER INJURY SEVERITY; VARIABLES; MODEL;
D O I
10.1177/03611981241250348
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Road crashes are a common occurrence in many parts of the world, causing significant loss of life, injury, and economic damage. Crashes can be broadly classified into single-vehicle (SV) crashes and multi-vehicle (MV) crashes. Various statistical approaches have been implemented to identify the key factors behind these two types of crashes and it has been concluded that these factors need to be analyzed separately. The dataset for this research included various types of roadway design parameters and traffic conditions. Combinations of three feature-selection techniques, namely ANOVA, correlation matrix, and ExtraTreesClassifier algorithm, were utilized to separately select the appropriate variables for SV and MV crash analysis. Various machine learning (ML) models (e.g., LightGBM, XGBoost, etc.) along with a statistical method (binary logistic regression) have been adopted to predict SV and MV crash occurrences. The results show that gradient boosting-type ML algorithms outperform the remaining prediction models, and the LightGBM was found to be the most powerful in prediction. The LightGBM classifier produced accuracy, ROC_AUC, and avg. F-1 score of 0.75, 0.83, and 0.76, respectively, for MV crashes and 0.76, 0.82, and 0.76, respectively, for SV crashes. The SHapley Additive exPlanations (SHAP) analysis was used to explain how each variable affected the models' output. The results confirmed that the crash factors associated with SV and MV crashes are different and that some variables have inverse impact. Artificial intelligence and ML can assist transportation professionals in better understanding the causes of SV and MV crashes and advance the process toward Vision Zero.
引用
收藏
页码:1329 / 1345
页数:17
相关论文
共 54 条
[1]   Modeling traffic accident occurrence and involvement [J].
Abdel-Aty, MA ;
Radwan, AE .
ACCIDENT ANALYSIS AND PREVENTION, 2000, 32 (05) :633-642
[2]  
Abdella Galal M., 2019, International Journal of Operational Research, V34, P507
[3]  
Abdulhafedh A., 2022, OPEN ACCESS LIB J, V9, P1, DOI [10.4236/oalib.1108873, DOI 10.4236/OALIB.1108873]
[4]  
Al Daoud E., 2019, INT J COMPUT INF ENG, V13, P6, DOI [10.5281/zenodo.3607805, DOI 10.5281/ZENODO.3607805]
[5]   The impact of higher speed limits on the frequency and severity of freeway crashes: Accounting for temporal shifts and unobserved heterogeneity [J].
Alnawmasi, Nawaf ;
Mannering, Fred .
ANALYTIC METHODS IN ACCIDENT RESEARCH, 2022, 34
[6]   A random forest guided tour [J].
Biau, Gerard ;
Scornet, Erwan .
TEST, 2016, 25 (02) :197-227
[7]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Causal Analysis and Classification of Traffic Crash Injury Severity Using Machine Learning Algorithms [J].
Meghna Chakraborty ;
Timothy J. Gates ;
Subhrajit Sinha .
Data Science for Transportation, 2023, 5 (2)