Enhancing prediction and analysis of UK road traffic accident severity using AI: Integration of machine learning, econometric techniques, and time series forecasting in public health research

被引:5
作者
Abu Sufian, Md [1 ,3 ]
Varadarajan, Jayasree [2 ]
Niu, Mingbo [1 ]
机构
[1] Changan Univ, IVR Low Carbon Res Inst, Xian 710018, Shaanxi, Peoples R China
[2] Univ Leicester, Sch Geog, Leicester, England
[3] Univ Leicester, Sch Comp & Math Sci, Leicester, England
关键词
Machine learning; ARIMA; Winsorization; Explainable AI (XAI); SHAP (Shapley Additive Explanations); Public health; Road safety; Policy; GMM; VAR; Factor analysis; H2O AutoML;
D O I
10.1016/j.heliyon.2024.e28547
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This research project explored into the intricacies of road traffic accidents severity in the UK, employing a potent combination of machine learning algorithms, econometric techniques, and traditional statistical methods to analyse longitudinal historical data. Our robust analysis framework includes descriptive, inferential, bivariate, multivariate methodologies, correlation analysis: Pearson's and Spearman's Rank Correlation Coefficient, multiple logistic regression models, Multicollinearity Assessment, and Model Validation. In addressing heteroscedasticity or autocorrelation in error terms, we've advanced the precision and reliability of our regression analyses using the Generalized Method of Moments (GMM). Additionally, our application of the Vector Autoregressive (VAR) model and the Autoregressive Integrated Moving Average (ARIMA) models have enabled accurate time series forecasting. With this approach, we've achieved superior predictive accuracy and marked by a Mean Absolute Scaled Error (MASE) of 0.800 and a Mean Error (ME) of -73.80 compared to a naive forecast. The project further extends its machine learning application by creating a random forest classifier model with a precision of 73%, a recall of 78%, and an F1 -score of 73%. Building on this, we employed the H2O AutoML process to optimize our model selection, resulting in an XGBoost model that exhibits exceptional predictive power as evidenced by an RMSE of 0.1761205782994506 and MAE of 0.0874235576229789. Factor Analysis was leveraged to identify underlying variables or factors that explain the pattern of correlations within a set of observed variables. Scoring history, a tool to observe the model's performance throughout the training process was incorporated to ensure the highest possible performance of our machine learning models. We also incorporated Explainable AI (XAI) techniques, utilizing the SHAP (Shapley Additive Explanations) model to comprehend the contributing factors to accident severity. Features such as Driver_Home_Area_Type, Longitude, Driver_IMD_Decile, Road_Type, Casualty_Home_Area_Type, and Casualty_IMD_Decile were identified as significant influencers. Our research contributes to the nuanced understanding of traffic accident severity and demonstrates the potential of advanced statistical, econometric, machine learning techniques in informing evidence based interventions and policies for enhancing road safety.
引用
收藏
页数:33
相关论文
共 43 条
[1]   Examining the role of road safety audits worldwide: exploring road safety expert's opinions [J].
AlHamad, Souad ;
Almallah, Mustafa ;
Naser, Mohammad N. ;
Alhajyaseen, Wael K. M. ;
de Roos, Michael Philip .
INTERNATIONAL JOURNAL OF INJURY CONTROL AND SAFETY PROMOTION, 2023, 30 (01) :106-115
[2]   The impact of climate change on winter road maintenance and traffic accidents in West Midlands, UK [J].
Andersson, Anna K. ;
Chapman, Lee .
ACCIDENT ANALYSIS AND PREVENTION, 2011, 43 (01) :284-289
[3]  
Ansari S., 2022, 2022 ADV SCI ENG TEC, P1
[4]   Risk factors of road accident severity and the development of a new system for prevention: New insights from China [J].
Benlagha, Noureddine ;
Charfeddine, Lanouar .
ACCIDENT ANALYSIS AND PREVENTION, 2020, 136
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Medications and risk of motor vehicle collision responsibility in British Columbia, Canada: a population-based case-control study [J].
Brubacher, Jeffrey R. ;
Chan, Herbert ;
Erdelyi, Shannon ;
Zed, Peter J. ;
Staples, John A. ;
Etminan, Mahyar .
LANCET PUBLIC HEALTH, 2021, 6 (06) :E374-E385
[7]  
Bucsuhazy K., 2020, Transport Res Procedia, V45, P555, DOI [DOI 10.1016/J.TRPRO.2020.03.057, 10.1016/j.trpro.2020.03.057]
[8]  
Chandler J., 2019, Cochrane Handbook for Systematic Reviews of Interventions Version, V5, P3
[9]  
Chen T., XGBoost documentation
[10]   An international experience on the evolution of road costs during the project life cycle [J].
Chong, Uven ;
Hopkins, Omar .
TRANSPORT POLICY, 2016, 48 :60-66