Explainable artificial intelligence model for accident severity modeling

被引:0
作者
Mohammad Ali Khasawneh [1 ]
Ibrahim Khalil Umar [2 ]
Ahmad Ali Khasawneh [3 ]
机构
[1] Civil Engineering Department, Prince Mohammad Bin Fahd University, P.O. Box 1664, Al Khobar
[2] Department of Civil Engineering Technology, Kano State Polytechnic, Kano
[3] Management Engineering and Process Improvement Department/Integrated Systems Engineering Department, Ohio State University Wexner Medical Center, Ohio State University, 660 Ackerman Rd, Columbus, 43202, OH
关键词
Accident severity prediction; Explainable AI; Feature importance; Machine learning models; Risk assessment; Traffic safety;
D O I
10.1007/s42107-025-01318-7
中图分类号
学科分类号
摘要
Accident severity prediction is a critical challenge in traffic safety management, emergency response, and urban mobility planning. Road accidents remain a leading cause of fatalities worldwide, yet existing accident analysis frameworks often lack predictive accuracy, interpretability, and real-time decision-making capabilities. Traditional statistical models fail to capture complex interactions between vehicle attributes, driver behavior, and environmental factors, limiting their effectiveness in accident severity assessment. This study addresses these gaps by developing an explainable artificial intelligence (XAI) framework for accident severity prediction, leveraging machine learning models (Random Forest, Extreme Gradient Boosting, Support Vector Machine, and Naïve Bayes) and SHapley Additive exPlanations (SHAP) analysis to enhance model transparency. The dataset, sourced from the Fatality Analysis Reporting System (FARS), consists of 7394 recorded crashes and incorporates key predictors such as airbag deployment, control devices, seatbelt usage, and driver demographics. Experimental results demonstrate that XGBoost outperforms other models, achieving the highest accuracy (80.8%), recall (80.8%), and F1-score (81.0%), making it the most reliable classifier for distinguishing between severe and non-severe accidents. SHAP analysis reveals that airbag deployment, seatbelt usage, and control devices significantly impact accident severity outcomes, providing valuable insights into policy-driven interventions and traffic management strategies. Despite its effectiveness, the study highlights limitations such as data imbalance, lack of real-time behavioral factors, and exclusion of non-fatal crashes, suggesting deep learning integration, real-time telematics, and hybrid AI models in future research. The proposed framework offers a data-driven approach to accident severity prediction, improving road safety policies, vehicle design enhancements, and emergency response efficiency. © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2025.
引用
收藏
页码:2433 / 2445
页数:12
相关论文
共 28 条
[1]  
Abba S.I., Abdulkadir R.A., Sammen S.S., Pham Q.B., Lawan A.A., Esmaili P., Malik A., Al-Ansari N., Integrating feature extraction approaches with hybrid emotional neural networks for water quality index modeling, Applied Soft Computing, 114, (2022)
[2]  
Adefabi A., Olisah S., Obunadike C., Oyetubo O., Taiwo E., Tella E., Predicting accident severity: An analysis of factors affecting accident severity using random forest model, International Journal on Cybernetics & Informatics, 12, 6, pp. 107-121, (2023)
[3]  
Ali S.A., Parvin F., Pham Q.B., Vojtek M., Vojtekova J., Costache R., Linh N.T.T., Nguyen H.Q., Ahmad A., Ghorbani M.A., GIS-based comparative assessment of flood susceptibility mapping using hybrid multi-criteria decision-making approach, naïve Bayes tree, bivariate statistics and logistic regression: A case of Topľa basin, Slovakia, Ecological Indicators, 117, (2020)
[4]  
Aljarrah M.F., Khasawneh M.A., Al-Omari A.A., Investigating key factors influencing the severity of drivers injuries in car crashes using supervised machine learning techniques, Journal of Engineering Science and Technology Review, 12, 4, pp. 15-27, (2019)
[5]  
Al-Mukhtar M., Random forest, support vector machine, and neural networks to modelling suspended sediment in Tigris River-Baghdad, Environmental Monitoring and Assessment, 191, 11, (2019)
[6]  
Awad M., Khanna R., Support vector regression, Efficient learning machines, pp. 67-80, (2015)
[7]  
Blanquero R., Carrizosa E., Ramirez-Cobo P., Sillero-Denamiel M.R., Variable selection for Naïve Bayes classification, Computers & Operations Research, 135, (2021)
[8]  
Breiman L., Random forests, Machine Learning, 6, pp. 5-32, (2001)
[9]  
Ge Y., Zhao H., Liu T., Prediction and analysis of the severity of road traffic accidents at traffic signs under rainstorm conditions, In: Fourth International Conference on Intelligent Traffic Systems and Smart City (ITSSC 2024), 13422, pp. 422-427, (2025)
[10]  
Kaveh A., Applications of artificial neural networks and machine learning in civil engineering, studies in computational intelligence 1168, (2024)