Predicting and Analyzing Road Traffic Injury Severity Using Boosting-Based Ensemble Learning Models with SHAPley Additive exPlanations

被引:49
|
作者
Dong, Sheng [1 ]
Khattak, Afaq [2 ]
Ullah, Irfan [3 ]
Zhou, Jibiao [4 ]
Hussain, Arshad [5 ]
机构
[1] Ningbo Univ Technol, Sch Civil & Transportat Engn, Fenghua Rd 201, Ningbo 315211, Peoples R China
[2] Tongji Univ, Minist Educ, Key Lab Rd & Traff Engn, 4800 Caoan Rd, Shanghai 201804, Peoples R China
[3] Int Islamic Univ, Dept Civil Engn, Sect H-10, Islamabad 1243, Pakistan
[4] Tongji Univ, Coll Transportat Engn, 4800 Caoan Rd, Shanghai 201804, Peoples R China
[5] Natl Univ Sci & Technol, NUST Inst Civil Engn, Sect H-12, Islamabad 44000, Pakistan
基金
中国国家自然科学基金;
关键词
traffic safety; road traffic injuries; boosting-based ensemble models; SHapley Additive exPlanations; DECISION TREE; CRASH COUNTS; DRIVERS; CLASSIFICATION; ACCIDENTS; TIME;
D O I
10.3390/ijerph19052925
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Road traffic accidents are one of the world's most serious problems, as they result in numerous fatalities and injuries, as well as economic losses each year. Assessing the factors that contribute to the severity of road traffic injuries has proven to be insightful. The findings may contribute to a better understanding of and potential mitigation of the risk of serious injuries associated with crashes. While ensemble learning approaches are capable of establishing complex and non-linear relationships between input risk variables and outcomes for the purpose of injury severity prediction and classification, most of them share a critical limitation: their "black-box" nature. To develop interpretable predictive models for road traffic injury severity, this paper proposes four boosting-based ensemble learning models, namely a novel Natural Gradient Boosting, Adaptive Gradient Boosting, Categorical Gradient Boosting, and Light Gradient Boosting Machine, and uses a recently developed SHapley Additive exPlanations analysis to rank the risk variables and explain the optimal model. Among four models, LightGBM achieved the highest classification accuracy (73.63%), precision (72.61%), and recall (70.09%), F1-scores (70.81%), and AUC (0.71) when tested on 2015-2019 Pakistan's National Highway N-5 (Peshawar to Rahim Yar Khan Section) accident data. By incorporating the SHapley Additive exPlanations approach, we were able to interpret the model's estimation results from both global and local perspectives. Following interpretation, it was determined that the Month_of_Year, Cause_of_Accident, Driver_Age and Collision_Type all played a significant role in the estimation process. According to the analysis, young drivers and pedestrians struck by a trailer have a higher risk of suffering fatal injuries. The combination of trailers and passenger vehicles, as well as driver at-fault, hitting pedestrians and rear-end collisions, significantly increases the risk of fatal injuries. This study suggests that combining LightGBM and SHAP has the potential to develop an interpretable model for predicting road traffic injury severity.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Enhancing co-pyrolysis process of biomass and coal using machine learning insights and Shapley additive explanations based on cooperative game theory
    Le, Quang Dung
    Paramasivam, Prabhu
    Chohan, Jasgurpreet Singh
    Sirohi, Ranjana
    Bui, Van Hung
    Kowalski, Jerzy
    Le, Huu Cuong
    Tran, Viet Dung
    ENERGY & ENVIRONMENT, 2025,
  • [42] Breast cancer molecular subtype prediction: Improving interpretability of complex machine-learning models based on multiparametric-MRI features using SHapley Additive exPlanations (SHAP) methodology
    Crombe, Amandine
    Kataoka, Masako
    DIAGNOSTIC AND INTERVENTIONAL IMAGING, 2024, 105 (05) : 161 - 162
  • [43] Predicting the pathological invasiveness in patients with a solitary pulmonary nodule via Shapley additive explanations interpretation of a tree-based machine learning radiomics model: a multicenter study
    Zhang, Rong
    Hong, Minping
    Cai, Hongjie
    Liang, Yanting
    Chen, Xinjie
    Liu, Ziwei
    Wu, Meilian
    Zhou, Cuiru
    Bao, Chenzhengren
    Wang, Huafeng
    Yang, Shaomin
    Hu, Qiugen
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2023, 13 (12) : 7828 - +
  • [44] Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models
    De Bock, Koen W.
    Van den Poel, Dirk
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (08) : 6816 - 6826
  • [45] An analytic framework using deep learning for prediction of traffic accident injury severity based on contributing factors
    Ma, Zhengjing
    Mei, Gang
    Cuomo, Salvatore
    ACCIDENT ANALYSIS AND PREVENTION, 2021, 160 (160):
  • [46] Creating machine learning models that interpretably link systemic inflammatory index, sex steroid hormones, and dietary antioxidants to identify gout using the SHAP (SHapley Additive exPlanations) method
    Cao, Shunshun
    Hu, Yangyang
    FRONTIERS IN IMMUNOLOGY, 2024, 15
  • [47] Prediction and Interpretation of Low-Level Wind Shear Criticality Based on Its Altitude above Runway Level: Application of Bayesian Optimization-Ensemble Learning Classifiers and SHapley Additive exPlanations
    Khattak, Afaq
    Chan, Pak-Wai
    Chen, Feng
    Peng, Haorong
    ATMOSPHERE, 2022, 13 (12)
  • [48] Predicting child occupant crash injury severity in the United Arab Emirates using machine learning models for imbalanced dataset
    Abdulazeez, Muhammad Uba
    Khan, Wasif
    Abdullah, Kassim Abdulrahman
    IATSS RESEARCH, 2023, 47 (02) : 134 - 159
  • [49] Effect of reservoir heterogeneity on well placement prediction in CO2-EOR projects using machine learning surrogate models: Benchmarking of boosting-based algorithms
    Esfandi, Tanin
    Sadeghnejad, Saeid
    Jafari, Arezou
    GEOENERGY SCIENCE AND ENGINEERING, 2024, 233
  • [50] Predicting the Severity and Discharge Prognosis of Traumatic Brain Injury Based on Intracranial Pressure Data Using Machine Learning Algorithms
    Zhu, Jun
    Shan, Yingchi
    Li, Yihua
    Wu, Xiang
    Gao, Guoyi
    WORLD NEUROSURGERY, 2024, 185 : E1348 - E1360