Road Traffic Crash Severity Analysis: A Bayesian-Optimized Dynamic Ensemble Selection Guided by Instance Hardness and Region of Competence Strategy

被引：1

作者：

Aziz, Kamran ^{[1
]}

Chen, Feng ^{[1
]}

Khan, Inamullah ^{[2
]}

Khahro, Shabir Hussain ^{[3
]}

Muhammad, Abdul Malik ^{[3
]}

Memon, Zubair Ahmed ^{[3
]}

Khattak, Afaq ^{[1
]}

机构：

[1] Tongji Univ, Key Lab Rd & Traff Engn, Minist Educ, Shanghai 201804, Peoples R China

[2] Natl Univ Sci & Technol, Natl Inst Transportat, Islamabad 24080, Pakistan

[3] Prince Sultan Univ, Coll Engn, Riyadh 11586, Saudi Arabia

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Accidents; Accuracy; Machine learning; Vehicle dynamics; Ensemble learning; Random forests; Predictive models; Bayes methods; Bayesian optimization; cost-sensitive learning; crash severity analysis; dynamic ensemble selection; DES-RRC-light GBM; KNORAE-Cat Boost; SHapley Additive exPlanations; CLASSIFIER SELECTION; INJURY SEVERITY; MODEL;

D O I：

10.1109/ACCESS.2024.3465489

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Millions of lives are lost in road accidents annually, underscoring the severity of traffic incidents. Despite numerous studies on crash severity forecasting with static ensemble machine learning, the potential of Dynamic Ensemble Selection (DES) for combining classifiers and improving performance remains relatively unexplored in this field. In contrast to previous research focused on static ensemble machine learning models, our study introduces an interpretable Bayesian Optimized Dynamic Ensemble Selection strategy, reinforced with instance hardness and region of competence considerations, for crash risk assessment. Utilizing six algorithms - Random Forest, Extra Tree, Cat Boost, Light GBM, KT Boost, and XGBoost - optimized through cross-validation, we employed bagging as a hybrid ensemble selection criterion to ensure ensemble diversity for DES models. Addressing class imbalance with cost-sensitive learning, we evaluated six DES algorithms, 'KNORA-E, KNORA-U, DES-KNN, DES-RRC, DES-KL, and META-DES' using a pool of input classifiers, based on key metrics such as Balanced Accuracy Score, Sensitivity, G-mean, and Matthews Correlation Coefficient. Our results revealed that the KNORA-E ensemble with Cat Boost achieved the highest performance, with a Balanced Accuracy (BCA) of 69.3%, Matthews Correlation Coefficient (MCC) of 0.36, and a G-mean score of 69.10%. DES-RRC demonstrated notable performance across all base classifiers, particularly in addressing imbalanced classes, and DES-RRC with LightGBM emerged as the second-best performer. SHAP analysis for the KNORAE-Cat Boost model identified key factors in crash severity as, road user gender, airbag deployment, vehicle age and road surface conditions. This approach could be a valuable tool for informed decision-making and targeted interventions.

引用

页码：139540 / 139559

页数：20