An analytic framework using deep learning for prediction of traffic accident injury severity based on contributing factors

被引：94

作者：

Ma, Zhengjing ^{[1
]}

Mei, Gang ^{[1
]}

Cuomo, Salvatore ^{[2
]}

机构：

[1] China Univ Geosci Beijing, Sch Engn & Technol, Beijing 100083, Peoples R China

[2] Univ Naples Federico II, Dept Math & Applicat R Caccioppoli, Naples, Italy

来源：

ACCIDENT ANALYSIS AND PREVENTION | 2021年 / 160卷 / 160期

基金：

中国国家自然科学基金;

关键词：

Road safety; Traffic accidents; Injury severity; Deep learning; DECISION RULES; LOGIT MODEL; CRASHES; TIME; PATTERNS; MACHINE; LEVEL; ZONES;

D O I：

10.1016/j.aap.2021.106322

中图分类号：

TB18 [人体工程学];

学科分类号：

1201 ;

摘要：

Vulnerable road users (VRUs) are exposed to the highest risk in the road traffic environment. Analyzing contributing factors that affect injury severity facilitates injury severity prediction and further application in developing countermeasures to guarantee VRUs safety. Recently, machine learning approaches have been introduced, in which analyses tend to be one-sided and may ignore important information. To solve this problem, this paper proposes a comprehensive analytic framework that employs a deep learning model referred to as the stacked sparse autoencoder (SSAE) to predict the injury severity of traffic accidents based on contributing factors. The essential idea of the method is to integrate various analyses into an analytical framework that performs corresponding data processing and analysis by different machine learning approaches. In the proposed method, first, we utilize a machine learning approach (i.e., Catboost) to analyze the importance and dependence of the contributing factors to injury severity and remove low correlation factors; second, according to the geographical information, we classify the data into different classes by utilizing a machine learning approach (i.e., k-means clustering); third, by employing high correlation factors, we employ an SSAE-based deep learning model to perform injury severity prediction in each data class. By experiments with a real-world traffic accident dataset, we demonstrated the effectiveness and applicability of the framework. Specifically, (1) the importance and dependence of contributing factors were obtained by CatBoost and the Shapley value, and (2) the SSAE-based deep learning model achieved the best performance compared to other baseline models. The proposed analytic framework can also be utilized for other accident data for severity or other risk indicator analyses involving VRUs safety.

引用

页数：16

共 71 条

[51] Prokhorenkova L., 2018, CATBOOST UNBIASED BO, P6639, DOI [10.1016/j.aap.2011.04.003, DOI 10.1016/J.AAP.2011.04.003]
[52] Choosing the proper autoencoder for feature fusion based on data complexity and classifiers: Analysis, tips and guidelines
Pulgar, Francisco J.
Charte, Francisco
Rivera, Antonio J.
del Jesus, Maria J.
[J]. INFORMATION FUSION, 2020, 54 : 44 - 60
[53] Anomaly Detection for Road Traffic: A Visual Analytics Framework
Riveiro, Maria
Lebram, Mikael
Elmer, Marcus
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (08) : 2260 - 2270
[54] CatBoost for RS Image Classification With Pseudo Label Support From Neighbor Patches-Based Clustering
Samat, Alim
Li, Erzhu
Du, Peijun
Liu, Sicong
Miao, Zelang
Zhang, Wei
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[55] Deep learning in neural networks: An overview
Schmidhuber, Juergen
[J]. NEURAL NETWORKS, 2015, 61 : 85 - 117
[56] Shapley L. S., 1997, CLASSICS GAME THEORY, V69, DOI [10.1515/9781400881970-018, DOI 10.1515/9781400881970-018]
[57] A feature learning approach based on XGBoost for driving assessment and risk prediction
Shi, Xiupeng
Wong, Yiik Diew
Li, Michael Zhi-Feng
Palanisamy, Chandrasekar
Chai, Chen
[J]. ACCIDENT ANALYSIS AND PREVENTION, 2019, 129 : 170 - 179
[58] Spring Forward at Your Own Risk: Daylight Saving Time and Fatal Vehicle Crashes
Smith, Austin C.
[J]. AMERICAN ECONOMIC JOURNAL-APPLIED ECONOMICS, 2016, 8 (02) : 65 - 91
[59] Explaining prediction models and individual predictions with feature contributions
Strumbelj, Erik
Kononenko, Igor
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (03) : 647 - 665
[60] Data-mining techniques for traffic accident modeling and prediction in the United Arab Emirates
Taamneh, Madhar
Alkheder, Sharaf
Taamneh, Salah
[J]. JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2017, 9 (02) : 146 - 166

← 1 2 3 4 5 6 7 8 →