An analytic framework using deep learning for prediction of traffic accident injury severity based on contributing factors

被引:94
作者
Ma, Zhengjing [1 ]
Mei, Gang [1 ]
Cuomo, Salvatore [2 ]
机构
[1] China Univ Geosci Beijing, Sch Engn & Technol, Beijing 100083, Peoples R China
[2] Univ Naples Federico II, Dept Math & Applicat R Caccioppoli, Naples, Italy
基金
中国国家自然科学基金;
关键词
Road safety; Traffic accidents; Injury severity; Deep learning; DECISION RULES; LOGIT MODEL; CRASHES; TIME; PATTERNS; MACHINE; LEVEL; ZONES;
D O I
10.1016/j.aap.2021.106322
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
Vulnerable road users (VRUs) are exposed to the highest risk in the road traffic environment. Analyzing contributing factors that affect injury severity facilitates injury severity prediction and further application in developing countermeasures to guarantee VRUs safety. Recently, machine learning approaches have been introduced, in which analyses tend to be one-sided and may ignore important information. To solve this problem, this paper proposes a comprehensive analytic framework that employs a deep learning model referred to as the stacked sparse autoencoder (SSAE) to predict the injury severity of traffic accidents based on contributing factors. The essential idea of the method is to integrate various analyses into an analytical framework that performs corresponding data processing and analysis by different machine learning approaches. In the proposed method, first, we utilize a machine learning approach (i.e., Catboost) to analyze the importance and dependence of the contributing factors to injury severity and remove low correlation factors; second, according to the geographical information, we classify the data into different classes by utilizing a machine learning approach (i.e., k-means clustering); third, by employing high correlation factors, we employ an SSAE-based deep learning model to perform injury severity prediction in each data class. By experiments with a real-world traffic accident dataset, we demonstrated the effectiveness and applicability of the framework. Specifically, (1) the importance and dependence of contributing factors were obtained by CatBoost and the Shapley value, and (2) the SSAE-based deep learning model achieved the best performance compared to other baseline models. The proposed analytic framework can also be utilized for other accident data for severity or other risk indicator analyses involving VRUs safety.
引用
收藏
页数:16
相关论文
共 71 条
  • [51] Prokhorenkova L., 2018, CATBOOST UNBIASED BO, P6639, DOI [10.1016/j.aap.2011.04.003, DOI 10.1016/J.AAP.2011.04.003]
  • [52] Choosing the proper autoencoder for feature fusion based on data complexity and classifiers: Analysis, tips and guidelines
    Pulgar, Francisco J.
    Charte, Francisco
    Rivera, Antonio J.
    del Jesus, Maria J.
    [J]. INFORMATION FUSION, 2020, 54 : 44 - 60
  • [53] Anomaly Detection for Road Traffic: A Visual Analytics Framework
    Riveiro, Maria
    Lebram, Mikael
    Elmer, Marcus
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (08) : 2260 - 2270
  • [54] CatBoost for RS Image Classification With Pseudo Label Support From Neighbor Patches-Based Clustering
    Samat, Alim
    Li, Erzhu
    Du, Peijun
    Liu, Sicong
    Miao, Zelang
    Zhang, Wei
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [55] Deep learning in neural networks: An overview
    Schmidhuber, Juergen
    [J]. NEURAL NETWORKS, 2015, 61 : 85 - 117
  • [56] Shapley L. S., 1997, CLASSICS GAME THEORY, V69, DOI [10.1515/9781400881970-018, DOI 10.1515/9781400881970-018]
  • [57] A feature learning approach based on XGBoost for driving assessment and risk prediction
    Shi, Xiupeng
    Wong, Yiik Diew
    Li, Michael Zhi-Feng
    Palanisamy, Chandrasekar
    Chai, Chen
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2019, 129 : 170 - 179
  • [58] Spring Forward at Your Own Risk: Daylight Saving Time and Fatal Vehicle Crashes
    Smith, Austin C.
    [J]. AMERICAN ECONOMIC JOURNAL-APPLIED ECONOMICS, 2016, 8 (02) : 65 - 91
  • [59] Explaining prediction models and individual predictions with feature contributions
    Strumbelj, Erik
    Kononenko, Igor
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (03) : 647 - 665
  • [60] Data-mining techniques for traffic accident modeling and prediction in the United Arab Emirates
    Taamneh, Madhar
    Alkheder, Sharaf
    Taamneh, Salah
    [J]. JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2017, 9 (02) : 146 - 166