Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

Cited by: 39

Authors:

Wang, Chen [1,2]

Liu, Lin [3]

Xu, Chengcheng [1]

Lv, Weitao [3]

Affiliations:

[1] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 210096, Jiangsu, Peoples R China

[2] Southeast Univ, Intelligent Transportat Res Ctr, Nanjing 210096, Jiangsu, Peoples R China

[3] Jiangsu Intelligent Transportat Syst Co Ltd, Nanjing 210096, Jiangsu, Peoples R China

Source:

INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH | 2019 / Vol. 16 / Issue 03

Keywords:

driving risk; traffic violation behavior; machine learning; temporal transferability; TRAFFIC VIOLATIONS; ACCIDENT-RISK; EXPERIENCE; LIKELIHOOD; BEHAVIORS; SEVERITY; MODEL; FAULT;

DOI:

10.3390/ijerph16030334

Chinese Library Classification (CLC) number:

X [Environmental science, safety science];

Discipline classification code:

08 ; 0830 ;

Abstract:

The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: (1) defining driving risk; (2) developing risky driving factors; and (3) developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined under five different scenarios. A number of features were extracted from seven-year crash/violation records. Drivers' two-year prior crash/violation information was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013-2014, 2014-2015, 2015-2016, and 2016-2017. Four tree-based ensemble learning techniques were tested: random forest (RF), AdaBoost with decision trees, gradient boosting decision tree (GBDT), and extreme gradient boosting (XGBoost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario for defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent dataset (i.e., 2016-2017). Seven of the nine top features were related to risky driving behaviors, which showed non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1-2 years). Appropriate risk definition, complicated violation/crash features, and advanced machine learning techniques need to be considered for the risk prediction task. The proposed machine learning approach is promising and could help safety interventions be launched more effectively.
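
The rolling-window setup and the AP metric mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code. The function names are hypothetical, and the first record year (2011) is an assumption inferred from the seven years of records and the four prediction periods; the abstract does not state it. The AP function ranks drivers by predicted score and averages precision at each true high-risk driver, ignoring tied scores for simplicity.

```python
from typing import List, Sequence, Tuple

# ((feature_start, feature_end), (target_start, target_end)), years inclusive
Window = Tuple[Tuple[int, int], Tuple[int, int]]


def rolling_windows(first_year: int, last_year: int,
                    feature_len: int = 2, target_len: int = 2,
                    step: int = 1) -> List[Window]:
    """Pair each feature window with the target window that follows it."""
    windows = []
    start = first_year
    # Stop once the target window would run past the last available year.
    while start + feature_len + target_len - 1 <= last_year:
        feature = (start, start + feature_len - 1)
        target = (feature[1] + 1, feature[1] + target_len)
        windows.append((feature, target))
        start += step
    return windows


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Mean of precision@k over the ranks k where a positive (label 1) appears."""
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits = 0
    precisions = []
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0


if __name__ == "__main__":
    # Assumed record span 2011-2017 (the start year is an inference).
    for feature, target in rolling_windows(2011, 2017):
        print(f"features {feature[0]}-{feature[1]} -> risk label {target[0]}-{target[1]}")
    # Toy labels and scores, for illustration only (not data from the paper).
    print(average_precision([1, 0, 1, 0], [0.9, 0.8, 0.7, 0.1]))
```

Under the assumed 2011-2017 span, the sketch yields four feature/target pairs whose target windows match the four periods listed in the abstract.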


Pages: 18
