Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

Cited by: 39
Authors
Wang, Chen [1, 2]
Liu, Lin [3]
Xu, Chengcheng [1]
Lv, Weitao [3]
Affiliations
[1] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Intelligent Transportat Res Ctr, Nanjing 210096, Jiangsu, Peoples R China
[3] Jiangsu Intelligent Transportat Syst Co Ltd, Nanjing 210096, Jiangsu, Peoples R China
Keywords
driving risk; traffic violation behavior; machine learning; temporal transferability; TRAFFIC VIOLATIONS; ACCIDENT-RISK; EXPERIENCE; LIKELIHOOD; BEHAVIORS; SEVERITY; MODEL; FAULT;
DOI
10.3390/ijerph16030334
Chinese Library Classification number
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to address three critical technical issues: (1) defining driving risk; (2) developing risky driving factors; and (3) developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined under five different scenarios. A number of features were extracted from seven years of crash/violation records. Drivers' crash/violation information from the prior two years was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013-2014, 2014-2015, 2015-2016, and 2016-2017. Four tree-based ensemble learning techniques were tested: random forest (RF), AdaBoost with decision trees, gradient boosting decision tree (GBDT), and extreme gradient boosting (XGBoost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario for defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent dataset (i.e., 2016-2017). Seven of the nine top features were related to risky driving behaviors, which showed non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1-2 years). Appropriate risk definitions, complex violation/crash features, and advanced machine learning techniques need to be considered for the risk prediction task. The proposed machine learning approach is promising and could allow safety interventions to be launched more effectively.
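The rolling-window setup and the AP metric mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code. It assumes the four named periods are the outcome windows and that the seven years of records span 2011-2017, which is an inference from the abstract rather than something it states. The function names `rolling_windows` and `average_precision` are hypothetical, and the AP helper is a simple rank-based version that does not treat tied scores specially.

```python
def rolling_windows(first_year, last_year, feature_years=2, target_years=2, step=1):
    """Return (feature_period, target_period) pairs as inclusive year ranges.

    Assumption: each model uses `feature_years` of prior records to predict
    risk over the following `target_years`, and the window slides by `step`.
    """
    windows = []
    start = first_year
    while start + feature_years + target_years - 1 <= last_year:
        feature = (start, start + feature_years - 1)
        target = (feature[1] + 1, feature[1] + target_years)
        windows.append((feature, target))
        start += step
    return windows


def average_precision(labels, scores):
    """Mean of the precision values taken at the rank of each positive label.

    `labels` are 1 (high-risk) or 0 (low-risk); `scores` are predicted risks.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


if __name__ == "__main__":
    # Assumed record span 2011-2017 (seven years).
    for feature, target in rolling_windows(2011, 2017):
        print(f"features {feature[0]}-{feature[1]} -> outcome {target[0]}-{target[1]}")
    # Toy labels and scores, invented purely for illustration.
    print(average_precision([1, 0, 1, 0], [0.9, 0.8, 0.7, 0.1]))
```

Under these assumptions the sketch yields four feature/outcome pairs whose outcome windows match the periods listed in the abstract. In practice one model would be trained per pair and scored on held-out drivers.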
Pages: 18