Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

Cited by: 39
Authors
Wang, Chen [1, 2]
Liu, Lin [3 ]
Xu, Chengcheng [1 ]
Lv, Weitao [3 ]
Affiliations
[1] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Intelligent Transportat Res Ctr, Nanjing 210096, Jiangsu, Peoples R China
[3] Jiangsu Intelligent Transportat Syst Co Ltd, Nanjing 210096, Jiangsu, Peoples R China
Keywords
driving risk; traffic violation behavior; machine learning; temporal transferability; TRAFFIC VIOLATIONS; ACCIDENT-RISK; EXPERIENCE; LIKELIHOOD; BEHAVIORS; SEVERITY; MODEL; FAULT
DOI
10.3390/ijerph16030334
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
This paper aims to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to address three critical technical issues: (1) defining driving risk; (2) developing risky driving factors; and (3) developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined under five different scenarios. A number of features were extracted from seven years of crash/violation records. Drivers' crash/violation information from the prior two years was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013-2014, 2014-2015, 2015-2016, and 2016-2017. Four tree-based ensemble learning techniques were evaluated: random forest (RF), AdaBoost with decision trees, gradient boosting decision tree (GBDT), and extreme gradient boosting decision tree (XGBoost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario for defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent datasets (i.e., 2016-2017). Seven of the nine top features were related to risky driving behaviors, which exhibited non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1-2 years). An appropriate risk definition, complex violation/crash features, and advanced machine learning techniques need to be considered for the risk prediction task. The proposed machine learning approach is promising and could allow safety interventions to be launched more effectively.
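The rolling-window setup and the average precision metric mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the seven years of records span 2011-2017 and that the four listed periods are the two-year prediction targets, each preceded by a two-year feature window; the abstract does not state either point explicitly. The function names `rolling_windows` and `average_precision` are hypothetical, and the AP formula is the common ranking-based definition (mean of precision at each positive hit), which may differ in detail from what the paper used.

```python
from typing import Iterator, Sequence, Tuple

# (feature window, target window), each as (first_year, last_year)
Window = Tuple[Tuple[int, int], Tuple[int, int]]


def rolling_windows(first_year: int, last_year: int,
                    history: int = 2, horizon: int = 2,
                    step: int = 1) -> Iterator[Window]:
    """Yield feature/target year spans that fit inside the record range.

    Assumption: `history` prior years feed the features and the next
    `horizon` years define the risk label, advancing by `step` years.
    """
    start = first_year
    while start + history + horizon - 1 <= last_year:
        feature_span = (start, start + history - 1)
        target_span = (start + history, start + history + horizon - 1)
        yield feature_span, target_span
        start += step


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Ranking-based AP: mean of precision@k over ranks k holding a positive."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    positives = sum(label for _, label in ranked)
    if positives == 0:
        raise ValueError("average precision is undefined without positives")
    hits = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            total += hits / rank  # precision at this rank
    return total / positives


# Assumed record range 2011-2017 (seven years).
windows = list(rolling_windows(2011, 2017))
for feature_span, target_span in windows:
    print(f"features {feature_span} -> target {target_span}")

# Toy labels (1 = high-risk) and model scores, purely illustrative.
toy_ap = average_precision([1, 0, 1, 0], [0.9, 0.8, 0.7, 0.1])
print(f"toy AP = {toy_ap:.3f}")
```

Under these assumptions the generator produces four windows whose target spans match the four periods in the abstract. A temporal transferability check would then score a model fitted on one window against the labels of a later window.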
Pages: 18
Related papers
50 in total
  • [31] Predicting the risk of lung cancer using machine learning: A large study based on UK Biobank
    Zhang, Siqi
    Yang, Liangwei
    Xu, Weiwen
    Wang, Yue
    Han, Liyuan
    Zhao, Guofang
    Cai, Ting
    MEDICINE, 2024, 103 (16) : E37879
  • [32] Machine learning to predict pregnancy outcomes: a systematic review, synthesizing framework and future research agenda
    Muhammad Nazrul Islam
    Sumaiya Nuha Mustafina
    Tahasin Mahmud
    Nafiz Imtiaz Khan
    BMC Pregnancy and Childbirth, 22
  • [33] Machine learning to predict pregnancy outcomes: a systematic review, synthesizing framework and future research agenda
    Islam, Muhammad Nazrul
    Mustafina, Sumaiya Nuha
    Mahmud, Tahasin
    Khan, Nafiz Imtiaz
    BMC PREGNANCY AND CHILDBIRTH, 2022, 22 (01)
  • [34] Research on predicting the driving forces of digital transformation in Chinese media companies based on machine learning
    Wang, Zhan
    Li, Yao
    Zhao, Xu
    Wang, Yuxuan
    Xiao, Zihan
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [35] Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
    Wang, Chen
    Kou, Siyuan
    Song, Yanchao
    ENTROPY, 2019, 21 (11)
  • [36] Development of a Machine Learning-Based Framework for Predicting Vessel Size Based on Container Capacity
    Chatterjee, Indranath
    Cho, Gyusung
    APPLIED SCIENCES-BASEL, 2022, 12 (19)
  • [37] Machine learning based inverse framework for predicting the transverse and shear modulus of carbon fiber
    Divakarraju, P., V
    Mishra, Neeraj
    Pandurangan, V
    Nithyadharan, M.
    COMPUTATIONAL MATERIALS SCIENCE, 2023, 230
  • [38] Exploitation of Vulnerabilities: A Topic-Based Machine Learning Framework for Explaining and Predicting Exploitation
    Charmanas, Konstantinos
    Mittas, Nikolaos
    Angelis, Lefteris
    INFORMATION, 2023, 14 (07)
  • [39] Predicting future technological convergence patterns based on machine learning using link prediction
    Joon Hyung Cho
    Jungpyo Lee
    So Young Sohn
    Scientometrics, 2021, 126 : 5413 - 5429
  • [40] Predicting future technological convergence patterns based on machine learning using link prediction
    Cho, Joon Hyung
    Lee, Jungpyo
    Sohn, So Young
    SCIENTOMETRICS, 2021, 126 (07) : 5413 - 5429