Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

被引:39
|
作者
Wang, Chen [1 ,2 ]
Liu, Lin [3 ]
Xu, Chengcheng [1 ]
Lv, Weitao [3 ]
机构
[1] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Intelligent Transportat Res Ctr, Nanjing 210096, Jiangsu, Peoples R China
[3] Jiangsu Intelligent Transportat Syst Co Ltd, Nanjing 210096, Jiangsu, Peoples R China
关键词
driving risk; traffic violation behavior; machine learning; temporal transferability; TRAFFIC VIOLATIONS; ACCIDENT-RISK; EXPERIENCE; LIKELIHOOD; BEHAVIORS; SEVERITY; MODEL; FAULT;
D O I
10.3390/ijerph16030334
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: 1. defining driving risk; 2. developing risky driving factors; 3. developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined by five different scenarios. A number of features were extracted from seven-year crash/violation records. Drivers' two-year prior crash/violation information was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013-2014, 2014-2015, 2015-2016, and 2016-2017. Four tree-based ensemble learning techniques were attempted, including random forest (RF), Adaboost with decision tree, gradient boosting decision tree (GBDT), and extreme gradient boosting decision tree (XGboost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent datasets (i.e., 2016-2017). Seven of nine top features were related to risky driving behaviors, which presented non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1-2 years). Appropriate risk definition, complicated violation/crash features, and advanced machine learning techniques need to be considered for risk prediction task. The proposed machine learning approach is promising, so that safety interventions can be launched more effectively.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Predicting survival in heart failure: a risk score based on machine-learning and change point algorithm
    Kim, Wonse
    Park, Jin Joo
    Lee, Hae-Young
    Kim, Kye Hun
    Yoo, Byung-Su
    Kang, Seok-Min
    Baek, Sang Hong
    Jeon, Eun-Seok
    Kim, Jae-Joong
    Cho, Myeong-Chan
    Chae, Shung Chull
    Oh, Byung-Hee
    Kook, Woong
    Choi, Dong-Ju
    CLINICAL RESEARCH IN CARDIOLOGY, 2021, 110 (08) : 1321 - 1333
  • [42] Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer
    Wang, Qiang
    Xu, Jianhua
    Wang, Anrong
    Chen, Yi
    Wang, Tian
    Chen, Danyu
    Zhang, Jiaxing
    Brismar, Torkel B. B.
    RADIOLOGIA MEDICA, 2023, 128 (02): : 136 - 148
  • [43] Predicting pedestrian crash occurrence and injury severity in Texas using tree-based machine learning models
    Zhao, Bo
    Zuniga-Garcia, Natalia
    Xing, Lu
    Kockelman, Kara M.
    TRANSPORTATION PLANNING AND TECHNOLOGY, 2024, 47 (08) : 1205 - 1226
  • [44] Predicting seismic-based risk of lost circulation using machine learning
    Geng, Zhi
    Wang, Hanqing
    Fan, Meng
    Lu, Yunhu
    Nie, Zhen
    Ding, Yunhong
    Chen, Mian
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2019, 176 : 679 - 688
  • [45] An Approach to Truck Driving Risk Identification: A Machine Learning Method Based on Optuna Optimization
    Wang, Zhaofei
    Li, Hao
    Wang, Qiuping
    IEEE ACCESS, 2025, 13 : 42723 - 42732
  • [46] Traffic conflict-based crash risk estimation: machine learning meets extreme value theory
    Zheng, Lai
    Wei, Wei
    TRANSPORTMETRICA A-TRANSPORT SCIENCE, 2025,
  • [47] A Machine Learning-Based Framework for the Prediction of Cervical Cancer Risk in Women
    Kaushik, Keshav
    Bhardwaj, Akashdeep
    Bharany, Salil
    Alsharabi, Naif
    Rehman, Ateeq Ur
    Eldin, Elsayed Tag
    Ghamry, Nivin A.
    SUSTAINABILITY, 2022, 14 (19)
  • [48] Predicting the Risk of Sleep Disorders Using a Machine Learning-Based Simple Questionnaire: Development and Validation Study
    Ha, Seokmin
    Choi, Su Jung
    Lee, Sujin
    Wijaya, Reinatt Hansel
    Kim, Jee Hyun
    Joo, Eun Yeon
    Kim, Jae Kyoung
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [49] Machine Learning Framework for Classifying and Predicting Depressive Behavior Based on PPG and ECG Feature Extraction
    Alzate, Mateo
    Torres, Robinson
    De la Roca, Jose
    Quintero-Zea, Andres
    Hernandez, Martha
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [50] Machine learning-based farm risk management: A systematic mapping review
    Ghaffarian, Saman
    van der Voort, Mariska
    Valente, Joao
    Tekinerdogan, Bedir
    de Mey, Yann
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 192