Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

被引：39

作者：

Wang, Chen ^{[1
,2
]}

Liu, Lin ^{[3
]}

Xu, Chengcheng ^{[1
]}

Lv, Weitao ^{[3
]}

机构：

[1] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 210096, Jiangsu, Peoples R China

[2] Southeast Univ, Intelligent Transportat Res Ctr, Nanjing 210096, Jiangsu, Peoples R China

[3] Jiangsu Intelligent Transportat Syst Co Ltd, Nanjing 210096, Jiangsu, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH | 2019年 / 16卷 / 03期

关键词：

driving risk; traffic violation behavior; machine learning; temporal transferability; TRAFFIC VIOLATIONS; ACCIDENT-RISK; EXPERIENCE; LIKELIHOOD; BEHAVIORS; SEVERITY; MODEL; FAULT;

D O I：

10.3390/ijerph16030334

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: 1. defining driving risk; 2. developing risky driving factors; 3. developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined by five different scenarios. A number of features were extracted from seven-year crash/violation records. Drivers' two-year prior crash/violation information was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013-2014, 2014-2015, 2015-2016, and 2016-2017. Four tree-based ensemble learning techniques were attempted, including random forest (RF), Adaboost with decision tree, gradient boosting decision tree (GBDT), and extreme gradient boosting decision tree (XGboost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent datasets (i.e., 2016-2017). Seven of nine top features were related to risky driving behaviors, which presented non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1-2 years). Appropriate risk definition, complicated violation/crash features, and advanced machine learning techniques need to be considered for risk prediction task. The proposed machine learning approach is promising, so that safety interventions can be launched more effectively.

引用

页数：18

共 50 条

[31] Predicting the risk of lung cancer using machine learning: A large study based on UK Biobank
Zhang, Siqi
Yang, Liangwei
Xu, Weiwen
Wang, Yue
Han, Liyuan
Zhao, Guofang
Cai, Ting
MEDICINE, 2024, 103 (16) : E37879
[32] Machine learning to predict pregnancy outcomes: a systematic review, synthesizing framework and future research agenda
Muhammad Nazrul Islam
Sumaiya Nuha Mustafina
Tahasin Mahmud
Nafiz Imtiaz Khan
BMC Pregnancy and Childbirth, 22
[33] Machine learning to predict pregnancy outcomes: a systematic review, synthesizing framework and future research agenda
Islam, Muhammad Nazrul
Mustafina, Sumaiya Nuha
Mahmud, Tahasin
Khan, Nafiz Imtiaz
BMC PREGNANCY AND CHILDBIRTH, 2022, 22 (01)
[34] Research on predicting the driving forces of digital transformation in Chinese media companies based on machine learning
Wang, Zhan
Li, Yao
Zhao, Xu
Wang, Yuxuan
Xiao, Zihan
SCIENTIFIC REPORTS, 2024, 14 (01)
[35] Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
Wang, Chen
Kou, Siyuan
Song, Yanchao
ENTROPY, 2019, 21 (11)
[36] Development of a Machine Learning-Based Framework for Predicting Vessel Size Based on Container Capacity
Chatterjee, Indranath
Cho, Gyusung
APPLIED SCIENCES-BASEL, 2022, 12 (19):
[37] Machine learning based inverse framework for predicting the transverse and shear modulus of carbon fiber
Divakarraju, P., V
Mishra, Neeraj
Pandurangan, V
Nithyadharan, M.
COMPUTATIONAL MATERIALS SCIENCE, 2023, 230
[38] Exploitation of Vulnerabilities: A Topic-Based Machine Learning Framework for Explaining and Predicting Exploitation
Charmanas, Konstantinos
Mittas, Nikolaos
Angelis, Lefteris
INFORMATION, 2023, 14 (07)
[39] Predicting future technological convergence patterns based on machine learning using link prediction
Joon Hyung Cho
Jungpyo Lee
So Young Sohn
Scientometrics, 2021, 126 : 5413 - 5429
[40] Predicting future technological convergence patterns based on machine learning using link prediction
Cho, Joon Hyung
Lee, Jungpyo
Sohn, So Young
SCIENTOMETRICS, 2021, 126 (07) : 5413 - 5429

← 1 2 3 4 5 →