Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

被引：39

作者：

Wang, Chen ^{[1
,2
]}

Liu, Lin ^{[3
]}

Xu, Chengcheng ^{[1
]}

Lv, Weitao ^{[3
]}

机构：

[1] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 210096, Jiangsu, Peoples R China

[2] Southeast Univ, Intelligent Transportat Res Ctr, Nanjing 210096, Jiangsu, Peoples R China

[3] Jiangsu Intelligent Transportat Syst Co Ltd, Nanjing 210096, Jiangsu, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH | 2019年 / 16卷 / 03期

关键词：

driving risk; traffic violation behavior; machine learning; temporal transferability; TRAFFIC VIOLATIONS; ACCIDENT-RISK; EXPERIENCE; LIKELIHOOD; BEHAVIORS; SEVERITY; MODEL; FAULT;

D O I：

10.3390/ijerph16030334

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: 1. defining driving risk; 2. developing risky driving factors; 3. developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined by five different scenarios. A number of features were extracted from seven-year crash/violation records. Drivers' two-year prior crash/violation information was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013-2014, 2014-2015, 2015-2016, and 2016-2017. Four tree-based ensemble learning techniques were attempted, including random forest (RF), Adaboost with decision tree, gradient boosting decision tree (GBDT), and extreme gradient boosting decision tree (XGboost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent datasets (i.e., 2016-2017). Seven of nine top features were related to risky driving behaviors, which presented non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1-2 years). Appropriate risk definition, complicated violation/crash features, and advanced machine learning techniques need to be considered for risk prediction task. The proposed machine learning approach is promising, so that safety interventions can be launched more effectively.

引用

页数：18

共 50 条

[41] Predicting survival in heart failure: a risk score based on machine-learning and change point algorithm
Kim, Wonse
Park, Jin Joo
Lee, Hae-Young
Kim, Kye Hun
Yoo, Byung-Su
Kang, Seok-Min
Baek, Sang Hong
Jeon, Eun-Seok
Kim, Jae-Joong
Cho, Myeong-Chan
Chae, Shung Chull
Oh, Byung-Hee
Kook, Woong
Choi, Dong-Ju
CLINICAL RESEARCH IN CARDIOLOGY, 2021, 110 (08) : 1321 - 1333
[42] Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer
Wang, Qiang
Xu, Jianhua
Wang, Anrong
Chen, Yi
Wang, Tian
Chen, Danyu
Zhang, Jiaxing
Brismar, Torkel B. B.
RADIOLOGIA MEDICA, 2023, 128 (02): : 136 - 148
[43] Predicting pedestrian crash occurrence and injury severity in Texas using tree-based machine learning models
Zhao, Bo
Zuniga-Garcia, Natalia
Xing, Lu
Kockelman, Kara M.
TRANSPORTATION PLANNING AND TECHNOLOGY, 2024, 47 (08) : 1205 - 1226
[44] Predicting seismic-based risk of lost circulation using machine learning
Geng, Zhi
Wang, Hanqing
Fan, Meng
Lu, Yunhu
Nie, Zhen
Ding, Yunhong
Chen, Mian
JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2019, 176 : 679 - 688
[45] An Approach to Truck Driving Risk Identification: A Machine Learning Method Based on Optuna Optimization
Wang, Zhaofei
Li, Hao
Wang, Qiuping
IEEE ACCESS, 2025, 13 : 42723 - 42732
[46] Traffic conflict-based crash risk estimation: machine learning meets extreme value theory
Zheng, Lai
Wei, Wei
TRANSPORTMETRICA A-TRANSPORT SCIENCE, 2025,
[47] A Machine Learning-Based Framework for the Prediction of Cervical Cancer Risk in Women
Kaushik, Keshav
Bhardwaj, Akashdeep
Bharany, Salil
Alsharabi, Naif
Rehman, Ateeq Ur
Eldin, Elsayed Tag
Ghamry, Nivin A.
SUSTAINABILITY, 2022, 14 (19)
[48] Predicting the Risk of Sleep Disorders Using a Machine Learning-Based Simple Questionnaire: Development and Validation Study
Ha, Seokmin
Choi, Su Jung
Lee, Sujin
Wijaya, Reinatt Hansel
Kim, Jee Hyun
Joo, Eun Yeon
Kim, Jae Kyoung
JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
[49] Machine Learning Framework for Classifying and Predicting Depressive Behavior Based on PPG and ECG Feature Extraction
Alzate, Mateo
Torres, Robinson
De la Roca, Jose
Quintero-Zea, Andres
Hernandez, Martha
APPLIED SCIENCES-BASEL, 2024, 14 (18):
[50] Machine learning-based farm risk management: A systematic mapping review
Ghaffarian, Saman
van der Voort, Mariska
Valente, Joao
Tekinerdogan, Bedir
de Mey, Yann
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 192

← 1 2 3 4 5 →