A Comparative Study of Machine Learning Algorithms to Predict Road Accident Severity

被引:0
作者
Ahmed, Shakil [1 ]
Hossain, Md Akbar [1 ]
Bhuiyan, Md Mafijul Islam [2 ]
Ray, Sayan Kumar [1 ]
机构
[1] Manukau Inst Technol, Sch Digital Technol, Auckland, New Zealand
[2] Univ Alberta, Dept Computat Phys, Edmonton, AB, Canada
来源
20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS) | 2021年
关键词
road accident; machine learning; injury severity; single classifier; ensemble classifier; CRASH-FREQUENCY; MODELS;
D O I
10.1109/IUCC-CIT-DSCI-SMARTCNS55181.2021.00069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Road accidents is a global issue that cause deaths and injuries besides other direct and indirect losses. Countries and international organisations have designed technologies, systems, and policies to prevent accidents. The use of big traffic data and artificial intelligence may help develop a promising solution to predict or reduce the risk of road accidents. Most existing studies examine the impact of road geometry, environment, and weather parameters on road accidents. However, human factors such as alcohol, drug, age, and gender are often ignored when determining accident severity. In this work, we considered various contributing factors and their impact on the prediction of the severity of accidents. For this, we studied a set of single and ensemble mode machine learning (ML) methods and compared their performance in terms of prediction accuracy, precision, recall, F1 score, area under the receiver operator characteristic (AUROC). This research considered the road accident severity prediction as a classification problem that can classify the intensity of an accident in two categories: (i) binary classification (e.g. grievous and non-grievous), and (ii) multiclass classification (fatal, serious, minor, and non-injury). Our results show that Random Forest (RF) outperformed other methods' like logistic regression (LR), K-nearest neighbor (KNN), naive Bayes (NB), extreme gradient boosting (XGBoost), and adaptive boosting (AdaBoost) (e.g., 86.64% for binary and 67.67% for multiclass classification) in both single and ensemble ML methods in comparison to other methods considered in this research. LR, KNN, and NB are single mode ML methods that show similar performance to each other for both binary and multiclass classification. Compared to single mode, ensemble ML methods can predict the severity of accident more accurately with the order of RF, XGBoost, and Adaboost. The findings from this study can help to gain insights into the accident contributing factors and the severity of injuries as a result.
引用
收藏
页码:390 / 397
页数:8
相关论文
共 19 条
[1]  
Abdulhafedh Azad., 2008, Journal of Transportation Technologies, V6, P169
[2]   Analysis of traffic accident severity using Decision Rules via Decision Trees [J].
Abellan, Joaquin ;
Lopez, Griselda ;
de Ona, Juan .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (15) :6047-6054
[3]   Rapid Assessment of Environmental Health Impacts for Policy Support: The Example of Road Transport in New Zealand [J].
Briggs, David ;
Mason, Kylie ;
Borman, Barry .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2016, 13 (01)
[4]   Data mining of tree-based models to analyze freeway accident frequency [J].
Chang, LY ;
Chen, WC .
JOURNAL OF SAFETY RESEARCH, 2005, 36 (04) :365-375
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]   Support vector machine in crash prediction at the level of traffic analysis zones: Assessing the spatial proximity effects [J].
Dong, Ni ;
Huang, Helai ;
Zheng, Liang .
ACCIDENT ANALYSIS AND PREVENTION, 2015, 82 :192-198
[7]   The negative binomial-Lindley generalized linear model: Characteristics and application using crash data [J].
Geedipally, Srinivas Reddy ;
Lord, Dominique ;
Dhavala, Soma Sekhar .
ACCIDENT ANALYSIS AND PREVENTION, 2012, 45 :258-265
[8]   Exploring precrash maneuvers using classification trees and random forests [J].
Harb, Rami ;
Yan, Xuedong ;
Radwan, Essam ;
Su, Xiaogang .
ACCIDENT ANALYSIS AND PREVENTION, 2009, 41 (01) :98-107
[9]   Modeling young driver motor vehicle crashes: data with extra zeros [J].
Lee, AH ;
Stevenson, MR ;
Wang, K ;
Yau, KKW .
ACCIDENT ANALYSIS AND PREVENTION, 2002, 34 (04) :515-521
[10]   Extension of the Application of Conway-Maxwell-Poisson Models: Analyzing Traffic Crash Data Exhibiting Underdispersion [J].
Lord, Dominique ;
Geedipally, Srinivas Reddy ;
Guikema, Seth D. .
RISK ANALYSIS, 2010, 30 (08) :1268-1276