Factors That Influence the Type of Road Traffic Accidents: A Case Study in a District of Portugal

被引:5
作者
Infante, Paulo [1 ,2 ]
Jacinto, Goncalo [1 ,2 ]
Afonso, Anabela [1 ,2 ]
Rego, Leonor [2 ]
Nogueira, Pedro [3 ,4 ]
Silva, Marcelo [3 ,4 ]
Nogueira, Vitor [5 ,6 ]
Saias, Jose [5 ,6 ]
Quaresma, Paulo [5 ,6 ]
Santos, Daniel [6 ]
Gois, Patricia [7 ]
Manuel, Paulo Rebelo [1 ]
机构
[1] Univ Evora, CIMA, IIFA, P-7000671 Evora, Portugal
[2] Univ Evora, Dept Math, ECT, P-7000671 Evora, Portugal
[3] Univ Evora, ICT, IIFA, P-7000671 Evora, Portugal
[4] Univ Evora, Dept Geosci, P-7000671 Evora, Portugal
[5] Univ Evora, Algoritmi Res Ctr, P-7000671 Evora, Portugal
[6] Univ Evora, Dept Informat, ECT, P-7000671 Evora, Portugal
[7] Univ Evora, Dept Visual Arts & Design, EA, P-7000208 Evora, Portugal
关键词
imbalance data; machine learning algorithms; multinomial logit model; ROSE technique; type of road traffic accident; SEVERITY; VEHICLE; CRASHES; SINGLE; MODELS;
D O I
10.3390/su15032352
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Road traffic accidents (RTAs) are a problem with repercussions in several dimensions: social, economic, health, justice, and security. Data science plays an important role in its explanation and prediction. One of the main objectives of RTA data analysis is to identify the main factors associated with a RTA. The present study aims to contribute to the identification of the determinants for the type of RTA: collision, crash, or pedestrian running-over. These factors are essential for identifying specific countermeasures because there is a relevant relationship between the type of RTA and its severity. Daily RTA data from 2016 to 2019 in a district of Portugal were analyzed. A statistical multinomial logit model was fitted. The identified determinants for the type of RTA were geographical (municipality, location, and parking areas), meteorological (air temperature and weather), time of the day (hour, day of the week, and month), driver's characteristics (gender and age), vehicle's features (type and age) and road characteristics (road layout and type). The multinomial model results were compared with several machine learning algorithms, since the original data of the type of RTA are severely imbalanced. All models showed poor performance. However, when combining these models with ROSE for class balancing, their performance improved considerably, with the random forest algorithm showing the best performance.
引用
收藏
页数:16
相关论文
共 33 条
[1]   The effect of road and environmental characteristics on pedestrian hit-and-run accidents in Ghana [J].
Aidoo, Eric Nimako ;
Amoh-Gyimah, Richard ;
Ackaah, Williams .
ACCIDENT ANALYSIS AND PREVENTION, 2013, 53 :23-27
[2]  
ANSR, 2013, B EST AC VIAC
[3]   Multinomial Logistic Regression Model for Single-Vehicle and Multivehicle Collisions on Urban U.S. Highways in Arkansas [J].
Bham, Ghulam H. ;
Javvadi, Bhanu S. ;
Manepalli, Uday R. R. .
JOURNAL OF TRANSPORTATION ENGINEERING, 2012, 138 (06) :786-797
[4]   Comparison of Prediction Models for Mortality Related to Injuries from Road Traffic Accidents after Correcting for Undersampling [J].
Boo, Yookyung ;
Choi, Youngjin .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (11)
[5]   Differences in Factors Affecting Various Crash Types with High Numbers of Fatalities and Injuries in China [J].
Chen, Yikai ;
Wang, Kai ;
King, Mark ;
He, Jie ;
Ding, Jianxun ;
Shi, Qin ;
Wang, Changjun ;
Li, Pingfan .
PLOS ONE, 2016, 11 (07)
[6]   Identifying crash type propensity using real-time traffic data on freeways [J].
Christoforou, Zoi ;
Cohen, Simon ;
Karlaftis, Matthew G. .
JOURNAL OF SAFETY RESEARCH, 2011, 42 (01) :43-50
[7]   mlogit: Random Utility Models in R [J].
Croissant, Yves .
JOURNAL OF STATISTICAL SOFTWARE, 2020, 95 (11) :1-41
[8]   A deep generative approach for crash frequency model with heterogeneous imbalanced data [J].
Ding, Hongliang ;
Lu, Yuhuan ;
Sze, N. N. ;
Chen, Tiantian ;
Guo, Yanyong ;
Lin, Qinghai .
ANALYTIC METHODS IN ACCIDENT RESEARCH, 2022, 34
[9]  
Eurostat, 2021, ROAD ACC NUMB FAT CO
[10]   Handling Imbalanced Data in Road Crash Severity Prediction by Machine Learning Algorithms [J].
Fiorentini, Nicholas ;
Losa, Massimo .
INFRASTRUCTURES, 2020, 5 (07)