Identification of Traffic Accident Patterns via Cluster Analysis and Test Scenario Development for Autonomous Vehicles

被引:15
作者
Esenturk, Emre [1 ]
Wallace, Albert G. [1 ]
Khastgir, Siddartha [1 ]
Jennings, Paul [1 ]
机构
[1] Univ Warwick, WMG, Coventry CV4 7AL, W Midlands, England
基金
英国科研创新办公室;
关键词
Accidents; Clustering algorithms; Data mining; Testing; Clustering methods; Task analysis; Junctions; Accident analysis; scenario development; cluster analysis; market basket analysis; DRIVER INJURY SEVERITY; PEDESTRIAN CRASHES; ASSOCIATION RULES; SAFETY; PREDICTION; ALGORITHM; MODELS; FRAMEWORK;
D O I
10.1109/ACCESS.2021.3140052
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increased safety is one of the main motivations for traffic research and planning. The arduous task has two components: (i) improving the existing traffic policies based on a good understanding of risk factors related to trends in traffic accidents, and (ii) underpinning the emerging technologies that will advance the safety of vehicles. For the latter route, the introduction of connected and automated vehicles (CAVs) is a promising option as CAVs can potentially reduce the number of accidents. However, to reap their benefits, they need to be introduced in a safe manner and tested for their ability to safely deal with risky scenarios. Unfortunately, the identification of such test scenarios remains a key challenge for the industry. This study contributes to increased safety by (i) analyzing UK's STATS19 accident data to identify patterns in past traffic accidents, and (ii) utilizing this information to systematically generate scenarios for CAV testing. For task (i), the patterns in the accidents were identified in terms of static and time-dependent internal and external factors. For this purpose, the study employed a clustering algorithm, COOLCAT, which is particularly suitable for dealing with high-dimensional categorical data. Six different clusters emerged naturally as a result of the algorithm. To interpret the clusters, we applied a frequency analysis to each cluster. The frequency tests showed that in each cluster, certain distinct real-world situations were represented more significantly compared to the non-clustered reference case, which are the markers of each cluster. The second task (ii) complemented the first task by synthesizing the relationships between attributes. This was done by association rule mining using the market basket analysis approach. The method enabled us to develop, drawing from the characteristics of the clusters, non-trivial test scenarios that can be used in the testing of CAVs, especially in virtual testing.
引用
收藏
页码:6660 / 6675
页数:16
相关论文
共 61 条
[1]   Development of artificial neural network models to predict driver injury severity in traffic accidents at signalized intersections [J].
Abdelwahab, HT ;
Abdel-Aty, MA .
HIGHWAY SAFETY: MODELING, ANALYSIS, MANAGEMENT, STATISTICAL METHODS, AND CRASH LOCATION: SAFETY AND HUMAN PERFORMANCE, 2001, (1746) :6-13
[2]  
Aggarwal CC., 2012, Mining Text Data, P77
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]   Kernel density estimation and K-means clustering to profile road accident hotspots [J].
Anderson, Tessa K. .
ACCIDENT ANALYSIS AND PREVENTION, 2009, 41 (03) :359-364
[5]  
Andritsos P, 2004, LECT NOTES COMPUT SC, V2992, P123
[6]  
[Anonymous], 2020, BSI PAS 1883 2020: Operational Design Domain (ODD): taxonomy for automated driving systems (ADS). Specification
[7]  
[Anonymous], 2009, IEEE Spectr
[8]  
Barbara D., 2002, Proceedings of the Eleventh International Conference on Information and Knowledge Management. CIKM 2002, P582, DOI 10.1145/584792.584888
[9]   A crash-prediction model for multilane roads [J].
Caliendo, Ciro ;
Guida, Maurizio ;
Parisi, Alessandra .
ACCIDENT ANALYSIS AND PREVENTION, 2007, 39 (04) :657-670
[10]   Data mining of tree-based models to analyze freeway accident frequency [J].
Chang, LY ;
Chen, WC .
JOURNAL OF SAFETY RESEARCH, 2005, 36 (04) :365-375