A data mining approach to characterize road accident locations附视频

被引：0

作者：

Sachin Kumar ^{[1
]}

Durga Toshniwal ^{[2
]}

机构：

[1] Centre for Transportation Systems(CTRANS), Indian Institute of Technology Roorkee

[2] Computer Science & Engineering Department, Indian Institute of Technology

来源：

Journal of Modern Transportation | 2016年 / 01期

关键词：

Road accidents; Accident analysis; Data mining; k-Means; Association rule mining;

D O I：

暂无

中图分类号：

TP311.13 []; U495 [电子计算机在公路运输和公路工程中的应用];

学科分类号：

1201 ; 0838 ;

摘要：

Data mining has been proven as a reliable technique to analyze road accidents and provide productive results. Most of the road accident data analysis use data mining techniques, focusing on identifying factors that affect the severity of an accident. However, any damage resulting from road accidents is always unacceptable in terms of health, property damage and other economic factors. Sometimes, it is found that road accident occurrences are more frequent at certain specific locations. The analysis of these locations can help in identifying certain road accident features that make a road accident to occur frequently in these locations. Association rule mining is one of the popular data mining techniques that identify the correlation in various attributes of road accident. In this paper, we first applied k-means algorithm to group the accident locations into three categories, high-frequency,moderate-frequency and low-frequency accident locations.k-means algorithm takes accident frequency count as a parameter to cluster the locations. Then we used association rule mining to characterize these locations. The rules revealed different factors associated with road accidents at different locations with varying accident frequencies. Theassociation rules for high-frequency accident location disclosed that intersections on highways are more dangerous for every type of accidents. High-frequency accident locations mostly involved two-wheeler accidents at hilly regions. In moderate-frequency accident locations, colonies near local roads and intersection on highway roads are found dangerous for pedestrian hit accidents. Low-frequency accident locations are scattered throughout the district and the most of the accidents at these locations were not critical. Although the data set was limited to some selected attributes, our approach extracted some useful hidden information from the data which can be utilized to take some preventive efforts in these locations.

引用

页码：62 / 72

页数：11

共 5 条

[1] Application of classification algorithms for analysis of road safety risk factor dependencies[J] . Oh Hoon Kwon,Wonjong Rhee,Yoonjin Yoon. Accident Analysis and Prevention . 2015
[2] Analysis of traffic accident severity using Decision Rules via Decision Trees
Abellan, Joaquin
Lopez, Griselda
de Ona, Juan
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (15) : 6047 - 6054
[3] Poisson Family Regression Techniques for Prediction of Crash Counts Using Bayesian Inference[J] . C. Naveen Kumar,M. Parida,S.S. Jain. Procedia - Social and Behavioral Sciences . 2013
[4] Road traffic crashes and risk groups in India: Analysis, interpretations, and prevention strategies[J] . Raj V. Ponnaluri. IATSS Research . 2011 (2)
[5] Estimating the number of clusters in a data set via the gap statistic[J] . RobertTibshirani,GuentherWalther,TrevorHastie. Journal of the Royal Statistical Society: Series B (Statistical Methodology) . 2002 (2)

← 1 →